[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]
Subject: RE: [xliff-comment] using xliff to translate html
Yves, Yves Savourel wrote: > ... > Just to add more on the discussion, I've attached is an > example of HTML extracted to XLIFF: > > - TestPage.htm is the original file. > - TestPage.xlf is the XLIFF extraction. Thank you for the examples. If you don't mind I'd like to discuss a couple of items. Readability of the inline marked up text ======================================== The inline markup seems to add a lot of bulk to the text. Consider this paragraph from TestPage.htm: <p>Some text in <b>bold</b>, <i>italic</i>, <b><i> bold and italic</i></b>, and a <a href="#topoffile">link</a>.</p> The inline marked up text in TestPage.htm becomes(1): <source xml:lang='en'>Some text in <bpt id='1' ctype='bold'><b></bpt>bold<ept id='1'></b></ept>, <bpt id='2' ctype='italic'><i></bpt>italic <ept id='2'></i></ept>, <bpt id='3' ctype='bold'><b></bpt> <bpt id='4' ctype='italic'><i></bpt> bold and italic <ept id='4'></i></ept><ept id='3'></b></ept>, and a <bpt id='5' ctype='link'><a href="#topoffile"></bpt>link <ept id='5'></a></ept>.</source> Its my current thinking that the original version is fairly readable and I'd be comfortable presenting this to a localizer. I am very concerned that the inline marked up version is a bit hard to read and I am concerned that it is not appropriate for a localizer to work with. It would be possible for a tool to convert the data back to the more readable form. However, unless this is a requirement of XLIFF I would expect that we will see at least a generation of tools that present the inline marked up form directly to the translator. Using the <g> tag =============================================== Using the <g> tag this text: <p>Some text in <b>bold</b>, <i>italic</i>, <b><i> bold and italic</i></b>, and a <a href="#topoffile">link</a>.</p> would become (something like) this(2): <g xid="g1">Some text in <g xid="g2">bold</g>, <g xid="g3">italic</g>, <g xid="g2"><g xid="g3"> bold and italic</g></g>, and a <g xid="g4">link</g>.</g> <trans-unit id='g1'> <source xml:lang='en'>p</source> </trans-unit> <trans-unit id='g2'> <source xml:lang='en'>b</source> </trans-unit> <trans-unit id='g3'> <source xml:lang='en'>i</source> </trans-unit> <trans-unit id='g4'> <source xml:lang='en'>a href="#topoffile"</source> </trans-unit> </trans-unit> I like the <g> tag is a lot less bulky than the <it>,<bpt>,<ept> tags but then the meaning of the tag is removed away from the source element. Notes: (1) For this discussion let's ignore the bracketing <p> tags. (2) Yves, perhaps you could show the proper form. -- Brian Stell AIM: brianstell00 (650) 937-2797
[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]