OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

xliff message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]

Subject: RE: [xliff] Event "XLIFF TC Call" - Summary

Hi Yves,

In Swordfish your sample text is extracted like this:

<source xml:lang="en">The title says "<ph id="0" ctype="x-other">&lt;span dir="rtl"&gt;</ph>פעילות הבינאום, W3C<ph id="1">&lt;/span&gt;</ph>" in Hebrew.</source>

Rodolfo M. Raya       rmraya@maxprograms.com
Maxprograms       http://www.maxprograms.com

-----Original Message-----
From: xliff@lists.oasis-open.org [mailto:xliff@lists.oasis-open.org] On Behalf Of Yves Savourel
Sent: Wednesday, October 19, 2011 9:55 AM
To: xliff@lists.oasis-open.org
Subject: RE: [xliff] Event "XLIFF TC Call" - Summary

Hi Rodolfo,

> Swordfish analyzes the text to translate and renders it using the BiDi 
> algorithm from Unicode. The program doesn't need a hint from the XLIFF 
> markup to render mixed text.

So Swordfish uses Unicode bidi characters control for hints (like Trados and Lionbridge's editors). Sounds fine to me.

One follow up questions then (and not just for Rodolfo):

When extracting a text like this:

<p>The title says "<span dir="rtl">פעילות הבינאום, W3C</span>" in Hebrew.</p>

Where the "W3C, " part should be on the left side of the quoted text when editing/rendering properly. As shown in the screenshot attached.

How does Swordfish (or other tools) stores the original HTML codes and markup the text?
Do they store the <span> as like any other elements and just add U+202B and U+202C to the text? Something like this:

<source> The title says "<g id="1">[U+202B]פעילות הבינאום, W3C[U+202C]</g>" in Hebrew.</span>


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]