OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

xliff-seg message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: Re: [xliff-seg] Re: XLIFF Segmentation at trans-unit or group level



Good point, Magnus, and I agree - it will be necessary to associate the attributes of the original trans-unit with the segmentation group it's replaced by.  The only constructs in your list that I find missing in group that exist at trans-unit level are : 

    elements:  alt-trans
    attributes: approved and ts  (the latter is deprecated). 

We can safely ignore ts since it's deprecated,  and I think approved should remain at trans-unit level,  with all the individual trans-units contained  in the segment group assuming the value of the original trans-unit attribute.

How to deal with alt-trans is more complicated.  The way I see it,  once we resegment,  the alt-trans->segment relationship is no longer meaningful  (ie.,  whatever the relationship existed between alt-trans and trans-unit no longer holds,  since we didn't resegment alt-trans comparably).  So whatever meaning the alt-trans "match-quality" or "crc" attributes may have had before the resegmentation are likely to be vastly different afterwards.  But we run into the same alt-trans->segment association problem with any of the other segmentation models,  including the the proposed new segment element,  where the act of resegmentation transforms our  (many alt trans)->(one segment) logical association to a (many alt-trans)-> (many segments) logical association.    I guess if I had to list my preference for how to handle alt-trans right now,  my 1st choice is transfer to all trans-units,.  and my 2nd choice is transfer alt-trans to 1st trans-unit in group.  But if I were implementing a tool to do this work,  I'd just get rid of the old alt-trans data,  and upon resegmentation generate new alt-trans data by releveraing for each trans-unit.  But I may use alt-trans differently how others use it.

Regards,
Tony




Magnus Martikainen wrote:

Hi Tony,

 

I just thought of one more issue related to this approach, related to Yves’ comment in the meeting today about regarding segmentation as a process separate from filtering.

 

As a result of filtering (or other processing of the XLIFF file before it reaches the segmentation tool) the original <trans-unit> before segmentation may have any of the following associated information:

- Attributes: approved, translate, reformat, xml:space, datatype, ts, phase-name, restype, resname, extradata, help-id, menu, menu-option, menu-name, coord, font, css-style, style, exstyle, extype, maxbytes, minbytes, size-unit, maxheight, minheight, maxwidth, minwidth, charclass.

- Elements of type <context-group>, <count-group>, <prop-group>, <note>, <alt-trans> (appearing after the <source> and/or <target>).

- Non-XLIFF elements.

 

If we need to segment this <trans-unit> (and thus replace it with a <group> of new <trans-unit> elements) we need to decide how to handle each of these associated pieces of data. The following options come to my mind:

-          Transfer/copy to each <trans-unit>

-          Transfer to the first (or last) <trans-unit> only.

-          Associate information with the <group>

-          Various combinations of these

 

For many of these cases they only option that seems correct/safe is to associate the information with the <group>, which means that we would need to extend/modify the current definition of <group> to support almost everything that a <trans-unit> supports.

 

Cheers,

Magnus

 

 

 


From: Tony Jewtushenko [mailto:Tony.Jewtushenko@oracle.com]
Sent: Tuesday, May 25, 2004 8:14 AM
To: xliff-seg@lists.oasis-open.org
Subject: [xliff-seg] Re: XLIFF Segmentation at trans-unit or group level

 

One more Con:
7. May not be suitable for resegmenting across inline tags.


Tony Jewtushenko wrote:

Summary:

In a nutshell,  the concept behind this scenario is that segments can be defined by the hard structure of trans-units,  or optionally within groups. All segmentation functionality is offloaded to the tools that generate or process the XLIFF content,  including resegmentation.

Original Discussions:
http://www.oasis-open.org/archives/xliff-seg/200404/msg00024.html
http://www.oasis-open.org/archives/xliff-seg/200404/msg00012.html

Original XLIFF file:

<trans-unit id="1">
   <source xml:lang="en-US">Long sentence. Short sentence.</source>
   <target xml:lang="sv-SE" state="final">Lång mening. Mer mening. Kort mening.</target>
</trans-unit>
 

Which can be segmented using a tool into the following "hard" segments:

<trans-unit id="1.1">
   <source xml:lang="en-US">Long sentence.</source>
   <target xml:lang="sv-SE" state="translated">Lång mening. </target>
</trans-unit>
<trans-unit id="1.2">
   <source xml:lang="en-US">Short sentence.</source>
   <target xml:lang="sv-SE" state="translated">Kort mening.</target>
</trans-unit>

Or,  we can create segmentation groups that contain the individual segments,  and tag them with additional metadata that identifies them as segment-groups/segments:

<group extype="segment-group" id="1">
   <<trans-unit extype="segment" id="1">
   <source xml:lang="en-US">Long sentence.</source>
   <target xml:lang="sv-SE" state="translated">Lång mening.</target>
</trans-unit>
<trans-unit extype="segment-group" id="2">
   <source xml:lang="en-US">Short sentence.</source>
   <target xml:lang="sv-SE" state="translated">Kort mening.</target>
</trans-unit>
</group>

When reconciling with the skeleton file,  presumably when building out the translated file,  a tool would need to match up the segment-group with the original segment it replaces.  Similar reconciliation would be required if updating the TM.

PRO's:

  1. Uses existing XLIFF structures:  no additional changes to XLIFF specification is required.
  2. For most use cases,  very simple to implement.

CON's:

  1. Matching up resegmented data becomes tool dependent,  and may be handled differently by individual tool providers.
  2. May not be suitable or very complicated for moving segments across groups (ie, resegmenting the segment-groups).
  3. May not be suitable for processing target only changes returned by vendor.
  4. <>Implementation is non-normative,  and would require adherence to profile rather than XSD validation.
  5. Complicate process of building out translated content using skeleton files.
  6. Multiple iterations of resegmentation may create tangle of groups.
-- 
Tony Jewtushenko                              
Principal Product Manager - Oracle Application Development Tools
Oracle Corporation, Ireland
 
mailto:tony.jewtushenko@oracle.com
Direct tel: +353.1.8039080     



-- 
Tony Jewtushenko                              
Principal Product Manager - Oracle Application Development Tools
Oracle Corporation, Ireland
 
mailto:tony.jewtushenko@oracle.com
Direct tel: +353.1.8039080     

-- 
Tony Jewtushenko				
Principal Product Manager - Oracle Application Development Tools
Oracle Corporation, Ireland

mailto:tony.jewtushenko@oracle.com
Direct tel: +353.1.8039080	


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]