OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

xliff message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: Re: [xliff] How to refer to the proper CRC32 algorithm for the track changes module?


Since I do not hear any differing opinions, I believe we have consensus on this one.
@Ryan, can you please implement CRC with the ITU normative reference and Joachim's pseudocode as an example?
Would this be feasible by Thursday this week?
And would you also please check if the pseudocode does what ITU describes, while changing the spec? :-)

Thanks and regards
dF

Dr. David Filip
=======================
LRC | CNGL | LT-Web | CSIS
University of Limerick, Ireland
telephone: +353-6120-2781
cellphone: +353-86-0222-158
facsimile: +353-6120-2734
mailto: david.filip@ul.ie


On Tue, Apr 9, 2013 at 12:44 AM, Dr. David Filip <David.Filip@ul.ie> wrote:
All, do we have consensus here wrt pointing to ITU?
Do we have a volunteer (except Joachim) who would double check that the pseudocode does what ITU specifies?
@Ryan? @Fredrik?

Cheers
dF

Dr. David Filip
=======================
LRC | CNGL | LT-Web | CSIS
University of Limerick, Ireland
telephone: +353-6120-2781
cellphone: +353-86-0222-158
facsimile: +353-6120-2734


On Tue, Apr 2, 2013 at 1:13 PM, Dr. David Filip <David.Filip@ul.ie> wrote:
+1
Pointing to ITU with pseudocode as non-normative example sounds right.
Some one should double check that our example indeed behaves as the mathematical foundation states, any volunteers?

Thanks
dF


Dr. David Filip
=======================
LRC | CNGL | LT-Web | CSIS
University of Limerick, Ireland
telephone: +353-6120-2781
cellphone: +353-86-0222-158
facsimile: +353-6120-2734


On Mon, Apr 1, 2013 at 8:30 PM, Helena S Chapman <hchapman@us.ibm.com> wrote:
I prefer pointing to an external reference but what we can do is to include the pseudo code as a sample so people understand what the compliant behavior would look like.




From:        "Schurig, Joachim" <Joachim.Schurig@lionbridge.com>
To:        "xliff@lists.oasis-open.org" <xliff@lists.oasis-open.org>
Date:        03/19/2013 01:50 PM
Subject:        [xliff] How to refer to the proper CRC32 algorithm for the track changes module?
Sent by:        <xliff@lists.oasis-open.org>




Hi everyone,
 
as briefly discussed today in the call I wanted to hear your opinion how one can best refer to the CRC32 implementation I proposed to use in the Track Changes module which Ryan prepares.
 
The problem with CRCs is that there are many algorithms. For CRC32 there are still 3 used ones, using different polynomials, so one needs to refer to the right one, and then it needs to be clarified which init and exit operations are done on the calculated CRC value, because that is not determined by the polynomial.
 
Just so everyone understands: the bulk of CRC32 implementations (in ZIP, Ethernet, ..) use the same algorithm and values – but there are more. And if you look e.g. at the English Wikipedia page about CRCs (http://en.wikipedia.org/wiki/Cyclic_redundancy_check) you do not get the impression as if you can easily tell which one to pick (which is in practice not correct).
 
The initial idea was to point to an existing standard. But that is difficult, because commonly people use a reference to RFC 1952 (http://www.ietf.org/rfc/rfc1952.txt), which includes as appendix only a C code sample of an implementation. For a formal definition it refers to ISO 3309 (which is not publicly available, so I could not check the contents) and to ITU-T V.42. The latter is available, but it covers a lot more items than only the CRC32 generation, which itself it explains correctly from the mathematical level, but probably not in a way that would help the average implementer.
 
So the idea was to come up with pseudo code to make sure the right algorithm and init and exit values are understood.
 
Is this anything you would deem suitable for a standards document like the XLIFF draft? Or should we only refer to the mathematical foundations, by pointing to ITU-T V.42 (http://www.itu.int/rec/T-REC-V.42/en) ?
 
My proposed pseudo code looks like this:
 
Pseudo code to generate the CRC32 bit-by-bit looks as follows (where it is assumed that the least significant bit is the rightmost):
 
crc = 0xFFFFFFFF
for each bit in input do
  if rightmost bit in crc <> current bit in input
    crc = right shift(crc) XOR 0xEDB88320
  else
    crc = right shift(crc)
end for
crc = crc XOR 0xFFFFFFFF
 
Pseudo code to generate the CRC32 in the typical table driven approach looks as follows:
 
Table generation:
for n = 0 to 255 do
  crc = n
  for i = 0 to 7 do
    if (crc AND 1 <> 0)
      crc = right shift(crc) XOR 0xEDB88320
    else
      crc = right shift(crc)
  end for
  CRC_TABLE[n] = crc
end for
 
CRC calculation:
crc = 0xFFFFFFFF
for n = 0 to (octet count of input data)-1 do
  crc = CRC_TABLE[ (crc XOR input[n]) AND 0xFF ] XOR right shift(crc, 8)
end for
crc = crc XOR 0xFFFFFFFF
 
 
0xEDB88320 is the reverse order representation of the coefficients of the polynomial x^32+x^26+x^23+x^22+x^16+x^12+x^11+x^10+x^8+x^7+x^5+x^4+x^2+x+1
 
 
Thanks for any comments!
(and I hope the code survives the list re-send halfways formatted)
 
Joachim
 
________________________________
Joachim Schurig

Senior Technical Director,

Lionbridge Fellow

Lionbridge

1240 Route des Dolines
06560 Sophia Antipolis
France
 





[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]