[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]
Subject: RE: [oic] xml element/attribute coverage/analysis
I want to add that it is important to know attributes by element in many cases, because the semantics are element specific. So there are a number of slice-and-dice cases that will be of interest. Meanwhile, I think this is a great step. Thanks, Michael. - Dennis -----Original Message----- From: robert_weir@us.ibm.com [mailto:robert_weir@us.ibm.com] Sent: Friday, February 27, 2009 08:33 To: Hanssens Bart Cc: jiamingf@cn.ibm.com; michael.brauer@sun.com; oic@lists.oasis-open.org Subject: Re: [oic] xml element/attribute coverage/analysis "Hanssens Bart" <Bart.Hanssens@fedict.be> wrote on 02/27/2009 04:02:56 AM: > > > > 1) a tool that would process a directory of ODF-documents, counting the > elements, attributes and attribute values being used in these documents > > Using a high-level ODF library like jOpenDocument, it should be fairly > easy to generate a report, for example in ODS, saying something like > > * text:p, used 716547 times, found in 685 documents (out of 687 docs) > * meta:user-defined, used 53 times, found in 43 documents (out of 687) > I have some python code that I should be able to adapt to do this. But note that there are probably two metrics of interest: 1) Which ODF "features" (elements/attributes/attribute values)are used most frequently by raw counts? 2) Which features are used in most documents? It is possible that a particular element is used only once per document, but is used in every ODF document. If so it is important, since everyone will need to understand it. In other cases, a particular element will be repeated a thousand times when it is used, but is used only in rare documents. So there are two different kinds of "popularity" here. Also, does anyone have a large collection of ODF documents that could be tested in this way? I suppose, even if you had such documents internally,you could use the python script to do the study and just upload the results. This is easier than uploading the documents themselves. -Rob --------------------------------------------------------------------- To unsubscribe from this mail list, you must leave the OASIS TC that generates this mail. Follow this link to all your TCs in OASIS at: https://www.oasis-open.org/apps/org/workgroup/portal/my_workgroups.php
[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]