Subject: Document statistics
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Greetings!

We have spoken of the need for reliable numbers on the features users want, and I think MS may have released a tool that will help us get them.

See: http://msopentech.com/blog/2014/12/19/new-open-xml-powertool-cmdlet-simplifies-retrieval-document-metrics/

Some of the information it provides:

*****

The style hierarchy: styles can inherit from other styles, and it is helpful to know what styles are defined in a document.

The content control hierarchy. We can examine the hierarchy, and design an XSD schema to validate them.

The list of languages used in a document, such as en-US, fr-FR, and so on.

Whether a document contains tracked revisions, text boxes, complex fields, simple fields, altChunk content, tables, hyperlinks, legacy frames, ActiveX controls, sub documents, references to null images, embedded spreadsheets, document protection, multi-font runs, the list of numbering formats used, and more.

Metrics on how large the document is, including element counts, average paragraph lengths, run count, zero length text elements, ASCII character counts, complex script character counts, East Asia character counts, and the count of runs of each of the variety of characters.

*****

Think of advanced feature use as a measure of transition pain: the more advanced features a document relies on, the harder it is to move to another format.

Hope everyone is at the start of a great week!

Patrick

PS: Any suggestions for DOCX repositories in general, or should I search the Common Crawl archives and simply create such a repository?

- -- 
Patrick Durusau
patrick@durusau.net
Technical Advisory Board, OASIS (TAB)
Co-Chair, OpenDocument Format TC (OASIS)
Editor, OpenDocument Format TC, Project Editor ISO/IEC 26300
Former Chair, V1 - US TAG to JTC 1/SC 34
Convener, JTC 1/SC 34/WG 3 (Topic Maps)
Co-Editor, ISO 13250-5 (Topic Maps)

Another Word For It (blog): http://tm.durusau.net
Homepage: http://www.durusau.net
Twitter: patrickDurusau
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.11 (GNU/Linux)

iQIcBAEBAgAGBQJUl4VoAAoJEAudyeI2QFGoRikQAN+rfWPQma1yC1KIkzL9ZmFf
0s4z64m1Gp0FD8zVKIJYiHlX0XwpuHOJqxq/ObQTSjd9/szd9WtGjyFxN5APmjBT
KJSPhXEG5AZJs5Yl5LPBLPQd5HW0ag9hmbMwXTU8L4aQlw8hwNqHRMfT/sX2ClkS
I1pOXGJBdi15a9tZZ6+Hcz4HVc3wVgFq5kLYVyQJ+Qox5iDXlSNh97MYm/nDfA7U
POHiq6rOqVERmNqQoLVIzKwv9MYwVNPpSEJDzypbFvIomP5EnAFQEJMskVhqi2yH
wL34XcyleyD31i78KkMmT0K1IE65E+darbvDsZIz3QhWDtHbhTT1j4mjzpinqx+x
brh2FxVhjJPvCESWozDqGSc/QLNj3AYyXNG87evgWCjL7TAALIxd1gAeTJNjeg8M
Of6wI45bSNaf6VaYKVnuGrbPLlJaTo6UONNdNpktik5igLx7YjMj5DioBr88CtT1
15bVRZ7dh+pF76+EVHbh27rAm3RAfwz0/eh+xExYByPLlNMcYs+hB+Nh8lelxHjo
DsyKVU1kd/GkJbd3ya5GTdnNezP9yw5F9RL3FkZ2Vf9fsl8ZfdjYlCoBFmKck2kO
5nzcUvS85VkgNretUihc7PzVNDtykKlQjUT6TXPc4SXkH14fpaDBXsNC662Cyljq
z1WCUphSj7senoBj+P5r
=gd9d
-----END PGP SIGNATURE-----
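
A few of the metrics listed in the message can be approximated without the PowerTools cmdlet, which may help when surveying a large DOCX corpus. The following is a minimal Python sketch, not the PowerTools implementation: it reads only word/document.xml, treats w:ins and w:del as the sole signs of tracked revisions, and ignores styles.xml, headers, and footers. The function name docx_metrics and the keys of the returned dictionary are hypothetical choices made for illustration.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by elements in word/document.xml.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def _tag(name):
    """Return the namespace-qualified ElementTree name for a w: element or attribute."""
    return f"{{{W}}}{name}"


def _count(root, name):
    """Count every descendant element with the given w: name."""
    return sum(1 for _ in root.iter(_tag(name)))


def docx_metrics(source):
    """Collect rough feature metrics from a .docx file.

    `source` is a file path or a binary file-like object.
    Only the main document part is inspected (an assumption of this sketch).
    """
    with zipfile.ZipFile(source) as package:
        root = ET.fromstring(package.read("word/document.xml"))

    texts = list(root.iter(_tag("t")))
    paragraph_count = _count(root, "p")
    total_chars = sum(len(t.text or "") for t in texts)

    # w:lang may carry Latin, East Asian, and bidirectional language tags.
    languages = set()
    for lang in root.iter(_tag("lang")):
        for attr in ("val", "eastAsia", "bidi"):
            value = lang.get(_tag(attr))
            if value:
                languages.add(value)

    return {
        "paragraphs": paragraph_count,
        "runs": _count(root, "r"),
        "tables": _count(root, "tbl"),
        "hyperlinks": _count(root, "hyperlink"),
        "zero_length_text": sum(1 for t in texts if not t.text),
        "ascii_chars": sum(
            1 for t in texts for ch in (t.text or "") if ord(ch) < 128
        ),
        "avg_paragraph_length": (
            total_chars / paragraph_count if paragraph_count else 0.0
        ),
        # Simplification: only inserted and deleted runs are checked.
        "has_tracked_revisions": (
            _count(root, "ins") > 0 or _count(root, "del") > 0
        ),
        "languages": sorted(languages),
    }
```

Calling docx_metrics("sample.docx") on a local file (the file name is a placeholder) returns a dictionary, so results from many documents can be collected into a list and tallied to see how often each feature appears.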