

Subject: RE: [docbook-apps] RE: Converting MS Word documents to DocBook 5 XML


Thanks, I appreciate the tip. I gave it a try, and Herold works wonderfully!

 

Regards,

Jeff

 

From: Michael Fuchs [mailto:mlist@dbdoclet.org]
Sent: Thursday, May 24, 2012 1:19 AM
To: Jeff Powanda
Cc: 'docbook-apps@lists.oasis-open.org'
Subject: Re: [docbook-apps] RE: Converting MS Word documents to DocBook 5 XML

 

Hello Jeff,

To transform HTML to DocBook 5, please use http://www.dbdoclet.org/archives/herold-6_0_1-68.exe. Save your Word document as "filtered HTML", then run herold --profile "C:\Program Files (x86)\Herold\profiles\word.her" -i <Document.htm> (the profile path contains spaces, so it has to be quoted). I tested this procedure with Word 2003 documents, so if you encounter any problems, please let me know.
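
Since there are several documents to convert, the herold call described here could be wrapped in a small script. The following is only a sketch, not part of Herold itself: it assumes that "herold" resolves to an executable on the PATH, it uses only the two options quoted in this message (--profile and -i), and the function names are made up for illustration. Passing the arguments as a list sidesteps the quoting problem caused by the spaces in the profile path.

```python
import subprocess
import sys

# Profile path as given in the message; adjust it to the local installation.
WORD_PROFILE = r"C:\Program Files (x86)\Herold\profiles\word.her"


def build_herold_command(html_file, profile=WORD_PROFILE, exe="herold"):
    """Return the argument list for a single herold run.

    Only the options quoted in the message (--profile and -i) are used.
    The executable name "herold" is an assumption about the installation.
    """
    return [exe, "--profile", profile, "-i", html_file]


def convert_all(html_files):
    """Run herold once per filtered-HTML file; stop at the first failure."""
    for html_file in html_files:
        # check=True raises CalledProcessError on a non-zero exit status.
        subprocess.run(build_herold_command(html_file), check=True)


if __name__ == "__main__":
    # Usage: python convert_word_html.py Chapter1.htm Chapter2.htm
    convert_all(sys.argv[1:])
```

Each file named on the command line is converted in turn; where the output ends up depends on herold's defaults, which this sketch does not try to control.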

Regards,
Michael Fuchs
http://www.dbdoclet.org

On 24.05.2012 08:12, Jeff Powanda wrote:

Sorry, just saw there was a recent post about saving Word to HTML and then using dbdoclet to convert to DocBook XML. I’ll give that a try.

 

Regards,

Jeff Powanda

Vocera Communications, Inc.

 

From: Jeff Powanda [mailto:jpowanda@vocera.com]
Sent: Wednesday, May 23, 2012 10:39 PM
To: 'docbook-apps@lists.oasis-open.org'
Subject: [docbook-apps] Converting MS Word documents to DocBook 5 XML

 

What’s the easiest way to convert MS Word 2007 documents to DocBook 5 XML?

 

I’ve tried using the DocBook roundtrip stylesheets. They seemed to work OK if I did the following:

1. Copied the DocBook styles in template.dot to the document.

2. Applied the DocBook styles to the document.

3. Saved the document as a Word 2003 XML file.

4. Converted the Word 2003 XML file to DocBook 5 XML.

 

This worked OK, but it was a lot of work to apply the DocBook styles to the document (and there are several documents to convert). Also, the resulting DocBook XML file has dbk namespace prefixes on all the elements. How do I remove them?
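
One possible way to drop the dbk prefixes (a sketch, not something suggested in this thread) is to re-serialize the file with the DocBook namespace declared as the default namespace. The example below uses Python's standard xml.etree.ElementTree and assumes that every element is in the DocBook 5 namespace http://docbook.org/ns/docbook; the function name is made up for illustration. Note that ElementTree drops comments when parsing and that register_namespace changes process-wide state.

```python
import xml.etree.ElementTree as ET

DOCBOOK_NS = "http://docbook.org/ns/docbook"
XLINK_NS = "http://www.w3.org/1999/xlink"


def strip_docbook_prefix(xml_text):
    """Re-serialize DocBook XML so the DocBook namespace becomes the default.

    Assumes all elements are in the DocBook namespace; elements in no
    namespace would wrongly end up in it after this rewrite.
    """
    # An empty prefix tells the serializer to emit xmlns="..." and
    # unprefixed element names for this namespace.
    ET.register_namespace("", DOCBOOK_NS)
    # Keep a readable prefix for XLink attributes instead of ns0, ns1, ...
    ET.register_namespace("xlink", XLINK_NS)
    root = ET.fromstring(xml_text)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    sample = (
        '<dbk:article xmlns:dbk="http://docbook.org/ns/docbook" version="5.0">'
        "<dbk:title>Sample</dbk:title>"
        "<dbk:para>Hello</dbk:para>"
        "</dbk:article>"
    )
    print(strip_docbook_prefix(sample))
```

The element names stay in the same namespace, so the result is equivalent to the prefixed file for any namespace-aware tool; only the spelling of the names changes.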

 

I’m not interested in the roundtrip aspect of the roundtrip stylesheets. I just want to get Word content into DocBook 5.

 

Regards,

Jeff Powanda

Vocera Communications, Inc.

 

 


