[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]
Subject: Re: [docbook-apps]PDF downconversion to docBook XML
One last thing, you can extract your presentation as SVG using PDG2SVG: http://www.cityinthesky.co.uk/pdf2svg.html HTH On Mon, Jun 14, 2010 at 10:41 AM, Mathieu Malaterre <mathieu.malaterre@gmail.com> wrote: > Hi Kurt, > > From my very limited experience I found that kword did a pretty good > job at importing PDF. I used also OpenOffice to write out -poor- > docbook. > You should be able to import your PDF file directly in KWord and > write out (X)HTML file. Watch out that all your formatting will be > lost (no more title, section...). > > I used the following script to convert HTML to docbook: > > http://wiki.docbook.org/topic/Html2DocBook > > But in my case, my input HTML was -somewhat- organized. > > Good luck > > On Fri, Jun 11, 2010 at 11:31 PM, Kurt A Richardson > <kurt@iscepublishing.com> wrote: >> Hi list >> >> I am new to DocBook, and XML-based publishing in general. I run a small >> publishing company (30 titles), that specializes in complexity theory and I >> have been looking for ways to not only improve my little doc flow >> methodology, but also make our content available to our readers in a variety >> of new modes and formats. I have been drawn to DocBook and the possibility >> of using XSLT as a means to realize these goals. I have little trouble >> figuring out how to prepare new content and am hoping to produce our next >> two titles purely from DocBook XML. However, I also have about 6000 pages >> of PDFs (not all having the same format) that I'd like to 'down convert' to >> DocBook XML. I am making SLOW progress and wondered if anyone here had any >> bright ideas about how to approach this task... e.g., is PDF to html the >> best first step? Or does anyone know of any affordable services being >> provided to do the down conversion for me. >> >> Many thanks in advance for any guidance you can provide. >> >> I'm really rather excited about the possibilities that arise once I move our >> publishing from Adobe CS to XML-based! >> >> Kind regards, Kurt >> >> >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: docbook-apps-unsubscribe@lists.oasis-open.org >> For additional commands, e-mail: docbook-apps-help@lists.oasis-open.org >> >> > > > > -- > Mathieu > -- Mathieu
[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]