docbook message

Subject: Re: [docbook] Biblioentry markup standards -- identifying the type of entry

From: Peter Flynn <peter@silmaril.ie>
To: docbook@lists.oasis-open.org
Date: Thu, 11 Jun 2020 13:06:24 +0100

On 11/06/2020 01:55, Richard Hamilton wrote:

Regarding LaTeX, do you use this tool chain in place of the DocBook
XSLT .fo/.html stylesheets or do you integrate the two somehow?

I was using LaTeX before FO, so we stuck with it; so the XSLT is all our
own work.

If it's the former, that's not going to work for us because we have
stylesheet customizations for both .fo and epub that I wouldn't want
to try and duplicate in LaTeX.

Then definitely stick with what you've got. If you're outputting HTML
and EPUB, LaTeX is no use to you, except for one thing: bibliographic
references (more on this below).

Regarding marking the type on biblioentry, I agree that this should
be done on the top-level biblioentry element. I was wondering about
using the @typeof RDF attribute, but that seems to be abusing RDF
markup (as opposed to abusing @role:-). The one advantage of abusing
@role is that the existing iso690 customization uses @role.


Yep, stick with it. I used @type because I wanted the token list so
that it provided an extra safety blanket.

The list of biblatex types you pointed me to looks comprehensive. It
certainly didn't miss any type that I would ever use. I could see
using that list, though I'm not sure I'd want to try and implement
all of them.

Article, book, inbook, incollection, manual, and thesis cover most of
what we need. Occasionally 'misc' :-)

I wasn't thinking of requiring biblioset on both parts of an
inclusion.

Only for those needing the separation. DocBook has no journaltitle or
booktitle element type, so if you use title for the title of the
article, what do you then use for the title of the journal? I guess you
could use titleabbrev (we use it for the biblatex shorttitle field), or
you could use citetitle (we use it for titles-within-titles).

The biblatex types probably make that unambiguous, but I had been
thinking that the biblioentry "type" would refer to the actual thing
you're referring to (article, etc.), and biblioset would be used for
the publication it was included in (journal, proceedings, etc).


Yes, you could use just one biblioset, and everything outside it is
the parent or child document.
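
As a rough illustration only (Python rather than our XSLT, and not the
code we actually run), here is one way an entry with @role on the
top-level biblioentry and a single biblioset for the containing
publication might map onto biblatex fields. The sample entry, the
un-namespaced DocBook 4 style markup with @id, the ALLOWED_TYPES token
list, and the names author_list and to_bibtex are all assumptions made
up for this sketch.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: @role carries the biblatex entry type, and the one
# biblioset holds the containing publication (here, the journal).
SAMPLE = """
<biblioentry role="article" id="doe2019">
  <authorgroup>
    <author><firstname>Jane</firstname><surname>Doe</surname></author>
    <author><firstname>John</firstname><surname>Roe</surname></author>
  </authorgroup>
  <title>An Example Article</title>
  <biblioset relation="journal">
    <title>Journal of Examples</title>
    <volumenum>12</volumenum>
    <pubdate>2019</pubdate>
  </biblioset>
</biblioentry>
""".strip()

# Assumed token list: the handful of biblatex types mentioned in this thread.
ALLOWED_TYPES = {"article", "book", "inbook", "incollection",
                 "manual", "thesis", "misc"}


def author_list(entry):
    """Join the authors inside authorgroup as 'Surname, Firstname and ...'."""
    names = []
    for author in entry.findall("./authorgroup/author"):
        first = author.findtext("firstname", default="").strip()
        last = author.findtext("surname", default="").strip()
        names.append(f"{last}, {first}" if first else last)
    return " and ".join(names)


def to_bibtex(entry):
    """Render one biblioentry element as a biblatex-flavoured .bib entry."""
    entry_type = entry.get("role")
    if entry_type not in ALLOWED_TYPES:
        raise ValueError(f"unknown entry type: {entry_type!r}")
    # Direct children describe the thing cited; biblioset describes its container.
    fields = {
        "author": author_list(entry),
        "title": entry.findtext("title", default=""),
    }
    container = entry.find("biblioset")
    if container is not None:
        key = "journaltitle" if entry_type == "article" else "booktitle"
        fields[key] = container.findtext("title", default="")
        fields["volume"] = container.findtext("volumenum", default="")
        fields["date"] = container.findtext("pubdate", default="")
    body = ",\n".join(f"  {k} = {{{v}}}" for k, v in fields.items() if v)
    return f"@{entry_type}{{{entry.get('id')},\n{body}\n}}"


if __name__ == "__main__":
    print(to_bibtex(ET.fromstring(SAMPLE)))
```

The token check plays the same safety-blanket role as a declared token
list on the attribute: a mistyped type fails loudly instead of producing
a silently wrong entry.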

What is the advantage of enclosing authors, editors, etc., in
authorgroup? Is it just to make the processing easier?

Always :-) Especially if you're *not* using biber, which does a good job
on multiple names. One of the many good things in Eve Maler and Jeanne
El Andaloussi's book on designing SGML DTDs all those years ago was
their insistence that if you had more than one consecutive occurrence of
something, you almost certainly needed to put it in a wrapper :-)

Now: HTML (and EPUB XHTML). Bib reference formats are picky, to say the
least. The advantage of biblatex is that it already knows the formats,
and they have been written by people in the field, and used by millions
of users, so they are for all practical purposes correct, which means
you don't need to write anything.

But within a fully-formatted entry there are still three typographic
considerations apart from punctuation, all of them fonts: bold, italic,
and smallcaps are most of what the publishers want. Rather than implement
dozens of conflicting XSLT nests of choose-when and if, the lazy way is
to output the entries to BibTeX format, run an 8-line LaTeX document
over it, through biber, lather, rinse, repeat, and create the PDF.


\documentclass{article}
\usepackage[backend=biber,style=authoryear]{biblatex}
\usepackage[a4paper,margin=2cm,nohead,nofoot]{geometry}
\addbibresource{client.bib}
\begin{document}
\nocite{*}
\printbibliography
\end{document}

Then push the PDF through Apache PDFBox ExtractText -html and the output
will preserve case, punctuation, and bold and italics in all the right
places. Sadly, PDFBox doesn't understand smallcaps: that does have to be
flagged somehow if you are using a format that requires it.
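
For anyone wanting to script the whole round trip, a minimal sketch of
the command sequence follows. It is an illustration under assumptions,
not our production setup: the file stem "refs", the jar name
pdfbox-app.jar, and the function names are hypothetical, the
pdflatex/biber/pdflatex/pdflatex order is just the usual reading of
"lather, rinse, repeat", and the ExtractText invocation follows the
PDFBox 2.x command-line app (later versions spell the subcommand
differently).

```python
import subprocess


def build_commands(stem, pdfbox_jar="pdfbox-app.jar"):
    """Return the argv lists for the LaTeX -> biber -> PDF -> HTML round trip.

    `stem` is the LaTeX file name without extension (hypothetical, e.g. "refs").
    `pdfbox_jar` is an assumed path to the PDFBox 2.x command-line jar.
    """
    latex = ["pdflatex", "-interaction=nonstopmode", stem]
    return [
        latex,                      # first pass: record the \nocite{*} requests
        ["biber", stem],            # sort and format the entries
        latex,                      # second pass: typeset the bibliography
        latex,                      # third pass: settle any remaining references
        ["java", "-jar", pdfbox_jar, "ExtractText", "-html",
         f"{stem}.pdf", f"{stem}.html"],
    ]


def run_pipeline(stem, pdfbox_jar="pdfbox-app.jar"):
    """Run each step in order, stopping at the first one that fails."""
    for command in build_commands(stem, pdfbox_jar):
        subprocess.run(command, check=True)


if __name__ == "__main__":
    # Print the planned commands rather than running them.
    for command in build_commands("refs"):
        print(" ".join(command))
```

Keeping the command list separate from the runner makes it easy to
inspect or log the steps before anything is executed.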

But it's not for everyone, and if you already have code implementing the
different formats, I'd stick with it.


Peter

References:
- Biblioentry markup standards -- identifying the type of entry
  - From: Richard Hamilton <hamilton@xmlpress.net>
- Re: [docbook] Biblioentry markup standards -- identifying the type of entry
  - From: Peter Flynn <peter@silmaril.ie>
- Re: [docbook] Biblioentry markup standards -- identifying the type of entry
  - From: Richard Hamilton <hamilton@xmlpress.net>