OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

topicmaps-comment message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [Elist Home]


Subject: Re: [topicmaps-comment] multilingual thesaurus - language, scope,and topic naming constraint


Kal

> This is a classic example of the TNC causing a problem - the fact is that
> in certain languages, the same word will be used to signify multiple
> concepts. In this case, it seems obvious to me that the experts who created
> the thesaurus have identified two distinct abstract concepts and I would
> argue that it is important to retain them both.

I have no choice. I don't want to "interpret" that thesaurus, just "translate" it in XTM.

> 1) Select one language for the basename, and make all others variants -
> this is undesirable because if you choose Spanish as your "base" language
> you will still end up with an undesired merge.

Exactly.

> 2) If the thesaurus provides any notion of different contexts for the two
> definitions, use that context to create  a scope for all base names.

No context. The "themes" and "groups" used to classify the descriptors are the same ...
because the two concepts are definitely very close ...

> 3) Don't use a base name! Just make your name strings occurrences and scope
> those by language.

That's a stealth to escape the problem ... doable, but I don't really feel like it.
Of course you can do all sorts of things like that, even without scoping, like replace

<baseName>
<scope>
<topicRef xlink:href="#spanish"/>
</scope>
<baseNameString>economía</baseNameString>
</baseName>

by (completely equivalent IMO)

<occurrence>
<instanceOf>
<topicRef xlink:href="#spanish-name"/>
</instanceOf>
<resourceData>economía</resourceData>
</occurrence>

 ... and if you do that everywhere you have a name, you get definitely rid of the TNC,
since all your <baseName> are turned into <occurrence> . And in a sense, it is clearer in
that case. You say what *type* of name you use (the spanish one). This is yet another
example where difference between scoping and typing is blurred ...

> 4) Use some (probably unattractive) unique identifier as the basename (e.g.
> the thesaurus entry number suitably prefixed). Then add the entries as
> variants scoped as XTM-displayable, XTM-sortable and by language.

They are definitely not "variants". There again, it's a technical stealth ...

> 5) Don't apply the TNC...sorry, couldn't resist that one ;-)

In fact, that's what we're gonna do to begin with ;-)

Thanks for your thoughts

Bernard



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [Elist Home]


Powered by eList eXpress LLC