OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

cmis message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: [OASIS Issue Tracker] Commented: (CMIS-660) Clarification needed onthe use of quotes in a CONTAINS() query to search on phrases



    [ http://tools.oasis-open.org/issues/browse/CMIS-660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=20155#action_20155 ] 

David Choy commented on CMIS-660:
---------------------------------

As far as I can tell, the Lucene Query Parser Syntax differs from CMIS 1.0 fulltext syntax in at least the following ways:
 • Double-quotes are used for phrase.
 • Terms that are not separated by a Boolean operator are OR'ed rather than AND'ed.
 • Negation is not allowed if there is only one term.
 • Character escaping. (CMIS may have to differ in this aspect no matter what so long as the fulltext expression is embedded in a SQL statement.)

Arguments FOR adopting a subset of Lucene syntax:
 • Why invent yet another standard for fulltext query? Is our syntax better? Do we have different requirements?
 • Compatibility with Lucene would reduce confusion.
 • The semantics of the fulltext expression was not defined in v1.0 and we need to fill that hole. Adopting both Lucene's syntax and semantics could be a clean fix, and it would simplify the spec also.
 • The Lucene team probably have investigated fulltext query syntax deeper than we have.
 • For implementations that use a Lucene engine, syntax conversion for fulltext query would be unnecessary.
 • ...

Arguments AGAINST adopting Lucene syntax:
 • Backward compatibility with v1.0.
 • CMIS fulltext syntax is so simple that compatibility with Lucene is an over-concern.
 • Conversion of a CMIS fulltext expression to a Lucene query is trivial.
 • ...


> Clarification needed on the use of quotes in a CONTAINS() query to search on phrases
> ------------------------------------------------------------------------------------
>
>                 Key: CMIS-660
>                 URL: http://tools.oasis-open.org/issues/browse/CMIS-660
>             Project: OASIS Content Management Interoperability Services (CMIS) TC
>          Issue Type: Improvement
>          Components: Domain Model
>    Affects Versions: Draft 0.70
>            Reporter: Jane Doong
>            Assignee: Ethan Gur-esh
>            Priority: Minor
>
> Clarification needed on the use of quotes in a CONTAINS() query to search on phrases.
> Spec:
> 2575  BNF grammar structure:: CONTAINS ( [ <qualifier> ,] ' <text search expression> ' )
> 2413  <phrase> ::= <quote> <word> [ {<space> <word>} ... ] <quote>
> 2422  <quote> ::= "'" !! Single-quote only, consistent with SQL-92 string literal
> 2597  Within a word or phrase, each (single-)quote must also be escaped by a preceding backslash "\"
> The spec specifically states that <quote> is single-quote only.
> My questions is on specifying a phrase inside the CONTAINS(). 
> Since the entire text search expression is enclosed in single quotes, 
> I question whether a phrase should again be enclosed in single quotes, or should it be in double-quotes.
> According to spec:
>  Word search:    CONTAINS('house')
>  Phrase search : CONTAINS(' 'my house' ')
> Should phrases be in double-quotes? ==> CONTAINS(' "my house" ')

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://tools.oasis-open.org/issues/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]