Question

websolr is returning

 RSolr::Error::Http - 400 Bad Request
Error: <html><head><title>Apache Tomcat/6.0.28 - Error report</title><style><!--H1 {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:22px;} H2 {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:16px;} H3 {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:14px;} BODY {font-family:Tahoma,Arial,sans-serif;color:black;background-color:white;} B {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;} P {font-family:Tahoma,Arial,sans-serif;background:white;color:black;font-size:12px;}A {color : black;}A.name {color : black;}HR {color : #525D76;}--></style> </head><body><h1>HTTP Status 400 - org.apache.lucene.queryParser.ParseException: Cannot parse '----': Encountered &quot; &quot;-&quot; &quot;- &quot;&quot; at line 1, column 1.
Was expecting one of:
   &quot;(&quot; ...
   &quot;*&quot; ...
   <QUOTED> ...
   <TERM> ...
   <PREFIXTERM> ...
   <WILDTERM> ...
   &quot;[&quot; ...
   &quot;{&quot; ...
   <NUMBER> ...

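when I try to search for the "-" character.

Other special characters, such as ":", work fine. I tried using CGI.escape, but it does not escape these characters.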

Answer 1

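Have you tried escaping it with a backslash?

Typically, when you index your documents, the tokenizer strips dash characters on its own, so you may simply want to remove the dashes, unless you intend them as a negative query.

The full Solr query syntax is documented at: http://wiki.apache.org/solr/SolrQuerySyntax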
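As a rough sketch of the "just remove the dash" idea, the snippet below replaces dashes with spaces but keeps a dash that looks like a negation operator (one that starts a word). The helper name `strip_literal_dashes` and the rule for recognizing a negation are assumptions made for illustration, not anything provided by websolr or RSolr.

```ruby
# Hypothetical helper: drop dashes the tokenizer would discard anyway,
# but keep a leading "-" directly in front of a word (negative query).
def strip_literal_dashes(text)
  cleaned = text.gsub(/(?<!\S)-(?=\w)|(-)/) do
    # $1 is set only when the second alternative (a plain dash) matched
    $1 ? " " : "-"
  end
  cleaned.squeeze(" ").strip
end

puts strip_literal_dashes("wi-fi")      # prints: wi fi
puts strip_literal_dashes("foo -bar")   # prints: foo -bar
puts strip_literal_dashes("----").empty?  # prints: true
```

If the result is empty, as with "----", it is probably simplest to skip the search request altogether.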

Answer 2

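As Chris correctly points out, you need to escape it with a backslash.

Depending on the query parser you are using, certain special characters carry meaning. At the time of writing, the Lucene (and therefore Solr) query parser gives special meaning to these characters: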

+ - && || ! ( ) { } [ ] ^ " ~ * ? : \
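A minimal sketch of backslash-escaping the characters listed above before sending user input to Solr. The helper name `escape_lucene` is hypothetical, and it escapes `&` and `|` individually, which is one reasonable way to neutralize `&&` and `||`.

```ruby
# Hypothetical helper (not part of RSolr): prefix every character that the
# Lucene query parser treats specially with a backslash.
LUCENE_SPECIAL = /[+\-!(){}\[\]^"~*?:\\&|]/

def escape_lucene(text)
  # Block form avoids backslash handling quirks in gsub replacement strings.
  text.gsub(LUCENE_SPECIAL) { |match| "\\#{match}" }
end

puts escape_lucene("----")   # prints: \-\-\-\-
puts escape_lucene("a:b")    # prints: a\:b
```

Note that this is different from CGI.escape, which URL-encodes a string for transport and does not know about query parser syntax.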

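You should refer to the Lucene query parser syntax documentation to learn what each of them means. The default Solr query parser provides a superset of the Lucene query parser syntax, as described on the SolrQueryParser wiki page.

If you would rather not worry about escaping, the DisMax Query Parser is designed to accept input closer to what a user might type into a search box. I have not tested it against the various special cases recently, but in general it tends to handle such input more gracefully.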
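For illustration, selecting DisMax usually comes down to request parameters. The sketch below only builds the parameter hash; the helper name `dismax_params` and the field names `title` and `body` are assumptions, and whether a query such as "----" returns anything useful would still need to be checked against your index.

```ruby
# Hypothetical helper: build request parameters that route raw user input
# through the DisMax parser instead of the default Lucene parser.
def dismax_params(user_input, fields)
  {
    q: user_input,         # raw input, no manual escaping applied
    defType: "dismax",     # Solr parameter that selects the DisMax parser
    qf: fields.join(" ")   # DisMax "query fields"; field names are assumed
  }
end

params = dismax_params("----", %w[title body])
# With RSolr, a hash like this would typically be sent as:
#   solr.get("select", params: params)
p params
```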

Searching for "-" in websolr
