Question

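I have a problem that I am struggling to solve. I installed Solr on my Debian 9 virtual machine in order to test it with a large volume of data. Specifically, I use the script

bin/post

to upload all my PDF and DOC files into my collection in Solr. My problem is that when I query the content, the result is the entire content of the field rather than an indication of where the match is. I would like the query to tell me the exact position of the word or sentence.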

For example:

"content":["This is my \nfirst test\n This is my \nfirst test\n This is my \nfirst test\n This is my \nfirst test\n This is my \nfirst test \n"], "metadata.X-Parsed-By":["org.apache.tika.parser.DefaultParser", "org.apache.tika.parser.txt.TXTParser"], "metadata.Content-Encoding":["ISO-8859-1"], "metadata.resourceName":["tika_extract.py"], "metadata.Content-Type":["text/x-python; charset=ISO-8859-1"], "id":"4ebba9d3-239c-494d-9a1f-9ec16610530b", "_version_":1630918290423414784},

Query:

{content: "test"}
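For reference, a query like this one could be sent to Solr's /select handler roughly as sketched below. The host, port, and the collection name `mycollection` are assumptions (they are not given in the question), and `build_select_url` is a hypothetical helper. The optional `hl` and `hl.fl` parameters are Solr's standard highlighting parameters, which return snippets around matches; whether they are enough for this goal is an open question.

```python
from urllib.parse import urlencode

# Assumed values, not taken from the question: host, port and collection name.
SOLR_BASE = "http://localhost:8983/solr"
COLLECTION = "mycollection"  # hypothetical collection name


def build_select_url(field, term, highlight=False):
    """Build a /select URL that searches `field` for `term`."""
    params = {"q": f"{field}:{term}", "wt": "json"}
    if highlight:
        # Standard Solr highlighting parameters: ask for snippets around matches
        params["hl"] = "true"
        params["hl.fl"] = field
    return f"{SOLR_BASE}/{COLLECTION}/select?{urlencode(params)}"


print(build_select_url("content", "test"))
print(build_select_url("content", "test", highlight=True))
```

With highlighting enabled, the response should carry an extra `highlighting` section keyed by document id, in addition to the usual document list.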

My current result is:

"content":["This is my \nfirst test\n This is my \nfirst test\n This is my \nfirst test\n This is my \nfirst test\n This is my \nfirst test \n"]

I need this result:

"content":["This is my \nfirst test\n This is my \nfirst test\n This is my \nfirst test\n This is my \nfirst test\n This is my \nfirst test \n"]

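My main goal is simply to know the position of the matching word within the content, or the whole sentence that contains it; other alternatives are not a problem.

I hope I have explained my problem well.

How can I find the exact position of a word in a Solr field through a query?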
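To make "position" concrete, here is a minimal client-side sketch of the kind of information I am after, computed in Python on the returned `content` string. It is not a Solr feature, only an illustration; `find_positions` is a hypothetical helper and the sample string is a shortened version of the content shown earlier.

```python
import re


def find_positions(text, word):
    """Return (start_offset, line_number) for each whole-word match of `word`."""
    positions = []
    for match in re.finditer(rf"\b{re.escape(word)}\b", text):
        start = match.start()
        # Line number is 1-based: count the newlines that precede the match
        line_no = text.count("\n", 0, start) + 1
        positions.append((start, line_no))
    return positions


# Shortened sample of the indexed content from the example document
content = "This is my \nfirst test\n This is my \nfirst test\n"
print(find_positions(content, "test"))
```

Each tuple holds the character offset of a match and the line it falls on, which is the sort of answer I would like to get from the query itself.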

0 answers: