Lucene.net字段包含多个值以及要搜索的人员

时间:2012-06-20 21:07:36

标签: c# lucene.net

任何人都知道搜索包含多个值的字段的最佳方法是什么?

string tagString = "";
foreach(var tag in tags)
{
    tagString = tagString += ":" + tag;
}
doc.Field(new Field("Tags", tagString, Field.Store.YES, Field.Index.Analyzed);

假设我想搜索所有标记为“csharp”的文档,我可以最好地实现这个标记吗?

1 个答案:

答案 0 :(得分:6)

我认为您要找的是为一个Document添加多个具有相同名称的字段。

您所做的就是创建一个Document并为其添加多个标记Field

RAMDirectory ramDir = new RAMDirectory();

IndexWriter writer = new IndexWriter(ramDir, new StandardAnalyzer(Lucene.Net.Util.Version.LUCENE_29));


Document doc = new Document();
Field tags = null;

string [] articleTags = new string[] {"C#", "WPF", "Lucene" };
foreach (string tag in articleTags)
{   
    // adds a field with same name multiple times to the same document
    tags = new Field("tags", tag, Field.Store.YES, Field.Index.NOT_ANALYZED);
    doc.Add(tags);
}

writer.AddDocument(doc);
writer.Commit();

// search
IndexReader reader = writer.GetReader();
IndexSearcher searcher = new IndexSearcher(reader);

// use an analyzer that treats the tags field as a Keyword (Not Analyzed)
PerFieldAnalyzerWrapper aw = new PerFieldAnalyzerWrapper(new StandardAnalyzer(Lucene.Net.Util.Version.LUCENE_29));
aw.AddAnalyzer("tags", new KeywordAnalyzer());

QueryParser qp = new QueryParser(Lucene.Net.Util.Version.LUCENE_29, "tags", aw);

Query q = qp.Parse("+WPF +Lucene");
TopDocs docs = searcher.Search(q, null, 100);
Console.WriteLine(docs.totalHits); // 1 hit

q = qp.Parse("+WCF +Lucene");
docs = searcher.Search(q, null, 100);
Console.WriteLine(docs.totalHits); // 0 hit