Lucene评分功能

时间:2016-05-15 17:22:10

标签: java twitter lucene

我正在开发一个在索引中索引和搜索推文的系统,每条推文都有一个字段来定义其社会重要性(社交价值),我想将这个值添加到相似度得分中,这样我就可以对文档进行排名通过将他们的社会价值和他们的分数结合起来进行查询。

例如,我的评分功能就像

Final Score = QueryScore + Social score ( which is a float that I already calculated)

那我怎么能实现这个目标呢?

我正在使用lucene-5.5.0

package Lucene;


import java.nio.file.Path;
import java.nio.file.Paths;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.queryparser.classic.QueryParser;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.FSDirectory;

public class SearchFiles {

    @SuppressWarnings("deprecation")
    public static void main(String[] args){
        try{

            Path path = Paths.get("C:\\Users\\JUGURTHA\\Desktop\\boulot\\index");
            Directory dir = FSDirectory.open(path);
            DirectoryReader ireader = DirectoryReader.open(dir);
            IndexSearcher isearcher = new IndexSearcher(ireader);
            StandardAnalyzer analyzer = new StandardAnalyzer();
            //get each token            
            QueryParser parser = new QueryParser("text", analyzer);
            Query query = parser.parse("Love");
            ScoreDoc[] hits = isearcher.search(query, null, 20).scoreDocs;
            for (int i = 0; i <  hits.length; i++){
                Document hitDoc = isearcher.doc(hits[i].doc);
                System.out.println("Tweet " + i + " : " + hitDoc.get("text"));
                System.out.println("created_at: " + hitDoc.get("date"));
                System.out.println("id: " + hitDoc.get("id"));
                System.out.println();
                System.out.println();

            }


        } catch(Exception e){
            e.printStackTrace();
        }
    }
}

1 个答案:

答案 0 :(得分:0)

我想出了如何完成工作,我使用了CustomScoreProvider类,并得到了我想要的结果

class CustomizedScoreProvider extends CustomScoreProvider {

    public CustomizedScoreProvider(LeafReaderContext reader) {
            super(reader);
            // TODO Auto-generated constructor stub
        }

    public  float customScore(int doc, float subQueryScore,float valSrcScores[]){

     try {

         subQueryScore+=4; // I only added this for testing , 
     } catch(Exception e) { 
         e.printStackTrace();
            }
    return subQueryScore;
             }
    }

class CustomizedScoreQuery extends CustomScoreQuery{


public CustomizedScoreQuery(Query subQuery,IndexReader ireader) {
        super(subQuery);
        // TODO Auto-generated constructor stub
    }
public CustomizedScoreProvider getCustomScoreProvider (LeafReaderContext reader){
    CustomizedScoreProvider i=new CustomizedScoreProvider(reader);
     return (i);
}
}