Stanford CoreNLP:如何仅使用Name Entities(OpenIE)获取RelationTriple三元组?

时间:2017-08-07 17:44:49

标签: java stanford-nlp information-extraction

我目前正在 CoreNLP开放信息提取(OpenIE)中搜索仅包含 NameEntities 在主题和对象类型中。但我不知道如何获得RelationTriple List<CoreMap>对象的实体类型。

以下是https://stanfordnlp.github.io/CoreNLP/openie.html的代码:

import edu.stanford.nlp.ie.util.RelationTriple;
import edu.stanford.nlp.ling.CoreAnnotations;
import edu.stanford.nlp.pipeline.Annotation;
import edu.stanford.nlp.pipeline.StanfordCoreNLP;
import edu.stanford.nlp.naturalli.NaturalLogicAnnotations;
import edu.stanford.nlp.util.CoreMap;

import java.util.Collection;
import java.util.Properties;

/**
 * A demo illustrating how to call the OpenIE system programmatically.
 */
public class OpenIEDemo {

  public static void main(String[] args) throws Exception {
    // Create the Stanford CoreNLP pipeline
    Properties props = new Properties();
    props.setProperty("annotators", "tokenize,ssplit,pos,lemma,depparse,natlog,openie");
    StanfordCoreNLP pipeline = new StanfordCoreNLP(props);

    // Annotate an example document.
    Annotation doc = new Annotation("Obama was born in Hawaii. He is our president.");
    pipeline.annotate(doc);

    // Loop over sentences in the document
    for (CoreMap sentence : doc.get(CoreAnnotations.SentencesAnnotation.class)) {
      // Get the OpenIE triples for the sentence
      Collection <RelationTriple> triples = sentence.get(NaturalLogicAnnotations.RelationTriplesAnnotation.class);
      // Print the triples
      for (RelationTriple triple : triples) {
      // Here is where I get the entity type from a triple's subject or object
        System.out.println(triple.confidence + "\t" +
            triple.subjectLemmaGloss() + "\t" +
            triple.relationLemmaGloss() + "\t" +
            triple.objectLemmaGloss());
      }
    }
  }
}

如果有某种方法可以从RelationTriple课程中获取实体类型,我将非常感谢您的帮助。

1 个答案:

答案 0 :(得分:0)

subjectobject实例变量应该是CoreLabl的列表,它们具有通过#ner()方法附加的命名实体信息。像下面这样的东西应该做你想要的:

Collection<RelationTriple> triples = sentence.get(RelationTriplesAnnotation.class);
List<RelationTriple> withNE = triples.stream()
    // make sure the subject is entirely named entities
    .filter( triple -> 
        triple.subject.stream().noneMatch(token -> "O".equals(token.ner())))
    // make sure the object is entirely named entities
    .filter( triple -> 
        triple.object.stream().noneMatch(token -> "O".equals(token.ner())))
    // Convert the stream back to a list
    .collect(Collectors.toList());