我目前正在 CoreNLP开放信息提取(OpenIE)中搜索仅包含 NameEntities 在主题和对象类型中。但我不知道如何获得RelationTriple
List<CoreMap>
对象的实体类型。
以下是https://stanfordnlp.github.io/CoreNLP/openie.html的代码:
import edu.stanford.nlp.ie.util.RelationTriple;
import edu.stanford.nlp.ling.CoreAnnotations;
import edu.stanford.nlp.pipeline.Annotation;
import edu.stanford.nlp.pipeline.StanfordCoreNLP;
import edu.stanford.nlp.naturalli.NaturalLogicAnnotations;
import edu.stanford.nlp.util.CoreMap;
import java.util.Collection;
import java.util.Properties;
/**
* A demo illustrating how to call the OpenIE system programmatically.
*/
public class OpenIEDemo {
public static void main(String[] args) throws Exception {
// Create the Stanford CoreNLP pipeline
Properties props = new Properties();
props.setProperty("annotators", "tokenize,ssplit,pos,lemma,depparse,natlog,openie");
StanfordCoreNLP pipeline = new StanfordCoreNLP(props);
// Annotate an example document.
Annotation doc = new Annotation("Obama was born in Hawaii. He is our president.");
pipeline.annotate(doc);
// Loop over sentences in the document
for (CoreMap sentence : doc.get(CoreAnnotations.SentencesAnnotation.class)) {
// Get the OpenIE triples for the sentence
Collection <RelationTriple> triples = sentence.get(NaturalLogicAnnotations.RelationTriplesAnnotation.class);
// Print the triples
for (RelationTriple triple : triples) {
// Here is where I get the entity type from a triple's subject or object
System.out.println(triple.confidence + "\t" +
triple.subjectLemmaGloss() + "\t" +
triple.relationLemmaGloss() + "\t" +
triple.objectLemmaGloss());
}
}
}
}
如果有某种方法可以从RelationTriple
课程中获取实体类型,我将非常感谢您的帮助。
答案 0 :(得分:0)
subject
和object
实例变量应该是CoreLabl
的列表,它们具有通过#ner()
方法附加的命名实体信息。像下面这样的东西应该做你想要的:
Collection<RelationTriple> triples = sentence.get(RelationTriplesAnnotation.class);
List<RelationTriple> withNE = triples.stream()
// make sure the subject is entirely named entities
.filter( triple ->
triple.subject.stream().noneMatch(token -> "O".equals(token.ner())))
// make sure the object is entirely named entities
.filter( triple ->
triple.object.stream().noneMatch(token -> "O".equals(token.ner())))
// Convert the stream back to a list
.collect(Collectors.toList());