Detect only full-word matches from a dictionary

Time: 2017-03-31 05:48:30

Tags: nlp uima ruta

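I am trying to detect city names from a dictionary, but a word written without the space is detected as well. For example:

Dictionary start

New York ..

Dictionary end

Input start

New York

NewYork

Input end

I want "New York" to be detected as a city, but not "NewYork".

I am using the following rules.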

WORDLIST USCitiesList = 'en/ruta/dictionaries/USCities.txt';
DocumentAnnotation{-> MARKFAST(USCity, USCitiesList,true)};

2 answers:

Answer 0 (score: 1)

The following script may help.

PACKAGE uima.ruta.example;
DECLARE USCity;
DECLARE NoSpaceUSCity;
WORDLIST USCitiesList = 'USCities.txt';

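// First pass: mark every dictionary match as USCity (case-insensitive).
DocumentAnnotation{-> MARKFAST(USCity, USCitiesList,true)};

// Second pass with SPACE tokens made visible: mark matches as NoSpaceUSCity.
RETAINTYPE(SPACE);
DocumentAnnotation{-> MARKFAST(NoSpaceUSCity, USCitiesList,true)};
// Reset the retained types to the default.
RETAINTYPE;

// Remove every USCity that is part of a NoSpaceUSCity.
USCity{PARTOF(NoSpaceUSCity)->UNMARK(USCity)};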
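
To make the three steps of the script above easier to follow, here is a rough model of the same idea in plain Python: mark all candidates, mark the run-together spellings, then unmark every candidate covered by a run-together match. This is only a sketch and not Ruta. It does not reproduce how MARKFAST really matches, the function names (`mark_candidates`, `mark_no_space`, `full_word_matches`) are made up for illustration, and it assumes that only multi-word entries can have a run-together spelling.

```python
import re


def _find(pattern, text):
    """Return (start, end) spans of all case-insensitive matches."""
    return [(m.start(), m.end())
            for m in re.finditer(pattern, text, flags=re.IGNORECASE)]


def mark_candidates(text, entries):
    """Step 1: match entries with whitespace between their words optional."""
    spans = []
    for entry in entries:
        words = [re.escape(w) for w in entry.split()]
        # (?<!\w) and (?!\w) keep the match from starting or ending mid-word.
        spans += _find(r"(?<!\w)" + r"\s*".join(words) + r"(?!\w)", text)
    return sorted(spans)


def mark_no_space(text, entries):
    """Step 2: match only the run-together spelling of multi-word entries."""
    spans = []
    for entry in entries:
        words = [re.escape(w) for w in entry.split()]
        if len(words) > 1:  # assumption: single-word entries are left alone
            spans += _find(r"(?<!\w)" + "".join(words) + r"(?!\w)", text)
    return sorted(spans)


def full_word_matches(text, entries):
    """Step 3: drop every candidate that lies inside a run-together match."""
    no_space = mark_no_space(text, entries)
    return [(s, e) for (s, e) in mark_candidates(text, entries)
            if not any(ns <= s and e <= ne for ns, ne in no_space)]


text = "New York is large. NewYork is a typo."
print(full_word_matches(text, ["New York"]))  # prints [(0, 8)]
```

In this model only the span covering "New York" survives, which is the outcome the question asks for.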

Answer 1 (score: 1)

PACKAGE uima.ruta.example;
DECLARE USCity;
DECLARE NoSpaceUSCity;
WORDLIST USCitiesList = 'USCities.txt';

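// First pass: mark every dictionary match as USCity (case-insensitive).
DocumentAnnotation{-> MARKFAST(USCity, USCitiesList,true)};

// Second pass with SPACE tokens visible: the lookup is triggered only where
// capitalized words (CW) follow each other with no space in between.
RETAINTYPE(SPACE);
CW CW+{-> MARKFAST(NoSpaceUSCity, USCitiesList,true)};
RETAINTYPE;

// Remove every USCity that is part of a NoSpaceUSCity.
USCity{PARTOF(NoSpaceUSCity)->UNMARK(USCity)};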