Looking for an algorithm to parse EHR medication notes

Time: 2017-05-08 20:29:11

Tags: python nltk

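I'm trying to use Python and NLTK to parse some doctors' notes that describe medication prescriptions. I'm looking for a way to identify the numeric value of the number of items taken, as well as the frequency at which they are taken.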

1 TABLET DAILY
TAKE 1 TABLET DAILY
ONE TABLET TWICE DAILY
2 DAILY
TWO TABLETS DAILY
ONE PILL AT BEDTIME
1/2 PILL TWICE DAILY
ROLLING WALKER WITH SEAT ATTACHMENT AND HAND BRAKES
ONE PILL DAILY
1 TAB PO DAILY
ONE PILL TWICE A DAY WITH MEALS AS NEEDED
1 TABLET TWICE DAILY
300 MG BID
ONE DAILY
1 TABLET 3 TIMES DAILY AS NEEDED
1 DAILY
TAKE 1 CAPSULE BY MOUTH 4 (FOUR) TIMES A DAY.
1 TABLET EVERY 4 TO 6 HOURS AS NEEDED
1 TABLET BY MOUTH TWICE DAILY
INJECT 34 U TWICE A DAY

Any suggestions?

1 answer:

Answer 0: (score: 0)

Doctors usually write the same instruction in multiple variations in clinical notes. For example:

1 TABLET DAILY 

can also be written as

1 tab qd

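If you're looking for a quick solution, writing a Python script with regular expressions may help. But if you want something more long-term, take a look at the data and submissions from the i2b2 Medication Information Extraction Challenge.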
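As a rough illustration of the regular-expression route, here is a minimal sketch. The function name `parse_sig`, the lookup tables, and the choice to treat "AT BEDTIME" as once a day are my own assumptions, not an established standard; the patterns only cover phrasings like the ones in the question and would need extending for real data.

```python
import re
from fractions import Fraction

# Hypothetical lookup table for spelled-out quantities; extend as needed.
NUMBER_WORDS = {"ONE": 1, "TWO": 2, "THREE": 3, "FOUR": 4, "HALF": Fraction(1, 2)}

# Quantity at the start of the note, optionally preceded by TAKE or INJECT.
# The fraction alternative comes first so "1/2" is not read as "1".
QTY = re.compile(r"^(?:TAKE\s+|INJECT\s+)?(\d+/\d+|\d+|ONE|TWO|THREE|FOUR|HALF)\b")

# (pattern, doses per day). A value of None means "use the captured number".
# Order matters: more specific phrases must be tried before plain DAILY.
FREQ_PATTERNS = [
    (re.compile(r"\b(\d+)\s*(?:\(\w+\)\s*)?TIMES\s+(?:A\s+DAY|DAILY)\b"), None),
    (re.compile(r"\bTWICE\s+(?:A\s+DAY|DAILY)\b|\bBID\b"), 2),
    (re.compile(r"\bTID\b"), 3),
    (re.compile(r"\bQID\b"), 4),
    # Assumption: "AT BEDTIME" is counted as once a day.
    (re.compile(r"\bDAILY\b|\bQD\b|\bAT\s+BEDTIME\b"), 1),
]


def parse_sig(text):
    """Return (quantity, times_per_day); either is None when not recognised."""
    s = text.upper().strip()

    quantity = None
    m = QTY.match(s)
    if m:
        token = m.group(1)
        # Fraction accepts ints, Fractions, and strings such as "1/2" or "34".
        quantity = Fraction(NUMBER_WORDS.get(token, token))

    per_day = None
    for pattern, value in FREQ_PATTERNS:
        m = pattern.search(s)
        if m:
            per_day = value if value is not None else int(m.group(1))
            break

    return quantity, per_day


if __name__ == "__main__":
    for note in ["1/2 PILL TWICE DAILY", "INJECT 34 U TWICE A DAY", "1 tab qd"]:
        print(note, "->", parse_sig(note))
```

Lines it cannot interpret, such as the ROLLING WALKER entry, come back as `(None, None)`, and interval phrasings like EVERY 4 TO 6 HOURS return a quantity but no frequency. Note that for "300 MG BID" the leading number is a strength rather than a count of items, so you would probably want to capture the unit as well before relying on the quantity.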