我有一连串的广告,这是从一些报纸中提取的。广告可能会以如下所示的格式显示:我的任务是提取已故人员的姓名。
John, the small son of Mr. and Mrs.<br>
Elmer Cleppfer, died at their home in<br>
Lewistown on Wednesday. The funeral<br>
will He held on Saturday afternoon<br>
from the home of the grandparents<br>
on the child, Mr. and Mrs. John<br>
Kiopper, 224 Locust street, tortiorrow<br>
afternoon at 2 o'clock. Interment witt<br>
take place at Oberlin.<br>
Mrs. Lydia Mintch, aged 6S years <br>
died yesterday afternoon at the home<br>
of Fred Flowerfleld at Enhaut. Mrs.<br>
Mlnlch contracted a severe attack of<br>
pneumonia aggravated by other illness<br>
Several days ago which resulted in her<br>
death. Funeral arrangements have not<br>
yet been completed.<br>
整个段落由2个广告组成。如果有超过1个这样的广告,任何人都可以告诉我如何将这类文本分类成段落吗?
答案 0 :(得分:0)
好Stanford Parser是你的选择。
我故意不放弃你应该放的模式 在你的努力中也是如此。
答案 1 :(得分:0)
以下是我如何处理这个问题。