Question

evernote软件提供将笔记本导出到.enex文件的选项。这可以被解析以获得识别的文本及其在图像中的位置。（如http://blog.evernote.com/tech/2013/07/18/how-evernotes-image-recognition-works/所示）现在我希望通过Evernote的python SDK获得这个可识别的文本。这就是我到目前为止所做的事情：

我可以在https://sandbox.evernote.com创建一个笔记本从浏览器中搜索笔记中的文字。
创建了一个用于访问此笔记本的开发令牌。

我可以迭代我的笔记本并获取他们的指南。并使用能够访问笔记本内部的笔记。

    notebooks = self.noteStore.listNotebooks()

    print "Notebooks:"
    for notebook in notebooks:
        print notebook.name, notebook.guid
        # display notes in the notebook
        print self.noteStore.findNoteCounts(self.dev_token, NoteFilter(notebookGuid=notebook.guid), True)
        spec = NotesMetadataResultSpec(includeTitle=True)
        # The following provides the note guid
        print self.noteStore.findNotesMetadata(self.dev_token, NoteFilter(notebookGuid=notebook.guid), 0, 10, spec)`

但我无法访问注释中的已识别文本。尝试：

note = self.noteStore.getNote(self.dev_token, note_guid, False, True, True, True) for r in note.resources: print "resource guid: ", r.guid resource = self.noteStore.getResource(r.guid, True, False, True, False) print resource.data.body

也尝试过（如How to Access A Note's Content中所述）：

noteContent = self.noteStore.getNoteContent(self.dev_token, note_guid)
    print noteContent

这就是这样的：

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE en-note SYSTEM "http://xml.evernote.com/pub/enml2.dtd">
<en-note><div><br clear="none"/></div><br/><en-media hash="dhhd2e240c83140sjskca0bf6e8c9661fd1c3472" type="image/jpeg"/></en-note>

但在http://dev.evernote.com/doc/articles/image_recognition.php recoExample.py能够提取已识别的文本。只有我和我之间的区别是我在沙箱环境中运行。

我该怎么做才能获得认可的文字？

如何使用他们的SDK（python）访问EverNote中的已识别文本

0 个答案: