应用错误收集

我使用docx python库读取了一个MS-Word文件。 word文件包含表格（表格中只有1列），如table1，table2等。我需要逐桌阅读。每当一行有一个或多个图片文件时，我需要显示＆＃34;找到图像文件＆＃34;在表＃和行＃＆＃34;

这就是我所做的。 docx api中的 inline_shapes属性给出了找到的图像文件的对象列表。但它没有指定找到图像的表号和行号。

from docx.api import Document

doc = Document("demo.docx")

for image in doc.inline_shapes:
    print(image)

for table in doc.tables:
    for row in table.rows:
        for cell in row.cells:
            print(cell.text) # it prints only the text and no info about image

很少有用的参考资料

python docx how to read text along with inline images?

Finding image present docx file using python

从MS Word中的表中识别图像文件

0 个答案: