Question

<sup></sup>标记会中断Python中的文本读取。它会在X处停止Categorical variables were compared using the χ<sup>2</sup>test.，并且不会打印更多文本。

<Abstract>
         <AbstractText> The disease free survival (DFS) and overall survival (OS) were calculated by the Kaplan-Meier method. Categorical variables were compared using the χ<sup>2</sup>test.</AbstractText>
</Abstract>

Answer 1

一些可能的解决方案：

1）您使用什么解析器来读取此xml文件？我知道ElementTree的itertext（）方法解析出所选Element的所有subElements中的所有文本。

2）它可能根本不是读者，而是奇怪的X＆＃39;符号就在第一个sup标签之前。你的文字是如何编码的？它能读出这样的符号吗？

在python中读取xml文件时，某些内联标记（如<sup>）不会读取完整文本？

1 个答案: