Question

我正在使用pyKML模块从给定的KML文件中提取坐标。

我的Python代码如下：

from pykml import parser
fileobject = parser.fromstring(open('MapSource.kml', 'r').read())
root = parser.parse(fileobject).getroot()
print(xml.Document.Placemark.Point.coordinates)

但是，在运行此操作时，我收到以下错误：

ValueError: Unicode strings with encoding declaration are not supported. Please use bytes input or XML fragments without declaration.

在寻找解决方案的过程中，我遇到了这个解决方案http://twigstechtips.blogspot.in/2013/06/python-lxml-strings-with-encoding.html，我尝试了这个解决方案（我不确定这是正确的方法）：

from pykml import parser
from lxml import etree
from os import path
kml_file = open('MapSource.kml', 'r')
parser = etree.XMLParser(recover=True)
xml = etree.fromstring(kml_file, parser)
print(xml.Document.Placemark.Point.coordinates)

这给了我ValueError: can only parse strings。我解析KML并获得该结构坐标的正确方法是什么？

Answer 1

在上面的示例中，root = parser.parse(fileobject).getroot()在文件内容上调用parse（）作为从上一行的fromstring（）函数返回的字符串。

有两种使用pyKML解析KML文件的方法：

1：使用parse.parse（）解析文件。

from pykml import parser
with open('MapSource.kml', 'r') as f:
  root = parser.parse(f).getroot()
print(root.Document.Placemark.Point.coordinates)

2：使用parse.parsestring（）解析字符串内容。

from pykml import parser
root = parser.fromstring(open('MapSource.kml', 'rb').read())
print(root.Document.Placemark.Point.coordinates)

如果KML文件的第一行使用非UTF8编码，并且XML序言标头为XML，并且尝试以二进制格式读取“ r”作为文本而“ rb”的文件，则方法＃2可能会失败。

如果在KML文档中未正确指定编码，则注释解析可能会失败。由于名称和描述中使用国际和图形字符，因此在以下示例中使用了ISO-8859-1编码。省略编码或使用““ UTF-8”会使它成为无效的XML文件。

<?xml version="1.0" encoding="ISO-8859-1"?>
<kml xmlns="http://www.opengis.net/kml/2.2">
<Document>
  <Placemark>
    <name>Río Grande</name> 
    <description>
      Location: 18° 22′ 49″ N, 65° 49′ 53″ W
    </description>
    ...

使用pyKML解析KML文档

1 个答案: