如何在python的分裂中编码字符?

时间:2014-05-28 14:36:31

标签: python python-2.7 encoding utf-8 splinter

我使用splinter为网站进行功能测试。只要我没有输入特殊字符,测试就会运行得很好,就像德国的Umlaute一样。

如何调整脚本以测试具有这些字符的条目?

这是我到目前为止所尝试的:

# coding=utf-8

from splinter import Browser


URL = "http://www.example.com"
USER = raw_input("Username, please: ") # f.e. "Günther"

with Browser('') as browser:
    # Log in as administrator
    browser.visit(URL)
browser.fill('user', USER)

这是我得到的错误日志:

Traceback (most recent call last):
  File "test.py", line 47, in <module>
    browser.fill('user', USER)
  File "C:\Python27\lib\site-packages\splinter\driver\webdriver\__init__.py", line 230, in fill
    field.value = value
  File "C:\Python27\lib\site-packages\splinter\driver\webdriver\__init__.py", line 338, in _set_value
    self._element.send_keys(value)
  File "C:\Python27\lib\site-packages\selenium\webdriver\remote\webelement.py", line 293, in send_keys
    self._execute(Command.SEND_KEYS_TO_ELEMENT, {'value': typing})
  File "C:\Python27\lib\site-packages\selenium\webdriver\remote\webelement.py", line 370, in _execute
    return self._parent.execute(command, params)
  File "C:\Python27\lib\site-packages\selenium\webdriver\remote\webdriver.py", line 164, in execute
    response = self.command_executor.execute(driver_command, params)
  File "C:\Python27\lib\site-packages\selenium\webdriver\remote\remote_connection.py", line 344, in execute
    data = utils.dump_json(params)
  File "C:\Python27\lib\site-packages\selenium\webdriver\remote\utils.py", line 30, in dump_json
    return json.dumps(json_struct)
  File "C:\Python27\lib\json\__init__.py", line 243, in dumps
    return _default_encoder.encode(obj)
  File "C:\Python27\lib\json\encoder.py", line 207, in encode
    chunks = self.iterencode(o, _one_shot=True)
  File "C:\Python27\lib\json\encoder.py", line 270, in iterencode
    return _iterencode(o, 0)
UnicodeDecodeError: 'utf8' codec can't decode byte 0x81 in position 0: invalid start byte

2 个答案:

答案 0 :(得分:1)

USER指定为unicode对象:

USER = u"Günther"
       ^

如果字符串来自外部源,请对其进行解码:

USER = USER.decode('<ENCODING-OF-THE-STRING>')

答案 1 :(得分:0)

Unidecode是我遇到的最轻松的unicode处理程序。