Question

I want to read an Excel file and extract some information about certain people.

This is what I am doing:

import os
import xlrd

dir = './schfiles'
files = os.listdir(dir)
f = files[0]
book = xlrd.open_workbook(dir + "/" + f)
sh = book.sheet_by_index(0)
# xlr2i and xlc2i are helpers defined elsewhere (not shown)
t = sh.cell_value(rowx=xlr2i(35), colx=xlc2i('F'))
t.find(u"Усманов")

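The string stored in the variable t is u'\u0434\u043e\u0446.\u0423\u0441\u043c\u0430\u043d\u043e\u0432\u0411.\u0428.', which displays as "доц.УсмановБ.Ш.".

The literal u"Усманов", however, is represented as u'\xd3\xf1\xec\xe0\xed\xee\xe2'.

I tried encoding both strings as 'utf8' and decoding them with external libraries, but nothing helped.

Does anyone know how to find a specific substring here?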

Answer 1

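Put # -*- coding: utf-8 -*- on the first line of your script to tell the interpreter which encoding the source file uses. The file itself must also be saved as UTF-8, so that the u"Усманов" literal is decoded into the correct Unicode characters.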

# -*- coding: utf-8 -*-

import os
import xlrd

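dir = './schfiles'
files = os.listdir(dir)
f = files[0]

# Build the path portably instead of concatenating with "/"
workbook_path = os.path.join(dir, f)
book = xlrd.open_workbook(workbook_path)

sh = book.sheet_by_index(0)
# xlr2i and xlc2i are the helpers from the question (not shown there)
t = sh.cell_value(rowx=xlr2i(35), colx=xlc2i('F'))
t.find(u"Усманов")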
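
To see why the search fails without a correctly decoded literal, here is a small sketch that reproduces the mismatch without needing the Excel file. It rests on an inference, not on anything stated in the question: the value u'\xd3\xf1\xec\xe0\xed\xee\xe2' happens to match the cp1251 bytes of "Усманов" with each byte read as a separate Latin-1 character. The names needle, broken and repaired are made up for illustration, and the search term is written with \u escapes so the result does not depend on how the file is saved.

```python
# -*- coding: utf-8 -*-
# Sketch only: reproduces the mismatch from the question without xlrd.

# The cell value as reported in the question.
t = u"\u0434\u043e\u0446.\u0423\u0441\u043c\u0430\u043d\u043e\u0432\u0411.\u0428."

# The intended search term, spelled with escapes ("Усманов").
needle = u"\u0423\u0441\u043c\u0430\u043d\u043e\u0432"

# Assumption: the literal in the question was cp1251 bytes that were
# interpreted one byte per character (Latin-1).
broken = needle.encode("cp1251").decode("latin-1")

print(repr(broken))      # same code points as the literal in the question
print(t.find(broken))    # not found
print(t.find(needle))    # found at the index where the surname starts

# Under the same assumption, the broken literal can be repaired by
# reversing the two steps.
repaired = broken.encode("latin-1").decode("cp1251")
print(repaired == needle)
```

If the inference holds, nothing is wrong with the string xlrd returns; only the search term was mis-decoded. With the coding declaration in place and the file saved as UTF-8, or with the \u escapes shown here, t.find returns the position of the surname instead of -1.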
