Question

我需要从文本文件中提取某些行。假设我正在寻找"abcd1234"。这四个数字每次都不同，但前四个字母保持不变。

这是我到目前为止所做的：

infile = file ('//Users//Nhi//Documents//Gene List.rtf', 'r')
outfile = file ('//Users//Nhi//Documents//new.docx', 'w')

for line in infile:
    outfile.write("|MmarC5_\d{4}")

infile.close()
outfile.close()

然而，输出实际上是短语"|MmarC5_\d{4}"，而不是每次都不同的4个数字。

Answer 1

 with open("somefile.txt") as f:
      print re.findall("abcd\d{4}",f.read())

是获得任何匹配的一种方式＆＃34; abcd ####＆＃34;

将文本文档中的信息提取到另一个文本文档中

1 个答案: