对不起这个问题,我知道这是一个反复出现的主题,但我无法解决我的问题,实际上,这个问题很难描述:我想写一个文件执行的输出因为它。我的意思是,我有一个像这样的输出(抱歉这个烂摊子):
.251.242.133|:80... connected.\r\nHTTP request sent, awaiting response... 200 OK\r\nLength: 2371567 (2.3M) [application/x-gzip]\r\nSaving to: `110907_ERP000591.tar.gz\'\r\n\r\n\r 0% [ ] 0 --.-K/s \r 0% [ ] 23,003 104K/s \r 3% [> ] 82,863 184K/s \r 8% [==> ] 192,363 282K/s \r15% [=====> ] 371,943 411K/s \r26% [=========> ] 634,175 563K/s \r39% [==============> ] 925,283 680K/s \r52% [===================> ] 1,250,295 790K/s \r63% [=======================> ] 1,497,035 830K/s \r73% [===========================> ] 1,732,663 861K/s \r81% [==============================> ] 1,937,063 867K/s \r88% [=================================> ] 2,099,123 855K/s \r95% [====================================> ] 2,268,483 847K/s \r100%[======================================>] 2,371,567 849K/s in 2.7s \r\n\r\n2012-11-01 15:34:10 (849 KB/s) - `110907_ERP000591.tar.gz\' saved [2371567/2371567]\r\n\r\n110907_ERP000591/\r\n110907_ERP000591/1_110907_ERP000591_2_fastq.txt\r\n110907_ERP000591/1_110907_ERP000591_1_fastq.txt\r\n/home/travis/opt/bcbb/nextgen/tests/data/automated/../100326_FC6107FAAXX\r\n--2012-11-01 15:34:10-- http://chapmanb.s3.amazonaws.com/100326_FC6107FAAXX.tar.gz\r\nResolving chapmanb.s3.amazonaws.com (chapmanb.s3.amazonaws.com)... 205.251.242.133\r\nConnecting to chapmanb.s3.amazonaws.com (chapmanb.s3.amazonaws.com)|205.251.242.133|:80... connected.\r\nHTTP request sent, awaiting response... 200 OK\r\nLength: 7014592 (6.7M) [application/x-gzip]\r\nSaving to: `100326_FC6107FAAXX.tar.gz\'\r\n\r\n\r 0% [ ] 0 --.-K/s \r 0% [ ] 17,163 77.9K/s \r 0% [ ] 64,775 147K/s \r 2% [ ] 174,843 263K/s \r 5% [=> ] 399,683 456K/s \r12% [===> ] 866,883 790K/s \r25% [========> ] 1,798,363 1.33M/s \r45% [================> ] 3,178,955 1.90M/s \r65% [========================> ] 4,592,803 2.41M/s \r65% [========================> ] 4,629,303 2.17M/s \r67% [=========================> ] 4,761,595 2.02M/s \r74% [============================> ] 5,245,423 2.03M/s \r83% [===============================> ] 5,862,435 2.08M/s \r100%[======================================>] 7,014,592 2.46M/s in 2.7s \r\n\r\n2012-11-01 15:34:13 (2.46 MB/s) -
所以,正如你所看到的,我有一个奇怪的格式化输出,在终端中显示如下:
/home/travis/opt/bcbb/nextgen/tests/data/automated/../100326_FC6107FAAXX
--2012-11-01 15:34:10-- http://chapmanb.s3.amazonaws.com/100326_FC6107FAAXX.tar.gz
Resolving chapmanb.s3.amazonaws.com (chapmanb.s3.amazonaws.com)... 205.251.242.133
Connecting to chapmanb.s3.amazonaws.com (chapmanb.s3.amazonaws.com)|205.251.242.133|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 7014592 (6.7M) [application/x-gzip]
Saving to: `100326_FC6107FAAXX.tar.gz'
100%[======================================>] 7,014,592 2.46M/s in 2.7s
2012-11-01 15:34:13 (2.46 MB/s) - `100326_FC6107FAAXX.tar.gz' saved [7014592/7014592]
好多了...我想用这个表单将输出写入一个文件,所以我不能剥离\ r \ n或类似的东西,因为这种方式对于进程中的每一步都会出现一行下载。我只想写一个带有最终输出的文件。
任何帮助?
修改
抱歉,我应该更清楚:这不是命令的结果,而是如下所述,解析json文件的结果。您可以重现输出:
import urllib
import json
string_to_write = json.loads(urllib.urlopen('https://travis-ci.org/jobs/3019024.json').read())['log']
答案 0 :(得分:1)
这应该有效:
re.sub(r"(?m)^.*\r(?!$)", "", text)
(?m)
标记表示这是多行正则表达式,因此^
和$
匹配每行的开头和结尾,而不是整个文本。
因此,我们匹配从.*
行开始到包括回车^
的所有\r
,除非回车符后面紧跟回车行(?!$)
- 如果您的文字包含CRLF换行符,那将是一样的。并且,将匹配替换为空字符串。
答案 1 :(得分:0)
看起来你在* NIX系统上。如果通过在命令行上运行progn
(实际上可能是python myscript.py
)生成此输出,则可以轻松地执行此操作:
progn > logfile
这样,progn
的输出会保存到logfile
而不会删除任何字符。
答案 2 :(得分:0)
如果你想删除终端上隐藏的内容,那么你可以试试这个 - >
# t is your text
''.join([i for i in t.split('\r') if i.startswith('\n') or i.startswith('100%')])
解决方案很棘手(!)但我认为它可能适合你。