Question

我一直在尝试从excel文件中提取数据，将其转换为数组，然后将其写入其他当前未定义的文件类型（因此，.txt文件是当前的占位符文件类型）。我确定代码相当丑陋，但是可以正常工作：

import os
import pandas as pd
import glob
import numpy as np

def xlxtract():
for filename in glob.glob('*.xlsx'):
    ExcelFile = filename[:-5]
    RosewoodData = pd.read_excel(ExcelFile + '.xlsx')
    DataMatrix = np.array(RosewoodData)
    DataMatrixString = np.array2string(DataMatrix, precision=4, separator=' ')
    NewFile = open(ExcelFile + 'MATRIX.txt', 'w')
    NewFile.write(' ' + DataMatrixString[1:-1])
    NewFile.close()
    print('Your file has been printed to ' + ExcelFile + '.txt')

无论如何，我遇到的问题是，尽管它确实可以打印到.txt文件，但并没有删除括号。输出看起来像这样（生成了随机数作为测试）：

我想去掉方括号，但是似乎没有任何一行方法可以做到这一点。任何帮助将不胜感激。

Answer 1

array2string格式化数组以供显示，就像执行print一样：

In [32]: x = np.arange(12).reshape(4,3)
In [33]: x
Out[33]: 
array([[ 0,  1,  2],
       [ 3,  4,  5],
       [ 6,  7,  8],
       [ 9, 10, 11]])
In [34]: np.array2string(x)
Out[34]: '[[ 0  1  2]\n [ 3  4  5]\n [ 6  7  8]\n [ 9 10 11]]'

请注意，列表的打印字符串还包括方括号（和逗号）：

In [35]: str(x.tolist())
Out[35]: '[[0, 1, 2], [3, 4, 5], [6, 7, 8], [9, 10, 11]]'

array2string的另一个复杂之处在于，它对长数组使用省略号缩写（尽管可以通过参数更改）。

np.savetxt是将2d数组写入文件的相对简单的方法，并且可以使用显式格式进行仿真。

In [37]: np.savetxt('test.txt', x, fmt='%d', delimiter=',')
In [38]: cat test.txt     # ipython system command to display a file
0,1,2
3,4,5
6,7,8
9,10,11
In [39]: for row in x:
    ...:     print('%d,%d,%d'%tuple(row))
    ...:     
0,1,2
3,4,5
6,7,8
9,10,11

或作为一个字符串

In [42]: astr = '\n'.join(['%3d %3d %3d'%tuple(row) for row in x])
In [43]: astr
Out[43]: '  0   1   2\n  3   4   5\n  6   7   8\n  9  10  11'

np.array2string不删除数组周围的括号

1 个答案: