Question

I import a txt file with

np.genfromtxt(file_name, dtype='str')

For example, this gives me the following numpy array

['aaa' 'aaa' 'a']

What I ultimately want is a numpy array that looks like this

[['a', 'a', 'a'], ['a', 'a', 'a'], ['a', 'a', 'a']]

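Keep in mind that the last line of the text file contains only one a, so the script should automatically add 2 more a's to match the longest list in the array.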

I have managed to put commas between the three strings with

[s.replace(' ', ',') for s in file]

But if I replace the spaces with ] [ instead, it does not seem to work.

Any suggestions?

Answer 1

Are you looking for something like this?

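s = "'aaa' 'aaa' 'a'"  # renamed from `str` to avoid shadowing the built-in
# pad the trailing single 'a' out to three items
s2 = s.replace("'a'", "'a' 'a' 'a'")
# break each 'aaa' into three items and end the row with a comma
s3 = s2.replace("'aaa' ", "'a' 'a' 'a',")
s4 = s3.replace("'aaa'", "'a' 'a' 'a',")
# one row per comma-separated group; drop the quotes and split on spaces
my_data2 = [row.replace("'", "").split() for row in s4.split(',') if row]
print(my_data2)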

Note: sorry for the basic reply, this is my first answer. I hope it helps.

Edit

# each step returns a new list, so the result has to be assigned back
file = [s.replace("'a'", "'a','a','a'") for s in file]  # pad the last item to 3 'a's
file = [s.replace("'aaa' ", "'a','a','a' ") for s in file]  # split each leading 'aaa' into 3 items
file = [s.split(" ") for s in file]  # one list of "'a','a','a'" groups per line

Answer 2

import numpy as np

def func(file_name):
    arr = np.genfromtxt(file_name, dtype='str')
    # use this line in case the ',' between strings really is missing
    # from the loaded numpy array in your question
    # arr = arr.tolist().split()
    rows = []
    for i in arr:
        el = list(i)  # split the string into single characters
        while len(el) < 3:  # pad short rows with 'a' up to length 3
            el.append('a')
        rows.append(el)
    return np.array(rows)

I hope this works for you.
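The loop above hard-codes the target width 3 and the fill character 'a'. Below is a minimal sketch of a more general variant, assuming the rows are already available as non-empty strings. `pad_rows` and its `fill` parameter are hypothetical names chosen for illustration, not part of NumPy.

```python
import numpy as np

def pad_rows(rows, fill=None):
    """Split each string into characters and pad short rows to the longest length.

    `fill` is a hypothetical parameter: when it is None, each row is padded
    with its own last character. Rows are assumed to be non-empty strings.
    """
    rows = [str(r) for r in np.atleast_1d(rows)]
    width = max(len(r) for r in rows)
    padded = []
    for r in rows:
        chars = list(r)
        pad_char = fill if fill is not None else chars[-1]
        chars.extend([pad_char] * (width - len(chars)))
        padded.append(chars)
    return np.array(padded)

result = pad_rows(['aaa', 'aaa', 'a'])
print(result.shape)
```

Taking the width from the longest row means the function does not need to change if the file later contains longer lines.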

Answer 3

Use a list comprehension.

For example:

import numpy as np

data = np.genfromtxt(filename, dtype='str')
mValue = len(max(data, key=len))  # length of the longest string
print([list(i.ljust(mValue, i[0])) for i in data])
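The snippet above prints a nested list, while the question asks for a numpy array. Here is a minimal sketch of the same `ljust` idea wrapped in a function that returns a 2-D array. The name `chars_from_text` is hypothetical, and an in-memory `io.StringIO` stands in for the real text file so the example runs on its own.

```python
import io
import numpy as np

def chars_from_text(source):
    """Load one string per line and return a 2-D array of single characters."""
    # atleast_1d is a guard for input that loads as a single string
    data = np.atleast_1d(np.genfromtxt(source, dtype=str))
    width = max(len(s) for s in data)
    # pad each string on the right with its own first character, then split it
    return np.array([list(s.ljust(width, s[0])) for s in data])

fake_file = io.StringIO("aaa\naaa\na\n")  # stand-in for the txt file
grid = chars_from_text(fake_file)
print(grid.shape)
```

Padding with `s[0]` follows the answer above. It works for the example because every row uses a single repeated character, and a fixed fill character may be the safer choice for mixed rows.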

How to create a numpy array from a txt file
