早上好, 首先,我已经阅读了与此问题类似的帖子,但它并没有解决我的问题。
我有3个csv文件(broker-4393行; skips-27761行; tippers-19118行)。 每个csv文件都由相同的函数读取:
长话短说:
broker csv文件(包含4393行)生成1359行的列表。 MISSING
跳过csv文件(包含27761行)会生成27761行的列表。 FINE
tipper csv文件(包含19118行)生成一个包含19118行的列表。 FINE
有人设法找到修复方法吗?
[见下面的费用]
import os, re, csv
# -------------------------------------- General Functions -------------------------------------- #
# function: find journey summary file
def FileFinder(fl, begin):
regex = begin
pattern = re.compile(regex)
for f in fl:
if re.findall(pattern, f): #empty seq = False
global found;
found = re.findall(pattern, f)
# function: read from 'Enquiry-...'
def ReadEnquiry():
with open(d + found[0], "r") as fR:
r = csv.reader(fR)
# capture data from csv file into 'clist'
for row in r:
global rlist;
rlist.append(row)
fR.close()
# ----------------------------------------------------------------------------------------------- #
# --------------------------------------- Broker Functions -------------------------------------- #
# function: Find and Read from BrokerExport.
def BrokerExp():
FileFinder(filelist, 'BrokerExport.*')
ReadEnquiry()
CreateBrokerList(rlist, 48, 17, 74, brokerlist)
# function: create a list of broker data. Format: Account Number,Date,Price(ex-VAT),Profit
def CreateBrokerList(rlist, col1, col2, col3, expList):
for row in rlist:
if row[41] == '': # exclude jobs that were cancelled.
expList.append([row[col1], row[col2], row[col3]])
# ----------------------------------------------------------------------------------------------- #
# ---------------------------------------- Skip Functions --------------------------------------- #
# function: Find and Read from SkipsExport.
def SkipExp():
FileFinder(filelist, 'SkipsExport.*')
ReadEnquiry()
CreateSkipList(rlist, 2, 42, 46, skiplist)
# function: create a list of skip data. Format: Account Number,Date,Price(ex-VAT),Profit
def CreateSkipList(rlist, col1, col2, col3, expList):
for row in rlist:
expList.append([row[col1], row[col2], row[col3]])
# ----------------------------------------------------------------------------------------------- #
# ---------------------------------------- Skip Functions --------------------------------------- #
# function: Find and Read from TipperExport.
def TipperExp():
FileFinder(filelist,'TipperExport.*')
ReadEnquiry()
CreateSkipList(rlist,3,4,34,tipperlist)
# function: create a list of tipper data. Format: Account Number,Date,Price(ex-VAT),Profit
def CreateTipperList(rlist, col1, col2, col3, expList):
for row in rlist:
expList.append([row[col1], row[col2], row[col3]])
# ----------------------------------------------------------------------------------------------- #
# --- General Variables --- #
rlist = []; # 'rlist' list read from csv.
found = '' # string to hold filename found through 'FileFinder()'
d = 'U:/rmarshall/To Do/' # directory to use
headings = ['Company Name', 'Rep', \
'Month 1 Calls', 'Month 1 Inv Tots', 'Month 1 No. of Invs', \
'Month 2 Calls', 'Month 2 Inv Tots', 'Month 2 No. of Invs', \
'Month 3 Calls', 'Month 3 Inv Tots', 'Month 3 No. of Invs', \
'Month 4 Calls', 'Month 4 Inv Tots', 'Month 4 No. of Invs', \
'Month 5 Calls', 'Month 5 Inv Tots', 'Month 5 No. of Invs', \
'Month 6 Calls', 'Month 6 Inv Tots', 'Month 6 No. of Invs', \
'Month 7 Calls', 'Month 7 Inv Tots', 'Month 7 No. of Invs', \
'Month 8 Calls', 'Month 8 Inv Tots', 'Month 8 No. of Invs', \
'Month 9 Calls', 'Month 9 Inv Tots', 'Month 9 No. of Invs', \
'Month 10 Calls', 'Month 10 Inv Tots', 'Month 10 No. of Invs', \
'Month 11 Calls', 'Month 11 Inv Tots', 'Month 11 No. of Invs', \
'Month 12 Calls', 'Month 12 Inv Tots', 'Month 12 No. of Invs']
cp=[headings]; da=[headings]; mb=[headings]; apd=[headings]; bobs=[headings] # separate Rep lists
filelist=os.listdir(d) # place directory filenames into a list
dlist=[]; brokerlist=[]; skiplist=[]; tipperlist=[]; book1=[] # lists used throughout code
brklist=[]; skplist=[]; tprlist=[] # a list of names
# ------------------------- #
# --- main --- #
Enquiry_Main() # call 'Enquiry_Main()' to run all work to create 'cp,da,mb,apd,bob' list data.
rlist=[]; dlist=[] # reset lists
print('1')
BrokerExp() # call 'BrokerExp()' to run all work to create 'brokerlist' data.
rlist=[] # reset list
print('2')
SkipExp() # call 'SkipExp()' to run all work to create 'skiprlist' data.
rlist=[] # reset list
print('3')
TipperExp() # call 'TipperExp()' to run all work to create 'tipperlist' data.
rlist=[] # reset list
a=0
for row in brokerlist:a+=1
print(a)
a=0
for row in skiplist:a+=1
print(a)
a=0
for row in tipperlist:a+=1
print(a)
答案 0 :(得分:0)
你到处使用了很多全局变量,它使你的程序变得复杂。函数可以创建并返回已过滤的列表,因此您可以重复使用它们并将其作为输入传递给另一个函数,该函数还将过滤其他一些参数。
此函数创建本地列表expList并且不返回任何内容。与CreateSkipList,CreateTipperList函数相同。
def CreateBrokerList(rlist, col1, col2, col3, expList):
for row in rlist:
if row[41] == '': # exclude jobs that were cancelled.
expList.append([row[col1], row[col2], row[col3]])
以正确方式返回的列表示例:
def ReadEnquiry(file_to_read, rlist):
with open(file_to_read, "r") as fR:
r = csv.reader(fR)
for row in r:
rlist.append(row)
return rlist
用法示例:
rlist = []
read_list = ReadEnquiry(d + found[0], rlist)
# pass read_list to other function as parameter
brokerlist = []
CreateBrokerList(read_list, 48, 17, 74, brokerlist)
答案 1 :(得分:0)
答案是csv阅读器没有问题。看完代码后,我看到了以下一行:
if row[41] == '': # exclude jobs that were cancelled.
该列表产生了1359个结果而不是4393个,因为差异被上面的代码行排除了。