Question

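Suppose I have 1000 .CSV files named after my employees, so the file names have no order or numbering. Is there a way to tell the computer, in Python, to read every file in a particular folder from first to last, whatever the files are called? (It doesn't matter to me whose data is whose; I just need to get the data for analysis.)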

Answer 1

You can read all the CSV files in a directory like this:

My csv:

col1,col2,col3
a,b,c
d,e,f

Code:

import glob
import csv

PATH = "/Users/stack/"

for file in glob.glob(PATH+"*.csv"):
    with open(file) as csvfile:
        spamreader = csv.reader(csvfile, delimiter=',')
        for row in spamreader:
            print(" ".join(row))

Output:

col1 col2 col3
a b c
d e f

Process finished with exit code 0
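
If you want to keep the rows for analysis rather than print them, the same idea can be wrapped in a function. This is only a minimal sketch: `read_all_csv` is a name made up for illustration, and it assumes every file is a plain comma-separated file in the default encoding.

```python
import csv
from pathlib import Path


def read_all_csv(folder):
    """Return {file name: list of rows} for every .csv file in folder."""
    tables = {}
    # sorted() only makes the order repeatable; the names themselves don't matter
    for path in sorted(Path(folder).glob("*.csv")):
        # newline="" is what the csv module recommends when opening files
        with path.open(newline="") as csvfile:
            tables[path.name] = list(csv.reader(csvfile))
    return tables


if __name__ == "__main__":
    # "/Users/stack/" is the example folder used above
    for name, rows in read_all_csv("/Users/stack/").items():
        print(name, len(rows), "rows")
```

Note that on a case-sensitive file system the pattern `*.csv` will not match a file called `NAME.CSV`, so adjust the pattern if your files use the upper-case extension.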

Answer 2

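Yes. I'd use a simple regex-based test to check the files: essentially you loop over the directory with a for loop and use an if statement to test whether the file name contains '.csv'. After that we open the file and simply append its contents to our output, which you can then analyze or store in a file. I've commented out the option to write the output to a file, but you can enable it if you like.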

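import os
import re

# Redefine this to the path of your folder:
folderPath = "SET UNIX PATH HERE"

# Start from an empty string so the file contents can be appended with +=
output = ""
for file in os.listdir(folderPath):
    # The dot is escaped so it matches a literal '.', and $ anchors the match to the end of the name
    if re.search(r'\.csv$', file):
        # os.listdir returns bare file names, so join each one with the folder path
        with open(os.path.join(folderPath, file), "r") as readFile:
            output += readFile.read()

# Uncomment this part if you would like to store the output to a file
# Define the path to the file that will be created:
# outputFilePath = "SET UNIX PATH"
# with open(outputFilePath, "w+") as outputFile:
#     outputFile.write(output)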
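
One thing to watch with plain concatenation: if every file starts with a header row, that row is repeated once per file in the combined text. A rough sketch of one way around it follows. `merge_csv_text` is a hypothetical helper, and it assumes all files share the same header and that no field contains an embedded line break.

```python
import os


def merge_csv_text(folder_path):
    """Concatenate all .csv files in folder_path, keeping a single header line."""
    merged_lines = []
    header = None
    for name in sorted(os.listdir(folder_path)):
        # Accept both ".csv" and ".CSV"
        if not name.lower().endswith(".csv"):
            continue
        with open(os.path.join(folder_path, name), "r", newline="") as read_file:
            lines = read_file.read().splitlines()
        if not lines:
            continue  # skip empty files
        if header is None:
            # The first non-empty file supplies the header
            header = lines[0]
            merged_lines.append(header)
        # Assumption: every file starts with the same header row, so drop it
        merged_lines.extend(lines[1:])
    if not merged_lines:
        return ""
    return "\n".join(merged_lines) + "\n"
```

The returned string can be written to a file in the same way as `output` above.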

Hope this helps :)

Answer 3

Use code like the following (replace the current path, '.', with your own path):

import os, fnmatch
import csv
listOfFiles = os.listdir('.')  
pattern = "*.csv"  
for entry in listOfFiles:  
    if fnmatch.fnmatch(entry, pattern):
        with open(entry, newline='') as csvfile:
            spamreader = csv.reader(csvfile)
            for line in spamreader:
                print(line)

########## Using the pandas package

import os, fnmatch
import pandas as pd

listOfFiles = os.listdir('.')  
pattern = "*.csv"  
for entry in listOfFiles:  
    if fnmatch.fnmatch(entry, pattern):
        read_File_as_DF=pd.read_csv(entry)
        print(read_File_as_DF)
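
Since the goal is analysis, it may be more convenient to end up with one DataFrame instead of printing each file separately. A possible sketch, assuming all files have the same columns; `load_folder` and the `source_file` column are names invented here, not part of pandas:

```python
import glob
import os

import pandas as pd


def load_folder(folder):
    """Read every .csv file in folder into one combined DataFrame."""
    frames = []
    for path in sorted(glob.glob(os.path.join(folder, "*.csv"))):
        frame = pd.read_csv(path)
        # "source_file" is a hypothetical column recording where each row came from
        frame["source_file"] = os.path.basename(path)
        frames.append(frame)
    if not frames:
        # pd.concat raises on an empty list, so return an empty frame instead
        return pd.DataFrame()
    return pd.concat(frames, ignore_index=True)
```

Calling `load_folder('.')` reads the current directory, as in the loops above. The extra column can be dropped if you really don't care which file a row came from.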

How can I read many .CSV files with different names in Python?
