使用Argparse在Python中创建文件转换器

时间:2013-06-10 15:22:22

标签: python xml csv command-line argparse

我必须使用命令提示符和python以csv文件的形式接收输入,然后读取它并将其转换为与csv文件同名的xml文件,除了.xml文件扩展名或者用户可以使用-o --output可选命令行参数设置输出文件名和路径。好吧,我已经在谷歌搜索了几天,到目前为止,我的程序允许我输入命令行参数,我可以将csv转换为xml文件,但它不会使用与csv文件相同的名称或用户打印它设置名称。相反,它只打印出一个空白文件。这是我的代码:

    import sys, argparse
    import csv
    import indent
    from xml.etree.ElementTree import ElementTree, Element, SubElement, Comment, tostring

    parser=argparse.ArgumentParser(description='Convert wordlist text files to various formats.', prog='Text Converter')
    parser.add_argument('-v','--verbose',action='store_true',dest='verbose',help='Increases messages being printed to stdout')
    parser.add_argument('-c','--csv',action='store_true',dest='readcsv',help='Reads CSV file and converts to XML file with same name')
    parser.add_argument('-x','--xml',action='store_true',dest='toxml',help='Convert CSV to XML with different name')
    parser.add_argument('-i','--inputfile',type=argparse.FileType('r'),dest='inputfile',help='Name of file to be imported',required=True)
    parser.add_argument('-o','--outputfile',type=argparse.FileType('w'),dest='outputfile',help='Output file name')
    args = parser.parse_args()

    def main(argv):
        reader = read_csv()
        if args.verbose: 
            print ('Verbose Selected')
        if args.toxml:
            if args.verbose:
               print ('Convert to XML Selected')
            generate_xml(reader)
        if args.readcsv:
            if args.verbose:
                print ('Reading CSV file')
            read_csv()
        if not (args.toxml or args.readcsv):
            parser.error('No action requested')
        return 1

    def read_csv():
        with open ('1250_12.csv', 'r') as data:
            return list(csv.reader(data))

    def generate_xml(reader):
        root = Element('Solution')
        root.set('version','1.0')
        tree = ElementTree(root)

        head = SubElement(root, 'DrillHoles')
        head.set('total_holes', '238')

        description = SubElement(head,'description')
        current_group = None
        i = 0
        for row in reader:
            if i > 0:
                x1,y1,z1,x2,y2,z2,cost = row
                if current_group is None or i != current_group.text:
                    current_group = SubElement(description, 'hole',{'hole_id':"%s"%i})

                    collar = SubElement (current_group, 'collar',{'':', '.join((x1,y1,z1))}),
                    toe = SubElement (current_group, 'toe',{'':', '.join((x2,y2,z2))})                                       
                    cost = SubElement(current_group, 'cost',{'':cost})
            i+=1    
        indent.indent(root)
        tree.write(open('hole.xml','w'))
    if (__name__ == "__main__"):

sys.exit(main(sys.argv))

对于generate_xml()函数,你可以忽略它,因为它接受以某种方式格式化的csv文件,所以你可能不理解它,但我认为问题在于tree.write(),因为该部分生成xml文件在代码本身中编写的名称,而不是命令提示符下的参数。

2 个答案:

答案 0 :(得分:1)

您需要将文件参数传递给generate_xml()。您似乎在args.outputfile中有输出文件。

generate_xml(reader, args.outputfile)

...
def generate_xml(reader, outfile):
    ...
    tree.write(outfile)

您可能还应该使用args.inputfile

reader = read_csv(args.inputfile)
...
def read_csv(inputfile):
    return list(csv.reader(inputfile))

这一行没有做任何有用的事情,它处理.csv文件,但不对结果做任何事情:

read_csv()

答案 1 :(得分:1)

以下代码改编自FB36的codeie on code.activestate.com

它将执行您需要的操作,您不必担心csv文件中的标头,尽管csv文件中应该只有一个标头(第一行)。如果要进行批量转换,请查看this page的底部。

'''Convert csv to xml file

csv2xml.py takes two arguments:  
 1. csvFile: name of the csv file (may need to specify path to file)
 2. xmlFile: name of the desired xml file (path to destination can be specified)

If only the csv file is provided, its name is used for the xml file. 

Command line usage: 
 example1: python csv2xml.py 'fileName.csv' 'desiredName.xml'
 example2: python csv2xml.py '/Documents/fileName.csv' '/NewFolder/desiredName.xml'
 example3: python csv2xml.py 'fileName.csv'

This code has been adapted from: http://code.activestate.com/recipes/577423/
'''

import csv

def converter(csvFile, xmlFile):
    csvData = csv.reader(open(csvFile))

    xmlData = open(xmlFile, 'w')
    xmlData.write('<?xml version="1.0"?>' + "\n")

    # there must be only one top-level tag
    xmlData.write('<csv_data>' + "\n")

    rowNum = 0
    for row in csvData:
        if rowNum == 0:
            tags = row
            # replace spaces w/ underscores in tag names
            for i in range(len(tags)):
                tags[i] = tags[i].replace(' ', '_')
        else: 
            xmlData.write('<row>' + "\n")
            for i in range(len(tags)):
                xmlData.write('    ' + '<' + tags[i] + '>' \
                              + row[i] + '</' + tags[i] + '>' + "\n")
            xmlData.write('</row>' + "\n")

        rowNum +=1

    xmlData.write('</csv_data>' + "\n")
    xmlData.close()

## for using csv2xml.py from the command line
if __name__ == '__main__':
    import sys

    if len(sys.argv)==2:
        import os
        csvFile = sys.argv[1]
        xmlFile = os.path.splitext(csvFile)[0] + '.xml'
        converter(csvFile,xmlFile)
    elif len(sys.argv)==3:
        csvFile = sys.argv[1]
        xmlFile = sys.argv[2]   
        converter(csvFile,xmlFile)
    else:
        print __doc__