How to resolve pymysql.err.ProgrammingError: 1064 when saving data to MySQL?

Asked: 2018-08-02 18:59:12

Tags: python mysql scrapy

I'm trying to save data scraped with Scrapy to MySQL. However, I've run into these problems:

  1. MySQLdb is no longer supported, so I have to add the following to the settings.py file:

    import pymysql

    pymysql.install_as_MySQLdb()

  2. %s is deprecated on Python 3, so I have to use .format, as in the following code:

def close(self, reason):
        csv_file = max(glob.iglob('*.csv'), key=os.path.getctime)       
        mydb = MySQLdb.connect(host='localhost',
                               user='demo',
                               passwd='123456',
                               db='testdb')
        cursor = mydb.cursor()

        csv_data = csv.reader(open(csv_file))

        row_count = 0
        for row in csv_data:
            if row_count != 0:
                cursor.execute("INSERT IGNORE INTO testtb(product, category) VALUES('{}','{}')".format(*row))
            row_count += 1

        mydb.commit()
        cursor.close()
  

I get the following error:

<bound method AutorSpider.close of <AutorSpider 'autor' at 0x7f64725d29b0>>

Traceback (most recent call last):
  File "/home/pc/.local/lib/python3.6/site-packages/twisted/internet/defer.py", line 151, in maybeDeferred
    result = f(*args, **kw)
  File "/home/pc/.local/lib/python3.6/site-packages/pydispatch/robustapply.py", line 55, in robustApply
    return receiver(*arguments, **named)
  File "/home/pc/Escritorio/fpyautor/fpyautor/spiders/autor.py", line 109, in close
    cursor.execute("INSERT IGNORE INTO autortb(frase, categoria) VALUES({},'{}')'".format(*row))
  File "/home/pc/.local/lib/python3.6/site-packages/pymysql/cursors.py", line 170, in execute
    result = self._query(query)
  File "/home/pc/.local/lib/python3.6/site-packages/pymysql/cursors.py", line 328, in _query
    conn.query(q)
  File "/home/pc/.local/lib/python3.6/site-packages/pymysql/connections.py", line 516, in query
    self._affected_rows = self._read_query_result(unbuffered=unbuffered)
  File "/home/pc/.local/lib/python3.6/site-packages/pymysql/connections.py", line 727, in _read_query_result
    result.read()
  File "/home/pc/.local/lib/python3.6/site-packages/pymysql/connections.py", line 1066, in read
    first_packet = self.connection._read_packet()
  File "/home/pc/.local/lib/python3.6/site-packages/pymysql/connections.py", line 683, in _read_packet
    packet.check_error()
  File "/home/pc/.local/lib/python3.6/site-packages/pymysql/protocol.py", line 220, in check_error
    err.raise_mysql_exception(self._data)
  File "/home/pc/.local/lib/python3.6/site-packages/pymysql/err.py", line 109, in raise_mysql_exception
    raise errorclass(errno, errval)
pymysql.err.ProgrammingError: (1064, "You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near 'titulo del item numero 1' at line 1")

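Is there a simpler or more efficient way to do this? I'm saving the data only when the scraping task ends, so if I get many more results (say 3000 items) on larger sites, could that cause problems in the future?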

1 Answer:

Answer 0: (score: 0)

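The 1064 error appears because the values are pasted into the SQL text unescaped (in the traceback, the first placeholder is not even quoted), so any quote or space in the data breaks the statement. Escaping each value with escape_string before building the query can help: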

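# Assumes csv, glob, os and MySQLdb are imported at the top of the spider module.
def close(self, reason):
    # Pick the most recently created CSV file in the working directory.
    csv_file = max(glob.iglob('*.csv'), key=os.path.getctime)
    mydb = MySQLdb.connect(host='localhost',
                           user='demo',
                           passwd='123456',
                           db='testdb')
    cursor = mydb.cursor()

    csv_data = csv.reader(open(csv_file))

    row_count = 0
    for row in csv_data:
        if row_count != 0:  # skip the header row
            # Escape quotes and other special characters in each value.
            product = mydb.escape_string(row[0])
            category = mydb.escape_string(row[1])
            sql = 'INSERT IGNORE INTO testtb(product, category) VALUES ("{}","{}")'.format(product, category)
            cursor.execute(sql)
        # Count every row, header included; otherwise row_count stays 0 and nothing is inserted.
        row_count += 1

    mydb.commit()
    cursor.close()
    mydb.close()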
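
A parameterized query may be a simpler alternative to escaping by hand. In PyMySQL's cursor.execute() and cursor.executemany(), %s is a driver placeholder for a bound value, not Python string formatting, so it still works on Python 3 and the driver does the quoting. The sketch below is only illustrative: read_rows, save_rows and batch_size are hypothetical names, and the table and column names are taken from the question.

```python
import csv

# %s here is a DB-API placeholder filled in by the driver, not Python's % operator.
INSERT_SQL = "INSERT IGNORE INTO testtb (product, category) VALUES (%s, %s)"


def read_rows(csv_stream):
    """Return (product, category) tuples from an open CSV stream, skipping the header."""
    reader = csv.reader(csv_stream)
    next(reader, None)  # skip the header row, if there is one
    return [(row[0], row[1]) for row in reader if len(row) >= 2]


def save_rows(connection, rows, batch_size=1000):
    """Insert rows in batches through any DB-API connection (hypothetical helper)."""
    cursor = connection.cursor()
    try:
        for start in range(0, len(rows), batch_size):
            # The driver escapes every bound value, so quotes in the data are safe.
            cursor.executemany(INSERT_SQL, rows[start:start + batch_size])
        connection.commit()
    finally:
        cursor.close()
    return len(rows)


# Usage against a real server (not run here, since it needs a MySQL instance):
#   import pymysql
#   connection = pymysql.connect(host='localhost', user='demo',
#                                password='123456', database='testdb')
#   with open(csv_file, newline='') as f:
#       save_rows(connection, read_rows(f))
#   connection.close()
```

The same INSERT could also be issued per item from a Scrapy item pipeline's process_item method, which would avoid holding all results until the spider closes.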