PyMySQL没有按预期插入

时间:2017-09-15 06:50:22

标签: python mysql insertion

我有一个表lucene_try,如下所示:

id || title    || comment
1  || Title 1  || Sentence 1 of Comment 1. Sentence 2 of Comment 1. 
2  || Title 1  || Sentence 1 of Comment 2. Sentence 2 of Comment 2.

我想要实现的是每个标题,标题的每个评论,将评论分成不同的句子,并将其放在lucene_rs中必须如此的不同表中:

id || title    || root_comment   || sub_comment
1  || Title 1  || Comment 1      || Sentence 1 
2  || Title 1  || Comment 1      || Sentence 2 
3  || Title 1  || Comment 2      || Sentence 1 
4  || Title 1  || Comment 2      || Sentence 2 

我已经为此编写了代码,当它在控制台/终端上打印时,它可以正常工作。它打印注释1的句子1和句子2以及注释2.但是,当我想插入这些数据时,它只打印并插入注释1的句子1和句子2。

这是我的代码:

import pymysql
import pymysql.cursors
import random
from bs4 import BeautifulSoup
import nltk
from nltk import tokenize
import pdb

conn = pymysql.connect(host='localhost', user='root', password='password', db='master_thesis', autocommit=True)
cursor = conn.cursor()
cursor.execute("SELECT * FROM lucene_counter WHERE count > 5 AND count <= 30")

lucene_rs_list = list()

for row in cursor:
    lucene_rs_list.append(row[1])

random.shuffle(lucene_rs_list)
final_list = lucene_rs_list[:1]

for i in range(len(final_list)):
    current_title = final_list[i]
    query = "SELECT title, comment FROM lucene_try WHERE title = %s"
    cursor.execute(query, final_list[i])
    for row in cursor:
        root_comment = BeautifulSoup(row[1], "lxml").text
        print("Root Title: ", current_title)
        print("Root Comment: ", root_comment)
        cleancomment = tokenize.sent_tokenize(root_comment)
        for j in range(len(cleancomment)):
            # THIS LINE PRINTS EVERYTHING PROPERLY WITH ALL THE COMMENTS AND SUBCOMMENTS IF CURSOR.EXECUTE IS COMMENTED OUT
            print("Sub Comment: ", cleancomment[j])
            # IF THE CURSOR.EXECUTE IS UNCOMMENTED, IT ONLY DISPLAYS RESULT OF THE FIRST ROOT_COMMENT AND NOT ALL
            cursor.execute("""INSERT INTO lucene_rs (title, root_comment, comment) VALUES ("%s", "%s", "%s")""" % (current_title, root_comment, cleancomment[j]))
        print("\n")

conn.close()

1 个答案:

答案 0 :(得分:1)

问题是您使用相同的游标执行INSERT查询,因为您正在使用SELECT查询获取结果。当您执行INSERT时,它不再包含SELECT的结果,因此循环的下一次迭代会停止。

SELECT的所有结果读入列表然后循环,或者使用不同的INSERT光标。

conn = pymysql.connect(host='localhost', user='root', password='password', db='master_thesis', autocommit=True)
cursor = conn.cursor()
cursor2 = conn.cursor()
cursor.execute("SELECT * FROM lucene_counter WHERE count > 5 AND count <= 30")

lucene_rs_list = list()

for row in cursor:
    lucene_rs_list.append(row[1])

random.shuffle(lucene_rs_list)
final_list = lucene_rs_list[:1]

for i in range(len(final_list)):
    current_title = final_list[i]
    query = "SELECT title, comment FROM lucene_try WHERE title = %s"
    cursor.execute(query, final_list[i])
    for row in cursor:
        root_comment = BeautifulSoup(row[1], "lxml").text
        print("Root Title: ", current_title)
        print("Root Comment: ", root_comment)
        cleancomment = tokenize.sent_tokenize(root_comment)
        for j in range(len(cleancomment)):
            # THIS LINE PRINTS EVERYTHING PROPERLY WITH ALL THE COMMENTS AND SUBCOMMENTS IF CURSOR.EXECUTE IS COMMENTED OUT
            print("Sub Comment: ", cleancomment[j])
            # IF THE CURSOR.EXECUTE IS UNCOMMENTED, IT ONLY DISPLAYS RESULT OF THE FIRST ROOT_COMMENT AND NOT ALL
            cursor2.execute("""INSERT INTO lucene_rs (title, root_comment, comment) VALUES ("%s", "%s", "%s")""" % (current_title, root_comment, cleancomment[j]))
        print("\n")

conn.close()