Question

我是python（3.4）的新手，并将html表解析为标题[]，行[]和单元格[] 我希望最终将这些中的每一个存储到一个表（MySQL）中，字段名称是标题中的项目[]

共有4个标题（＆＃34; data0＆＃34;，＆＃34; data1＆＃34;＆＃34; data2＆＃34;＆＃34; data3＆＃34;）有6行

使用BeautifulSoup来实现目标非常简陋：

import assert from 'assert';

function debug_assert(actual, expected, message = 'AssertionError'){
    if(process.env.NODE_ENV !== 'production'){
        assert.equal(actual, expected, message);
    }
}

我曾经做过CASE陈述，或者，天堂禁止做一些IF陈述。我通常把它们放在对于row.findAll中的单元格（＆＃39; td＆＃39;）。但是我会和一个柜台一起工作并做一些事情：

soup = BeautifulSoup(r.text)
table = soup.find("table")
cells = []
rows = table.findAll('tr')
headings = [th.get_text().strip() for th in table.findAll("th")]
for row in rows:
    for cell in row.findAll('td')
    cells .append(cell.get_text().strip())

我不太担心保存部分（但是）但是我无法绕过不能使用计数器。我意识到这是初学者的方式，但我应该感谢任何见解（我的大脑也是如此）

Python：基于头部循环遍历html表的每一行

0 个答案: