Question

import pandas as pd

url = "http://www.espn.com/nba/standings"
dfs = pd.read_html(url, header=None)

dfs[1]

This results in:

1* --MILMilwaukee Bucks
0   2y --TORToronto Raptors
1   3x --PHIPhiladelphia 76ers
2   4x --BOSBoston Celtics
3   5x --INDIndiana Pacers

1* --MILMilwaukee Bucks should not be the column name.

I think I am doing something wrong (I have not used pandas in a while), but from what I have read, header = None should work.

Answer 1

I tried this too, and header = None does not work in my case either (I am still looking for the reason why). Instead, you can use header = 0, which works fine.

import pandas as pd

data = pd.read_html("test.html", header=0)
print(data)

**Output:**

[  Programming Language             Creator  Year
0                    C      Dennis Ritchie  1972
1               Python    Guido Van Rossum  1989
2                 Ruby  Yukihiro Matsumoto  1995]

This will work for you. ;)
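If the first data row still ends up as the column label, as with the ESPN table in the question, one possible workaround is to move the column labels back into the data after parsing. As far as I understand it, when header is None, read_html falls back to the table's own `<thead>`/`<th>` markup to pick the header, which would explain the behaviour. The sketch below uses a hand-built DataFrame as a stand-in for dfs[1], and demote_header is a hypothetical helper name, not part of pandas.

```python
import pandas as pd


def demote_header(df: pd.DataFrame) -> pd.DataFrame:
    """Return a new frame whose first row holds df's old column labels.

    Hypothetical helper for illustration; not a pandas function.
    """
    # One-row frame that carries the old column labels as ordinary data.
    header_row = pd.DataFrame([list(df.columns)])
    # Copy of the body with positional labels 0..n-1 so it aligns with header_row.
    body = df.copy()
    body.columns = range(df.shape[1])
    # Stack the former header on top of the body and renumber the index.
    return pd.concat([header_row, body], ignore_index=True)


# Stand-in for dfs[1]: the first team was promoted to the column label.
parsed = pd.DataFrame(
    {
        "1* --MILMilwaukee Bucks": [
            "2y --TORToronto Raptors",
            "3x --PHIPhiladelphia 76ers",
        ]
    }
)

fixed = demote_header(parsed)
print(fixed)
```

With the real page you would pass dfs[1] to demote_header instead of the stand-in frame, then assign whatever column names you want.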

read_html still takes the first row as the column header names despite header = None
