Question

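I am trying to scrape this website. My goal is to collect the 10 most recent results (win/loss/draw) for any team; I am only using this particular team as an example. The source of a single row is: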

<tr class="odd      match no-date-repetition" data-timestamp="1515864600" id="page_team_1_block_team_matches_3_match-2463021" data-competition="8">
        <td class="day no-repetition">Sat</td>


        <td class="full-date" nowrap="nowrap">13/01/18</td>
        <td class="competition"><a href="/national/england/premier-league/20172018/regular-season/r41547/" title="Premier League">PRL</a></td>

          <td class="team team-a ">
              <a href="/teams/england/tottenham-hotspur-football-club/675/" title="Tottenham Hotspur">
                Tottenham Hotspur
              </a>
          </td>

        <td class="score-time score">
          <a href="/matches/2018/01/13/england/premier-league/tottenham-hotspur-football-club/everton-football-club/2463021/" class="result-win">

            4 - 0

          </a>
        </td>
          <td class="team team-b ">
            <a href="/teams/england/everton-football-club/674/" title="Everton">
              Everton
            </a>
          </td>
        <td class="events-button button first-occur">
            <a href="/matches/2018/01/13/england/premier-league/tottenham-hotspur-football-club/everton-football-club/2463021/#events" title="View events" class="events-button-button ">View events</a>
        </td>

          <td class="info-button button">

              <a href="/matches/2018/01/13/england/premier-league/tottenham-hotspur-football-club/everton-football-club/2463021/" title="More info">More info</a>



          </td>




      </tr>

As you can see, the result is stored in <td class="score-time score">. My knowledge of Python and web scraping is very limited, so my current code is:

import bs4
import requests

# soccerwayURL holds the address of the team page (defined elsewhere)
res2 = requests.get(soccerwayURL)
soup2 = bs4.BeautifulSoup(res2.text, 'html.parser')
elems2 = soup2.select('#page_team_1_block_team_matches_3_match-2463021 > td.score-time.score')
print(elems2[0].text.strip())

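This prints '4 - 0'. That is fine, but the problem appears when I try to access another row. The 7-digit number (2463021 in the example above) is unique to that row. This means that if I want the score from a different row, I have to find its unique 7-digit number and put it into the CSS selector '#page_team_1_block_team_matches_3_match-******* > td.score-time.score', where the asterisks stand for the unique number.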
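One possible way around the per-row ID, offered only as a sketch: a CSS attribute selector with `^=` matches every element whose `id` starts with a given prefix, so the 7-digit suffix never has to be known. The example below parses a trimmed inline stand-in for the page rather than the live site; the second row, its ID, and its `result-loss` class are made up for illustration, and the helper name `scores_by_id_prefix` is hypothetical.

```python
import bs4

# Trimmed stand-in for the real page. The second row (its id and its
# "result-loss" class) is invented for illustration only.
SAMPLE_HTML = """
<table>
  <tr class="odd match" id="page_team_1_block_team_matches_3_match-2463021">
    <td class="score-time score"><a href="#" class="result-win"> 4 - 0 </a></td>
  </tr>
  <tr class="even match" id="page_team_1_block_team_matches_3_match-0000001">
    <td class="score-time score"><a href="#" class="result-loss"> 0 - 1 </a></td>
  </tr>
</table>
"""


def scores_by_id_prefix(html, prefix="page_team_1_block_team_matches_3_match-"):
    """Return the score text of every row whose id starts with `prefix`."""
    soup = bs4.BeautifulSoup(html, "html.parser")
    # [id^="..."] means "id attribute begins with ...", so the unique
    # numeric suffix of each row does not need to be spelled out.
    selector = 'tr[id^="{}"] > td.score-time.score'.format(prefix)
    return [cell.get_text(strip=True) for cell in soup.select(selector)]


print(scores_by_id_prefix(SAMPLE_HTML))
```

With the live page, `SAMPLE_HTML` would be replaced by `res2.text`; whether every match row on the real page shares this exact ID prefix is an assumption based on the single row shown above.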

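The online course I took only showed how to reference content through CSS selectors, so I am not sure how to retrieve the scores without manually writing a CSS selector for every row.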

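Inside the <td class="score-time score"> cell there is a link that carries another class, class="result-win". Ideally I would like to extract "result-win", because I am not looking for the score of the match; I only want to know whether the result was a win, a loss, or a draw.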
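A minimal sketch of reading that class instead of the score, assuming each match row carries the `match` class as in the sample row. BeautifulSoup exposes an element's `class` attribute as a list of names, so the link inside the score cell can be inspected directly. Only `result-win` appears in the row shown above; the `result-loss` and `result-draw` names in the sample data, and the helper name `match_outcomes`, are assumptions made for illustration.

```python
import bs4

# Inline stand-in for the page. Only "result-win" is confirmed by the real
# row; "result-loss" and "result-draw" are assumed class names.
SAMPLE_HTML = """
<table>
  <tr class="odd match" id="row-a">
    <td class="score-time score"><a href="#" class="result-win"> 4 - 0 </a></td>
  </tr>
  <tr class="even match" id="row-b">
    <td class="score-time score"><a href="#" class="result-loss"> 0 - 1 </a></td>
  </tr>
  <tr class="odd match" id="row-c">
    <td class="score-time score"><a href="#" class="result-draw"> 1 - 1 </a></td>
  </tr>
</table>
"""


def match_outcomes(html):
    """Return the outcome of each match row, taken from the link's class."""
    soup = bs4.BeautifulSoup(html, "html.parser")
    outcomes = []
    # Select by the shared classes instead of the per-row id.
    for link in soup.select("tr.match td.score-time.score a"):
        # link.get("class") is a list such as ["result-win"].
        for name in link.get("class", []):
            if name.startswith("result-"):
                outcomes.append(name[len("result-"):])
    return outcomes


print(match_outcomes(SAMPLE_HTML))
```

Limiting this to the 10 most recent matches would be a matter of slicing the returned list, though which end holds the newest rows depends on how the real page orders them, which the single sample row does not show.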

I hope this is reasonably clear. Thank you very much for your patience.

From rows with a unique ID

0 answers: