Question

我使用scrapy并需要提取＆＃34;灰色/灰色＆＃34;使用xpath选择器。这是html代码段：

<div class="Vehicle-Overview">
    <div class="Txt-YMM">
        2006 GMC Sierra 1500
    </div>
    <div class="Txt-Price">
        Price :                                     $8,499
    </div>

    <table width="100%" border="0" cellpadding="0" cellspacing="0" 
    class="Table-Specs">
        <tr>
            <td>
                <strong>2006 GMC Sierra 1500 Crew Cab 143.5 WB 4WD 
                SLE</strong>
                <strong class="text-right t-none"></strong>
            </td>
        </tr>
        <tr>
            <td>
                <strong>Gray / Gray</strong><br />
                <strong>209,123 
                            Miles

                                  / VIN: XXXXXXXXXX

            </td>
       </tr>
</table>

我试图提取＆＃34;灰色/灰色＆＃34;在＆＃34;强大的＆＃34;标签。任何帮助表示赞赏。

Answer 1

此XPath将在Scrapy以及Google / Firefox Developer的控制台中使用：

//div[@class='Vehicle-Overview']/table[@class='Table-Specs']//tr[2]/td[1]/strong[1]/text()

您可以在蜘蛛中使用此代码：

color = response.xpath("//div[@class='Vehicle-Overview']/table[@class='Table-Specs']//tr[2]/td[1]/strong[1]/text()").extract_first()

Answer 2

您可以将此XPath表达式与示例XML / HTML一起使用：

//div[@class='Vehicle-Overview']/table[@class='Table-Specs']/tr[2]/td[1]/strong[1]

完整的XPath，给出下面提到的关于命名空间的完整文件＆＃34; http://www.w3.org/1999/xhtml＆＃34;可以

/html/body/div/div/div[@class='content-bg']/div/div/div[@class='Vehicle-Overview']/table[@class='Table-Specs']/tr[2]/td[1]/strong[1]

xpath：如何在＆＃34; strong＆＃34;中提取文本？标签？

2 个答案: