使用Jsoup解析dl标记

时间:2017-04-19 13:59:39

标签: java html jsoup

我正在尝试使用Jsoup解析<dl>标记。 <dl>代码包含<dt>代码和<dd>代码。我有一个HashMap(HashMap<String, List<String>),其中我想将<dt>作为键,将<dd> s作为值(每<dd>有多个<dt>个标记} tag。

HTML看起来像这样:

<dl>
<dt>
    <span class="paramLabel">Parameters:</span>
</dt>
<dd>
    <code>y</code> - the ordinate coordinate
</dd>
<dd>
    <code>x</code> - the abscissa coordinate
</dd>
<dt>
    <span class="returnLabel">Returns:</span>
</dt>
<dd>
    the <i>theta</i> component of the point (<i>r</i>,&nbsp;<i>theta</i>) in polar coordinates that corresponds to the point (<i>x</i>,&nbsp;<i>y</i>) in Cartesian coordinates.
</dd>

我尝试过以下方法:

String title = "";
List<String> descriptions = new ArrayList<>();
for (int i = 0; i < children.size(); i++) {
    Element child = children.get(i);

    if(child.tagName().equalsIgnoreCase("dt")) {
        if(descriptions.size() != 0) {
            block.fields.put(title, descriptions);
            descriptions.clear();

        }

        title = child.text();
    }
    else if(child.tagName().equalsIgnoreCase("dd")) {
        descriptions.add(child.text());

        if(i == children.size() - 1) {
            block.fields.put(title, descriptions);
        }
    }
}

我希望得到这个:

 * Parameters -> y - the ordinate coordinate
 *               x - the abscissa coordinate
 * Returns    -> the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates.

但我得到了这个:

 * Parameters -> the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates.


 * Returns    -> the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates.

1 个答案:

答案 0 :(得分:2)

您需要在地图中插入描述列表的副本,目前您正在操作列表的1个实例。所以而不是:

block.fields.put(title, descriptions);

创建一个新列表,例如:

block.fields.put(title, new ArrayList<>(descriptions));