使用PHP和SimpleDOM或SimpleXML修改html列表

时间:2011-03-03 18:01:43

标签: php html-parsing simplexml simpledom

我有一个HTML结构:

<li class="level1 item3 parent">
    <a href="mainlink.html" class="level1 item3 parent">
        <span>
            <span class="title">Main title</span>
            <span class="sub">Subtitle</span>
        </span>
    </a>
    <ul class="level2">
        <li class="level2 item1 first">
            <a href="alink.html" class="level2 item1 first"><span>Title 1</span></a>
        </li>
        <li class="level2 item2">
            <a href="alink.html" class="level2 item2"><span>Title 2</span></a>
        </li>
        <li class="level2 item3">
            <a href="alink.html" class="level2 item3"><span>Title 3</span></a>
        </li>
        <li class="level2 item4 last">
            <a href="alink.html" class="level2 item4 last"><span>Title 4</span></a>
        </li>
    </ul>
</li>

我想将其改为:

<li class="level1 item3 parent">
    <a href="mainlink.html" class="level1 item3 parent">
        <span>
            <span class="title">Main title</span>
            <span class="sub">Subtitle</span>
        </span>
    </a>
    <ul class="level2">
        <li class="level2 item1 first">
            <a href="mainlink.html" class="level2 item1 first"><span>Main title</span></a>
        </li>
        <li class="level2 item1">
            <a href="alink.html" class="level2 item2"><span>Title 1</span></a>
        </li>
        <li class="level2 item2">
            <a href="alink.html" class="level2 item3"><span>Title 2</span></a>
        </li>
        <li class="level2 item3">
            <a href="alink.html" class="level2 item4"><span>Title 3</span></a>
        </li>
        <li class="level2 item4 last">
            <a href="alink.html" class="level2 item5 last"><span>Title 4</span></a>
        </li>
    </ul>
</li>

主要思想是在主要li元素内的ul开头添加一个新项目,克隆li元素标记标题的一部分。我不知道如何使用SimpleXML或SimpleDOM http://code.google.com/p/simpledom/来完成这项任务......任何想法?

的问候,

1 个答案:

答案 0 :(得分:0)

新版本:

$doc = new DomDocument;
$doc->loadHTML($html);

$li = $doc->getElementsByTagName('li')->item(0)->cloneNode(true);

foreach($li->getElementsByTagName('ul') as $ul) {
    $ul->parentNode->removeChild($ul);
}

$span = $li->getElementsByTagName('span')->item(0);
$span->nodeValue = $li->getElementsByTagName('span')->item(1)->nodeValue;
foreach($span->getElementsByTagName('a') as $a) {
    $a->setAttribute('class', 'level2 item1 first');
}

$ul = $doc->getElementsByTagName('ul')->item(0);

$nodes = $ul->getElementsByTagName('a');
$node = $ul->childNodes->item(0);

$i = 2;
foreach($nodes as $n) {
    $n->setAttribute('class', 'level2 item' . $i);
    $i++;
}

$ul->insertBefore($li, $node);

$nodes->item(1)->parentNode->setAttribute('class', 'level2 item1');

echo $doc->saveHTML();

http://codepad.org/pIwwK1Z5