Jsoup清洁剂没有删除元素

时间:2012-06-12 14:36:01

标签: java jsoup

doc = Jsoup.parse(html2);
Whitelist whitelist = new Whitelist();
Whitelist.simpleText().addTags("table","td", "tr","tbody");
Cleaner cleaner = new Cleaner(whitelist);
cleaner.clean(doc);

Elements gradesTableBody = doc.select(".user-grade").select("tbody").select("tr");
Log.d("output", gradesTableBody.html());

输出html仍然包含<img>标签等。清洁工不起作用。我做错了什么?

1 个答案:

答案 0 :(得分:3)

Cleaner.clean会返回文档(您目前只是扔掉了)。

你需要做

doc = cleaner.clean(doc);
^^^^^

此外,这些界限没有意义:

Whitelist whitelist = new Whitelist();
Whitelist.simpleText().addTags("table","td", "tr","tbody");

我想你可能想做

Whitelist whitelist = Whitelist.simpleText().addTags("table","td", "tr","tbody");