doc = Jsoup.parse(html2);
Whitelist whitelist = new Whitelist();
Whitelist.simpleText().addTags("table","td", "tr","tbody");
Cleaner cleaner = new Cleaner(whitelist);
cleaner.clean(doc);
Elements gradesTableBody = doc.select(".user-grade").select("tbody").select("tr");
Log.d("output", gradesTableBody.html());
The output HTML still contains <img> tags and so on. The cleaner isn't working. What am I doing wrong?
Answer 0 (score: 3)
Cleaner.clean returns a new document, which you are currently throwing away.
You need to assign the result:
doc = cleaner.clean(doc);
^^^^^
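Also, these lines don't do what you intend. The first creates an empty whitelist, and the second builds a configured one but never assigns it: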
Whitelist whitelist = new Whitelist();
Whitelist.simpleText().addTags("table","td", "tr","tbody");
I think you probably meant:
Whitelist whitelist = Whitelist.simpleText().addTags("table","td", "tr","tbody");
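Putting both fixes together, a minimal sketch might look like the following. The class name, the cleanGradeRows helper, and the sample HTML are hypothetical and only there for illustration. One assumption beyond the answer above: as far as I can tell, the cleaner also drops any attribute the whitelist does not allow, so the sketch permits class on table to keep the .user-grade selector matching after cleaning. Newer jsoup releases rename Whitelist to Safelist, so adjust the import if your version lacks Whitelist.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.safety.Cleaner;
import org.jsoup.safety.Whitelist;
import org.jsoup.select.Elements;

public class CleanerExample {

    // Hypothetical helper: cleans the HTML and returns the inner HTML of the grade rows.
    static String cleanGradeRows(String html) {
        Document dirty = Jsoup.parse(html);

        // Build the whitelist in one chained expression and keep the result.
        Whitelist whitelist = Whitelist.simpleText()
                .addTags("table", "tbody", "tr", "td")
                // Assumption: keep the class attribute so ".user-grade" still matches.
                .addAttributes("table", "class");

        // clean() returns a new Document; the dirty one is left unchanged.
        Document clean = new Cleaner(whitelist).clean(dirty);

        Elements rows = clean.select(".user-grade").select("tbody").select("tr");
        return rows.html();
    }

    public static void main(String[] args) {
        // Hypothetical sample input containing an <img> that should be stripped.
        String html = "<table class=\"user-grade\"><tbody><tr>"
                + "<td><img src=\"a.png\">Math</td><td>90</td>"
                + "</tr></tbody></table>";
        System.out.println(cleanGradeRows(html));
    }
}
```

The key points are that the return value of clean is what gets queried afterwards, and that the whitelist variable holds the configured instance rather than an empty one.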