单击按钮时,木偶戏无法正常工作

时间:2020-05-15 14:45:24

标签: javascript node.js web-scraping web-crawler puppeteer

我的问题是我需要将评论选择器设置为“所有评论”,但该伪造者单击正确的按钮“所有评论”后,评论不呈现,评论部分消失,将提供运行中的浏览器的代码和视频。

const $ = require('cheerio');
const puppeteer = require('puppeteer');
const url = 'https://www.facebook.com/pg/SamsungGlobal/posts/';







const main = async () => {
  const browser = await puppeteer.launch({
    headless: false,
    args: ['--no-sandbox', '--disable-setuid-sandbox']
  });
  const page = await browser.newPage();
  await page.setViewport({
    width: 1920,
    height: 1080
  });
  await page.goto(url, {
    waitUntil: 'networkidle2',
    timeout: 0
  });
  page.mouse.click(50, 540, {});
  for (var a = 0; a < 18; a++) {
    setTimeout(() => {}, 16);
    await page.keyboard.press('ArrowDown');
  }
  let bodyHTML = await page.evaluate(() => document.body.innerHTML);   
  var id = "#" + $("._427x ._4-u2.mbm._4mrt", bodyHTML).attr('id');      // selects id of first post
  try {
    var exp = await page.$(`${id} a._21q1`);   // clicks on "most relevant" from the first post 
    await exp.evaluate(exp => exp.click());
    await page.click('div[data-ordering="RANKED_UNFILTERED"]');    // selects "all the comments"
    var exp = await page.$(`${id} a._42ft`);         // should click on "more comments" but it doesn't load
    await exp.evaluate(exp => exp.click());
    await page.waitForSelector(`${id} a._5v47.fss`);       // wait for the "others" in facebook comments
    var exp = await page.$$(`${id} a._5v47.fss`);
    await exp.evaluate(exp => exp.click());
    await page.screenshot({
      path: "./srn4.png"
    });
    // var post = await page.$eval(id + " .userContentWrapper", el => el.innerHTML);
    // console.log("that's the  post " + post);
  } catch (e) {
    console.log(e);
  }
  setTimeout(async function() {
    await browser.close();     //close after some time
  }, 1500);
};


main(); 

这是完整执行过程的视频:https://youtu.be/jXpSOBfVskg 单击菜单https://youtu.be/1OgfFNokxsA

时,这是一个慢镜头。

2 个答案:

答案 0 :(得分:1)

您可以尝试使用选择器的变体:

'use strict';

const puppeteer = require('puppeteer');

(async function main() {
  try {
    const browser = await puppeteer.launch({ headless: false });
    const [page] = await browser.pages();

    await page.goto('https://www.facebook.com/pg/SamsungGlobal/posts/');

    await page.waitForSelector('[data-ordering="RANKED_THREADED"]');
    await page.click('[data-ordering="RANKED_THREADED"]');

    await page.waitForSelector('[data-ordering="RANKED_UNFILTERED"]');
    await page.click('[data-ordering="RANKED_UNFILTERED"]');
  } catch (err) {
    console.error(err);
  }
})();

答案 1 :(得分:0)

page.mouse.click(50, 540, {});

这不一定行得通。您想点击什么?您需要使用CSS选择器来查找要单击的元素。

此外,动态元素可能不会立即显示在页面中。您应根据需要使用waitForSelector