How to scrape product pages in Samsung category page?

This topic is: resolved

This topic has 1 reply, 2 voices, and was last updated 10 months, 3 weeks ago by Szabi – CodeRevolution.

    • #9785


      acluke
      Participant
      Post count: 11

      https://www.samsung.com/tw/refrigerators/all-refrigerators/

      Hi,
      I can’t scrape the product list on this page, and I don’t know whether that is because of JavaScript or lazy loading.

      Could you please take a look and share some tips?

      Thanks,
      Luke

    • #9786


      Szabi – CodeRevolution
      Keymaster
      Post count: 4620

      Hello,

      I checked, and yes: this page lazy-loads its products using JavaScript. The products are not present in the original HTML of the page; they are loaded in the user’s browser by JavaScript when the page loads.
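One way to confirm this kind of diagnosis is to check whether the product-link class appears anywhere in the raw HTML the server returns. The snippet below is a minimal sketch using only Python's standard library. The two HTML strings are made-up stand-ins (not Samsung's real markup) for "what the server sends" and "what the browser shows after JavaScript runs", and the helper names `ClassFinder` and `class_in_static_html` are hypothetical.

```python
from html.parser import HTMLParser


class ClassFinder(HTMLParser):
    """Records whether any start tag carries the target CSS class."""

    def __init__(self, target_class: str) -> None:
        super().__init__()
        self.target_class = target_class
        self.found = False

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            # A class attribute may hold several space-separated class names.
            if name == "class" and value and self.target_class in value.split():
                self.found = True


def class_in_static_html(html: str, target_class: str) -> bool:
    """Return True if some element in `html` has `target_class`."""
    finder = ClassFinder(target_class)
    finder.feed(html)
    finder.close()
    return finder.found


PRODUCT_LINK_CLASS = "pd03-product-card__product-name-link"

# Hypothetical stand-in for the HTML as delivered by the server:
# an empty container that a script fills in later.
static_html = (
    '<html><body><div id="product-list"></div>'
    '<script src="app.js"></script></body></html>'
)

# Hypothetical stand-in for the DOM after JavaScript has run.
rendered_html = (
    '<html><body><div id="product-list">'
    '<a class="pd03-product-card__product-name-link" '
    'href="/tw/refrigerators/example/">Example</a>'
    '</div></body></html>'
)

print(class_in_static_html(static_html, PRODUCT_LINK_CLASS))    # False
print(class_in_static_html(rendered_html, PRODUCT_LINK_CLASS))  # True
```

If the same check on the real downloaded page source returns False while the links are visible in the browser, a plain HTTP scraper has nothing to match and a JavaScript-rendering step is needed.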

      You can still scrape these products, but only if you install Puppeteer on your server and set up Echo RSS to use it for scraping, since Puppeteer can render the JavaScript on the page.
      Installation tutorial video: https://www.youtube.com/watch?v=pRUDcSOe724
      Afterwards, you can use the settings below to crawl the links from the above URL:

      Seed Page Crawling Query Type
      XPath

      Seed Page Crawling Query String
      //*[@class='pd03-product-card__product-name-link']
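To illustrate what this query selects, here is a small sketch that runs an equivalent expression over a made-up, already-rendered fragment using Python's standard `xml.etree.ElementTree`. The sample markup, its URLs, and the `extract_links` helper are assumptions for illustration only; this is not how Echo RSS evaluates the query internally. ElementTree requires a leading `.` for a descendant search from an element, so the expression is written as `.//*[...]`. Note also that `@class='...'` compares the whole attribute value, so it matches only elements whose class attribute is exactly this string.

```python
import xml.etree.ElementTree as ET

# Hypothetical rendered fragment; the real page structure may differ.
SAMPLE = """
<div>
  <a class="pd03-product-card__product-name-link" href="/tw/refrigerators/example-a/">Example A</a>
  <a class="other-link" href="/tw/support/">Support</a>
  <a class="pd03-product-card__product-name-link" href="/tw/refrigerators/example-b/">Example B</a>
</div>
"""

# Same predicate as the seed query, with the leading "." ElementTree needs.
QUERY = ".//*[@class='pd03-product-card__product-name-link']"


def extract_links(markup: str, query: str = QUERY) -> list[str]:
    """Return the href of every element matched by `query`."""
    root = ET.fromstring(markup.strip())
    return [el.get("href") for el in root.findall(query) if el.get("href")]


for href in extract_links(SAMPLE):
    print(href)
```

Only the two product links are returned; the link with a different class is skipped.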

      Regards, Szabi – CodeRevolution.
