How to crawl the image in data URI format?

This topic is: resolved

 

Thank you for contacting me. Please note that I live in the GMT+3 time zone - responses might be delayed by this.

This topic has 1 reply, 2 voices, and was last updated 9 months, 3 weeks ago by Szabi – CodeRevolution.

Viewing 1 reply thread
  • Author
    Posts
    • #9978


      acluke
      Participant
      Post count: 11

      For example: https://www.tatung.com/Product/Consumer/1408

      the image URL prefix is : data:image/jpeg;base64

      Please advise, thanks for the help.

    • #9979


      Szabi – CodeRevolution
      Keymaster
      Post count: 4620

      Hello,

      This page uses JavaScript to render its content, because of this, the plugin is not able to grab contents from it by default. To make this work, you need to install Puppeteer on your server (a headless browser, which is able to render JavaScript) and set up Crawlomatic to use it.

      Tutorial video on Puppeteer install: https://www.youtube.com/watch?v=pRUDcSOe724

      Crawlomatic setup with Puppeteer:

      Afterwards, you can set the bellow settings in rules and image scraping will work:

      Featured Image Query Type
      Class

      Featured Image Query String
      elt_1

      Regards, Szabi – CodeRevolution.

Viewing 1 reply thread

The topic ‘How to crawl the image in data URI format?’ is closed to new replies.