Scrape issue, need help here

This topic is: resolved

 

Thank you for contacting me. Please note that I live in the GMT+3 time zone - responses might be delayed by this.

This topic has 1 reply, 2 voices, and was last updated 2 years, 4 months ago by Szabi – CodeRevolution.

Viewing 1 reply thread
  • Author
    Posts
    • #5473


      Skimm3r
      Participant

      Hello,

      I would like to scrape the content from this website: https://www.bhg.com/home-improvement/remodeling/carpentry/all-about-walls-and-ceilings/

      But I have trouble to get things right, please help to provide the correct settings.

    • #5475


      Szabi – CodeRevolution
      Keymaster
      Post count: 4577

      Hello,

      First of all, thank you for your purchase.

      Please check plugin settings below (as mentioned also in the email you sent me):

       

      Scraper Start (Seed) URL / Keywords

      https://www.bhg.com/decorating/small-spaces/homes/french-vintage-style-tiny-texas-home/

      Run Regex On Content:

      <div(?:[^>]*?)src=”([^”]*?)”(?:[\s\S]*?)<\/div>

      <div(?:[^>]*?)class=”lazy-image__loadingPlaceholder”(?:[^>]*?)>(?:[\s\S]*?)</div>

       

      Replace Matches From Regex (Content):

      <img src=”$1″>

      <span></span>

       

      Do Not Scrape Seed URL:

      UNchecked

       

      Seed Page Crawling Query Type:

      Disabled

       

      Content Query Type

      XPath

       

      Content Query String

      //*[@class=’article-content-container’]

       

      Lazy Loading Images HTML Tag:

      data-src

       

      Regards, Szabi – CodeRevolution.

Viewing 1 reply thread

The topic ‘Scrape issue, need help here’ is closed to new replies.