Please help scraping

This topic is: resolved

 

Viewing 7 reply threads
  • Author
    Posts
    • #4451


      teddychu2001
      Participant
      Post count: 9

      Is it possible to scrape https://www.webnots.com/post-sitemap3.xml? I received the following error message. Thanks for your help.

      [7-Jan-2022 10:29:16 UTC] Now crawling: https://www.webnots.com/post-sitemap3.xml
      [7-Jan-2022 10:29:16 UTC] [FATAL] Exit error: Uncaught Error: Call to undefined function vipnytt\split() in /home/sites/11a/3/38cf2f9eff/public_html/abc/wp-content/plugins/crawlomatic-multipage-scraper-post-generator/res/SitemapParser-master/src/SitemapParser.php:287
      Stack trace:
      #0 /home/sites/11a/3/38cf2f9eff/public_html/abc/wp-content/plugins/crawlomatic-multipage-scraper-post-generator/res/SitemapParser-master/src/SitemapParser.php(164): vipnytt\SitemapParser->parseString('<html><meta htt…')
      #1 /home/sites/11a/3/38cf2f9eff/public_html/abc/wp-content/plugins/crawlomatic-multipage-scraper-post-generator/res/SitemapParser-master/src/SitemapParser.php(96): vipnytt\SitemapParser->parse('https://www.web…', '<html><meta htt…', '', '', '0', '')
      #2 /home/sites/11a/3/38cf2f9eff/public_html/abc/wp-content/plugins/crawlomatic-multipage-scraper-post-generator/crawlomatic-multipage-scraper-post-generator.php(9426): vipnytt\SitemapParser->parseRecursive('https://www.web…', '<html><meta htt…', '', '', '0', '')
      #3 /home/sites/11a/3/38cf2f9eff/public_html/ab, file: /home/sites/11a/3/38cf2f9eff/public_html/abc/wp-content/plugins/crawlomatic-multipage-scraper-post-generator/res/SitemapParser-master/src/SitemapParser.php, line: 287 – rule ID: 61d815c40da050.63106445!
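
One detail in the trace above: the string handed to parseString() begins with an `<html>` tag, so the parser appears to have received an HTML page rather than sitemap XML. As a rough way to check what a sitemap URL actually returns before feeding it to a scraper, here is a minimal Python sketch. The function name `looks_like_sitemap` and the sample strings are assumptions made up for illustration; this is not part of the plugin or of the SitemapParser library.

```python
import xml.etree.ElementTree as ET

# Root element names used by sitemap files and sitemap index files.
SITEMAP_ROOTS = {"urlset", "sitemapindex"}


def looks_like_sitemap(body: str) -> bool:
    """Return True if body parses as XML whose root is a sitemap element.

    Hypothetical helper for illustration only.
    """
    try:
        root = ET.fromstring(body.strip())
    except ET.ParseError:
        # Not well-formed XML (typical for an HTML error or redirect page).
        return False
    # Drop a namespace prefix such as "{http://www.sitemaps.org/schemas/sitemap/0.9}".
    local_name = root.tag.rsplit("}", 1)[-1]
    return local_name in SITEMAP_ROOTS


if __name__ == "__main__":
    # Made-up sample bodies, not the real responses from the site in question.
    xml_body = (
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
        "<url><loc>https://example.com/post-1</loc></url>"
        "</urlset>"
    )
    html_body = '<html><meta charset="utf-8"><body>Not a sitemap</body></html>'
    print(looks_like_sitemap(xml_body))
    print(looks_like_sitemap(html_body))
```

If the check fails for a URL that opens fine in a browser, the server may be answering scripts differently than browsers, which would be worth mentioning in a support request.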

    • #4452


      Szabi – CodeRevolution
      Keymaster
      Post count: 4205

      Hello,

      First of all, thank you for your purchase.

      Could you please send me temporary admin login credentials for your WordPress install so I can look into this issue? Please send them to my email address: [email protected].

      Regards, Szabi – CodeRevolution.

    • #4461


      teddychu2001
      Participant
      Post count: 9

      Thanks for your reply, Szabi. I think the issue is with the website being scraped, https://www.webnots.com/post-sitemap3.xml, because I have no issues scraping other websites.

      Could you please try scraping this URL to see whether any special settings are needed?

      Thanks for your help.

    • #4462


      Szabi – CodeRevolution
      Keymaster
      Post count: 4205

      I just tried scraping the site on my demo site, and it worked without issues. Please check the result: https://wpinitiate.com/crawlomatic-test/demoa8ec4344/

      Judging by the errors you posted, this looks like an incompatibility with another plugin that is installed and active on your site. Another plugin might load an older version of the same library that my plugin uses, which would cause this conflict.

      To debug this, please try deactivating other (similar) plugins one by one to find which one might be causing the issue.

      Let me know if this helps.
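
The one-by-one approach can be sketched as a simple loop. This is only an illustration in Python: the function `find_conflicting_plugin`, the plugin slugs, and the three callbacks are all hypothetical stand-ins for whatever you actually use to toggle plugins and re-run the failing rule (for example, the WordPress admin screens).

```python
from typing import Callable, Iterable, Optional


def find_conflicting_plugin(
    plugins: Iterable[str],
    deactivate: Callable[[str], None],
    activate: Callable[[str], None],
    error_reproduces: Callable[[], bool],
) -> Optional[str]:
    """Deactivate plugins one at a time until the error stops reproducing.

    Returns the slug of the first plugin whose deactivation makes the error
    go away (that plugin is left deactivated), or None if no single plugin
    is responsible. All arguments are hypothetical callbacks.
    """
    for slug in plugins:
        deactivate(slug)
        if not error_reproduces():
            return slug
        # Not the culprit: switch it back on before testing the next one.
        activate(slug)
    return None


if __name__ == "__main__":
    # Simulated site with made-up plugin slugs; "other-sitemap-tool" is the
    # pretend culprit in this demo.
    active = {"seo-helper", "other-sitemap-tool", "cache-booster"}
    culprit = find_conflicting_plugin(
        sorted(active),
        deactivate=active.discard,
        activate=active.add,
        error_reproduces=lambda: "other-sitemap-tool" in active,
    )
    print(culprit)
```

Re-activating each plugin before moving on keeps the site as close as possible to its original state, so only one variable changes per test.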

      Regards.

    • #4566


      teddychu2001
      Participant
      Post count: 9
      This reply has been marked as private.
    • #4569


      Szabi – CodeRevolution
      Keymaster
      Post count: 4205

      Hello,

      I checked and saw that you had a very old version of the plugin installed on your site (2.0.0, released back in 2020). I updated it to the latest version, so the issue should be fixed now. Please check.

      Regards.

    • #4570


      teddychu2001
      Participant
      Post count: 9
      This reply has been marked as private.
    • #4571


      Szabi – CodeRevolution
      Keymaster
      Post count: 4205

      I am glad to help!

      If you like the plugin, please give it a rating on CodeCanyon. It is really appreciated! 🙂

      Regards.

The topic ‘Please help scraping’ is closed to new replies.