richelo

Forum Replies Created

Viewing 8 posts - 1 through 8 (of 8 total)
  • Author
    Posts
  • in reply to: Weird Feed Failure #6566


    richelo
    Participant
    Post count: 8

    Email sent.

  • in reply to: Weird Feed Failure #6563


    richelo
    Participant
    Post count: 8

    Sorry, yes, using Crawlomatic along with HeadLessBrowser API, I bought the plugin today on Envato, and signed up for a subscription for the API today as well.

    I will get that screenshot over to you a little later.

  • in reply to: Weird Feed Failure #6561


    richelo
    Participant
    Post count: 8

    You missed out on a few points …

    • The ConstantContact one HAS dates, but they were all published with today’s date.
    • I set in main settings to not import anything before 1 January 2023, but for ConstantContact, it imported a bunch from December 2022, and yes, the posts has dates.
    • I need help getting https://convertkit.com/resources/ to work in the scraper.
    • Convertkit is the one that does not have dates. I am kind of okay publishing those on the days scraped.
    • One last thing … In the RSS plugin, there is a URL that needs to be run in cron with wget to have the rules run. I don’t see this in the scraping plugin. Does it just put itself in the WP cron?

    That’s all for now. Thank you so much for your help!

     

  • in reply to: Weird Feed Failure #6559


    richelo
    Participant
    Post count: 8

    I also just noticed that the Convertkit one does not have dates on the posts/articles. OUCH!

  • in reply to: Weird Feed Failure #6558


    richelo
    Participant
    Post count: 8

    That worked PERFECTLY, thank you!

    There is ONE issue though … The scraping takes today’s date for the post, and not post publish date. I also set in the main settings to not import anything before 1 Jan 2023, but this one imported a bunch from December 2022.

    Sorry for being a pain. Really new to all this. One more that does not even have an RSS feed.

    I tried with the same settings, but not working.

    Could you help with the scraping settings for this one please: https://convertkit.com/resources/

     

  • in reply to: Weird Feed Failure #6556


    richelo
    Participant
    Post count: 8

    Setup everything exactly as you said, using HeadLessBrowserAPI, and it fails, and gives me this in the logs:

    [5-Jan-2023 16:53:17 UTC] An error occurred while getting content from HeadlessBrowserAPI: https://headlessbrowserapi.com/apis/scrape/v1/puppeteer?apikey=MyValidKey&url=https%3A%2F%2Fwww.constantcontact.com%2Fblog%2F&custom_user_agent=Mozilla%2F5.0+%28Windows+NT+6.3%3B+Win64%3B+x64%29+AppleWebKit%2F537.36+%28KHTML%2C+like+Gecko%29+Chrome%2F60.0.3112.113+Safari%2F537.36&custom_cookies=default&user_pass=default&timeout=default&proxy_url=default&proxy_auth=default&solvecaptcha=1&enableadblock=1 – puppeteer Unhandled Rejection Unhandled Rejection, reason: Error: net::ERR_TUNNEL_CONNECTION_FAILED at https://www.constantcontact.com/blog/ at navigate (/var/www/html/wp-content/plugins/custom-scraper-api/res/puppeteer/node_modules/puppeteer/lib/cjs/puppeteer/common/FrameManager.js:115:23) at process._tickCallback (internal/process/next_tick.js:68:7) /var/www/html/wp-content/plugins/custom-scraper-api/res/puppeteer/puppeteer.js:33 process.on(‘unhandledRejection’, up => { console.error(‘Unhandled Rejection, reason:’, up);throw up }) ^ Error: net::ERR_TUNNEL_CONNECTION_FAILED at https://www.constantcontact.com/blog/ at navigate (/var/www/html/wp-content/plugins/custom-scraper-api/res/puppeteer/node_modules/puppeteer/lib/cjs/puppeteer/common/FrameManager.js:115:23) at process._tickCallback (internal/process/next_tick.js:68:7)
    [5-Jan-2023 16:53:22 UTC] Failed to get source web page, importing will not run from this URL! https://www.constantcontact.com/blog/

  • in reply to: Weird Feed Failure #6545


    richelo
    Participant
    Post count: 8

    Thank you for your detailed response.

    Can Crawlomatic do Excerpts of the scraped posts, or only complete post? I am not talking about summary with something like TLDRThis, I mean just an Excerpt of the original content?

    Thanks

    Rich.

  • in reply to: TLDRThis Issue #6531


    richelo
    Participant
    Post count: 8

    Facepalm moment!

    I update the plugin to the latest version, and BINGO, working!

Viewing 8 posts - 1 through 8 (of 8 total)