Forum Replies Created
-
AuthorPosts
-
January 5, 2023 at 7:12 pm in reply to: Weird Feed Failure #6566
Email sent.
-
January 5, 2023 at 6:42 pm in reply to: Weird Feed Failure #6563
Sorry, yes, using Crawlomatic along with HeadLessBrowser API, I bought the plugin today on Envato, and signed up for a subscription for the API today as well.
I will get that screenshot over to you a little later.
-
January 5, 2023 at 5:40 pm in reply to: Weird Feed Failure #6561
You missed out on a few points …
- The ConstantContact one HAS dates, but they were all published with today’s date.
- I set in main settings to not import anything before 1 January 2023, but for ConstantContact, it imported a bunch from December 2022, and yes, the posts has dates.
- I need help getting https://convertkit.com/resources/ to work in the scraper.
- Convertkit is the one that does not have dates. I am kind of okay publishing those on the days scraped.
- One last thing … In the RSS plugin, there is a URL that needs to be run in cron with wget to have the rules run. I don’t see this in the scraping plugin. Does it just put itself in the WP cron?
That’s all for now. Thank you so much for your help!
-
January 5, 2023 at 5:29 pm in reply to: Weird Feed Failure #6559
I also just noticed that the Convertkit one does not have dates on the posts/articles. OUCH!
-
January 5, 2023 at 5:18 pm in reply to: Weird Feed Failure #6558
That worked PERFECTLY, thank you!
There is ONE issue though … The scraping takes today’s date for the post, and not post publish date. I also set in the main settings to not import anything before 1 Jan 2023, but this one imported a bunch from December 2022.
Sorry for being a pain. Really new to all this. One more that does not even have an RSS feed.
I tried with the same settings, but not working.
Could you help with the scraping settings for this one please: https://convertkit.com/resources/
-
January 5, 2023 at 4:57 pm in reply to: Weird Feed Failure #6556
Setup everything exactly as you said, using HeadLessBrowserAPI, and it fails, and gives me this in the logs:
[5-Jan-2023 16:53:17 UTC] An error occurred while getting content from HeadlessBrowserAPI: https://headlessbrowserapi.com/apis/scrape/v1/puppeteer?apikey=MyValidKey&url=https%3A%2F%2Fwww.constantcontact.com%2Fblog%2F&custom_user_agent=Mozilla%2F5.0+%28Windows+NT+6.3%3B+Win64%3B+x64%29+AppleWebKit%2F537.36+%28KHTML%2C+like+Gecko%29+Chrome%2F60.0.3112.113+Safari%2F537.36&custom_cookies=default&user_pass=default&timeout=default&proxy_url=default&proxy_auth=default&solvecaptcha=1&enableadblock=1 – puppeteer Unhandled Rejection Unhandled Rejection, reason: Error: net::ERR_TUNNEL_CONNECTION_FAILED at https://www.constantcontact.com/blog/ at navigate (/var/www/html/wp-content/plugins/custom-scraper-api/res/puppeteer/node_modules/puppeteer/lib/cjs/puppeteer/common/FrameManager.js:115:23) at process._tickCallback (internal/process/next_tick.js:68:7) /var/www/html/wp-content/plugins/custom-scraper-api/res/puppeteer/puppeteer.js:33 process.on(‘unhandledRejection’, up => { console.error(‘Unhandled Rejection, reason:’, up);throw up }) ^ Error: net::ERR_TUNNEL_CONNECTION_FAILED at https://www.constantcontact.com/blog/ at navigate (/var/www/html/wp-content/plugins/custom-scraper-api/res/puppeteer/node_modules/puppeteer/lib/cjs/puppeteer/common/FrameManager.js:115:23) at process._tickCallback (internal/process/next_tick.js:68:7)
[5-Jan-2023 16:53:22 UTC] Failed to get source web page, importing will not run from this URL! https://www.constantcontact.com/blog/ – -
January 5, 2023 at 4:00 am in reply to: Weird Feed Failure #6545
Thank you for your detailed response.
Can Crawlomatic do Excerpts of the scraped posts, or only complete post? I am not talking about summary with something like TLDRThis, I mean just an Excerpt of the original content?
Thanks
Rich.
-
January 3, 2023 at 6:03 pm in reply to: TLDRThis Issue #6531
Facepalm moment!
I update the plugin to the latest version, and BINGO, working!
-
AuthorPosts