Thank you for contacting me. Please note that I live in the GMT+3 time zone - responses might be delayed by this.
This topic has 17 replies, 2 voices, and was last updated 2 months, 4 weeks ago by Szabi – CodeRevolution.
-
September 14, 2025 at 5:53 pm #12463
I want to crawl news content from https://m.sports.naver.com/golf/news.
The article URLs look like this:
ex1) https://m.sports.naver.com/golf/article/009/0005558352
ex2) https://m.sports.naver.com/golf/article/018/0006115678
The numbers after article/ are different for each article.
Is there any way to crawl these automatically?
Also, when posts are published automatically, the main page of my site goes down.
May I know why?
My site is: https://golfnfriend.com/
-
September 14, 2025 at 5:55 pm #12468
Hello,
First of all, thank you for your purchase.
Could you please send me temporary admin login credentials for your WordPress install, so I can check this issue? Please send them to my email address: kisded@yahoo.com.
Regards,
Szabi – CodeRevolution.
-
September 15, 2025 at 4:24 am #12471
I have sent the temporary admin ID and password by email.
Please check.
-
September 15, 2025 at 9:03 am #12474
Could you please try again?
I was able to log in with the ID and password I shared.
-
September 15, 2025 at 12:57 pm #12475
Hello,
I checked and I am still getting the same issue. Please check the email I sent you; it includes a screen recording of the issue.
Regards.
-
September 16, 2025 at 12:18 am #12482
Thank you for your quick confirmation.
All security plugins have been deleted.
Please try logging in again.
-
September 16, 2025 at 12:49 am #12483
The hosting company had configured the server to allow access only from Korea. This has now been changed.
You should be able to access it now.
-
September 16, 2025 at 8:22 am #12487
Hello,
Thank you for the login credentials. I checked the site you want to scrape and, indeed, scraping it is very hard, because its content is fully JavaScript generated (dynamic): everything is rendered on the page after it is loaded (this is why you see the lazy loading placeholders when you load the page).
I managed to scrape it automatically only by using Puppeteer. It needs to be installed on your server, as shown here: https://www.youtube.com/watch?v=pRUDcSOe724. Please contact your hosting support and ask about this.
After it is installed, it can be used as shown here: https://www.youtube.com/watch?v=ZljpMpmi_dU
I managed to scrape the site you mentioned, using the settings below:
Scraper Start (Seed) URL / Keywords: https://m.sports.naver.com/golf/index
Content Scraping Method To Use: Puppeteer
Headless Browser Wait Before Rendering Pages (ms): 5000
Do Not Scrape Seed URL: Checked
Seed Page Crawling Query Type: Class
Seed Page Crawling Query String: grid_item
Content Query Type: Class
Content Query String: _article_content
I hope this helps.
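As a rough illustration of what the seed query in the settings above selects, here is a minimal Python sketch. It assumes you already have the rendered HTML (for example, saved after the headless browser has finished loading the page). The sample markup, the GridItemLinkParser class, and the ARTICLE_URL pattern are hypothetical; they only mirror the ex1/ex2 URL shape and the grid_item class mentioned in this topic, and this is not how the plugin itself is implemented.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Assumed pattern, inferred only from the two example URLs in this topic:
# /golf/article/<digits>/<digits>
ARTICLE_URL = re.compile(r"^https://m\.sports\.naver\.com/golf/article/\d+/\d+$")

# Void elements never get a closing tag, so they must not change nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class GridItemLinkParser(HTMLParser):
    """Collect hrefs of <a> tags at or inside elements carrying a given class."""

    def __init__(self, base_url, class_name="grid_item"):
        super().__init__()
        self.base_url = base_url
        self.class_name = class_name
        self.depth = 0  # greater than 0 while inside a matching element
        self.links = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        classes = (attr.get("class") or "").split()
        inside = self.depth > 0 or self.class_name in classes
        if inside and tag == "a" and attr.get("href"):
            # Resolve relative links against the seed page URL.
            self.links.append(urljoin(self.base_url, attr["href"]))
        if inside and tag not in VOID_TAGS:
            self.depth += 1

    def handle_endtag(self, tag):
        if self.depth > 0 and tag not in VOID_TAGS:
            self.depth -= 1


def article_links(rendered_html, base_url):
    """Return unique article URLs found inside grid_item elements, in page order."""
    parser = GridItemLinkParser(base_url)
    parser.feed(rendered_html)
    parser.close()
    return list(dict.fromkeys(u for u in parser.links if ARTICLE_URL.match(u)))


# Hypothetical sample markup; the real page structure may differ.
sample = """
<ul>
  <li class="grid_item"><a href="/golf/article/009/0005558352"><img src="t.jpg">Story</a></li>
  <li class="grid_item other"><a href="https://m.sports.naver.com/golf/article/018/0006115678">Story 2</a></li>
  <li><a href="/golf/article/001/0000000001">Outside grid_item</a></li>
  <li class="grid_item"><a href="/golf/schedule">Not an article</a></li>
</ul>
"""

print(article_links(sample, "https://m.sports.naver.com/golf/index"))
```

The depth counter is a simplification that expects reasonably well-formed markup. The regular expression is what handles the changing numbers after article/: any pair of numeric path segments is accepted.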
Regards,
Szabi – CodeRevolution.
-
September 16, 2025 at 9:30 am #12488
Thank you very much.
I was impressed with your skills. As you suggested, I will proceed using Puppeteer.
If I run into any problems, I will post another inquiry.
Thank you.
-
September 16, 2025 at 11:12 am #12489
I am glad to help!
-
September 17, 2025 at 2:07 am #12496
To install Puppeteer on a Windows operating system, do I have to sign up for https://cloud.digitalocean.com/?
I would also like to ask about the cost.
-
September 17, 2025 at 5:25 am #12498
Puppeteer needs to be installed on your server (where your site is running), not on your local Windows computer. If your current hosting does not allow installing Puppeteer, you can set up your site on DigitalOcean, where you will be able to install it.
If your site is running locally on your computer (localhost), you can install Puppeteer on Windows, as shown here: https://www.youtube.com/watch?v=s4fEYCOIZjk
Regards.
-
September 17, 2025 at 7:16 am #12499
I created the site through a hosting company called Cafe24 in Korea.
(https://hosting.cafe24.com/?controller=new_product_page&page=adsense-wordpress)
The hosting company provides the WordPress CMS, and it can be accessed by FTP.
In this case, I would like to ask whether it is okay to install Node.js and npm via the FTP path.
-
September 17, 2025 at 8:21 am #12500
I do not think it is possible to install Node.js and npm using only FTP; as far as I know, you need SSH access for this. Please ask your hosting company whether they can also grant SSH access to your server.
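Once SSH access is available, one quick way to see whether Node.js and npm are already present on the server is a small script like the one below. This is only a sketch: tool_status is a hypothetical helper name, and it assumes Python 3 is available on the server, which has not been confirmed for your hosting.

```python
import shutil
import subprocess


def tool_status(names=("node", "npm")):
    """Return {tool: version string, or None if the tool is not on PATH}."""
    status = {}
    for name in names:
        path = shutil.which(name)  # None when the executable is not found
        if path is None:
            status[name] = None
            continue
        try:
            result = subprocess.run([path, "--version"], capture_output=True,
                                    text=True, timeout=30)
            status[name] = result.stdout.strip() or "unknown version"
        except (OSError, subprocess.SubprocessError):
            status[name] = "found, but could not run"
    return status


if __name__ == "__main__":
    for tool, version in tool_status().items():
        print(f"{tool}: {version or 'not installed'}")
```

If both tools are reported as not installed and the host cannot add them, Puppeteer cannot run on that server.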
Regards.
-
September 17, 2025 at 10:01 am #12501
-
September 17, 2025 at 10:14 am #12503
You will have to ask your hosting support about this. They might have a security measure in place that blocks access.
-
September 17, 2025 at 2:25 pm #12506
This reply has been marked as private.
-
September 17, 2025 at 5:35 pm #12509
I am sorry, but installing Puppeteer using phpMyAdmin is not possible. phpMyAdmin is just a PHP web app for managing databases; it has no way to install or run Node.js scripts.
Sorry for this.
-
The topic ‘Auto-crawling problems’ is closed to new replies.