Forum Replies Created
-
AuthorPosts
-
May 10, 2023 at 11:38 am in reply to: Crwaling perfrom with multi-type strings #7492
Hello Team
How can I scrape the close date and add it to the post content?
Thanks
-
May 10, 2023 at 10:22 am in reply to: Crawling is not working with https://www.seek.com.au/ #7484
Hello Team,
Thank you for your response and it’s working now.
-
May 10, 2023 at 6:41 am in reply to: Crawling is not working with https://www.seek.com.au/ #7472
Hello Team
I have again created the expressions but still got my custom fields empty. Please see the screenshot
https://prnt.sc/a9BRq7OEkFWWThanks
-
May 10, 2023 at 5:53 am in reply to: Crawling is not working with https://www.seek.com.au/ #7471
Hello Team
But I have set the shortcode for job_salary_value and company_name which is
job_salary_value => regex @@ #<span\s+class=”_1wkzzau0\s+a1msqi4y\s+lnocuo0\s+lnocuo1\s+lnocuo21\s+_1d0g9qk4\s+lnocuo7″\s+data-automation=”jobSalary”\s*><div\s+class=”_1wkzzau0\s+v28kuf0\s+v28kuf4\s+v28kuf2″\s+title=”([^”]+)”\s*>([^<]*)</div></span>#
company_name => regex @@ #<a\s+href=”([^”]+)”\s+rel=”nofollow”\s+class=”(?:_1wkzzau0\s+){2}_842p0a0″\s+title=”([^”]+)”\s+aria-label=”([^”]+)”\s+data-automation=”jobCompany”\s*>([^<]+)#These are not working. For the time being I need these on priority
thanks
-
May 10, 2023 at 5:23 am in reply to: Crawling is not working with https://www.seek.com.au/ #7469
Hello Team
I have forwarded the login details to the shared email. Please check. It’s from sukhjinderbham@gmail.com
Thanks
-
February 18, 2023 at 12:26 pm in reply to: Jobs Scraper #6874
<div class=”aju”></div>
<div class=”gs”>
<div class=””>
<div id=”:p4″ tabindex=”-1″>Hi Szabi</div>
<div id=”:ot” class=”ii gt adO”>
<div id=”:os” class=”a3s aiL “>
<div dir=”ltr”>
<div>
Thanks for the explanations, I’m understanding a bit more but still struggling to actually get the data in correctly. So for</div>
<div></div>
<div>1) Custom taxonomies – I have used job_listing_type => uk-ihe-jobs (slug) but this actually just creates a new custom field name uk-ihe-jobs. Should I just use the name of the custom field, as opposed to the slug? (basically I want all posts scraped by this rule to be assigned to my existing custom post type whose slug is uk-ihe-jobs and name is “UK International Higher Education Jobs”)2)I’ve created the custom shortcodes but am struggling a bit with the crawler, I used the helper but since the information I am scraping is from a table the CLASS doesn’t really help – is there a way to use HTML selectors?
as an example this is an example page to be scraped – https://www.jobs.ac.<wbr />uk/job/CXO168/senior-<wbr />international-recruitment-<wbr />officer
And I would like to scrape the following:</div>
<div>
_job_salary => /html/body/div[1]/div[1]/div[<wbr />2]/div/div[1]/table/tbody/tr[<wbr />2]/td
_job_location => /html/body/div[1]/div[1]/div[<wbr />2]/div/div[1]/table/tbody/tr[<wbr />1]/td
_company_tagline => /html/body/div[1]/div[1]/div[<wbr />2]/div/div[1]/table/tbody/tr[<wbr />4]/td
_company_twitter => /html/body/div[1]/div[1]/div[<wbr />2]/div/div[1]/table/tbody/tr[<wbr />3]/td
_job_expires => /html/body/div[1]/div[1]/div[<wbr />2]/div/div[2]/table/tbody/tr[<wbr />2]/td
_company_name => /html/body/div[1]/div[1]/h3/b/<wbr />span
_application => /html/body/div[1]/div[1]/div[<wbr />3]/a/@hrefIs there any way I could tweak the above so it actually works and creates the shortcode with the right data in?
Thanks! <span style=”color: #888888;”>
Jose</span></div>
</div>
<div class=”yj6qo ajU”></div>
</div>
</div>
</div>
</div> -
AuthorPosts