josefernandilho

Forum Replies Created

Viewing 6 posts - 1 through 6 (of 6 total)
  • Author
    Posts
  • in reply to: Crwaling perfrom with multi-type strings #7492


    josefernandilho
    Participant
    Post count: 6

    Hello Team

    How can I scrape the close date and add it to the post content?

     

    Thanks

  • in reply to: Crawling is not working with https://www.seek.com.au/ #7484


    josefernandilho
    Participant
    Post count: 6

    Hello Team,

    Thank you for your response and it’s working now.

  • in reply to: Crawling is not working with https://www.seek.com.au/ #7472


    josefernandilho
    Participant
    Post count: 6

    Hello Team

    I have again created the expressions but still got my custom fields empty. Please see the screenshot
    https://prnt.sc/a9BRq7OEkFWW

    Thanks

  • in reply to: Crawling is not working with https://www.seek.com.au/ #7471


    josefernandilho
    Participant
    Post count: 6

    Hello Team

    But I have set the shortcode for job_salary_value and company_name which is

    job_salary_value => regex @@ #<span\s+class=”_1wkzzau0\s+a1msqi4y\s+lnocuo0\s+lnocuo1\s+lnocuo21\s+_1d0g9qk4\s+lnocuo7″\s+data-automation=”jobSalary”\s*><div\s+class=”_1wkzzau0\s+v28kuf0\s+v28kuf4\s+v28kuf2″\s+title=”([^”]+)”\s*>([^<]*)</div></span>#
    company_name => regex @@ #<a\s+href=”([^”]+)”\s+rel=”nofollow”\s+class=”(?:_1wkzzau0\s+){2}_842p0a0″\s+title=”([^”]+)”\s+aria-label=”([^”]+)”\s+data-automation=”jobCompany”\s*>([^<]+)#

    These are not working. For the time being I need these on priority  

    thanks

  • in reply to: Crawling is not working with https://www.seek.com.au/ #7469


    josefernandilho
    Participant
    Post count: 6

    Hello Team

    I have forwarded the login details to the shared email. Please check. It’s from sukhjinderbham@gmail.com

    Thanks

  • in reply to: Jobs Scraper #6874


    josefernandilho
    Participant
    Post count: 6

    <div class=”aju”></div>
    <div class=”gs”>
    <div class=””>
    <div id=”:p4″ tabindex=”-1″>Hi Szabi</div>
    <div id=”:ot” class=”ii gt adO”>
    <div id=”:os” class=”a3s aiL “>
    <div dir=”ltr”>
    <div>
    Thanks for the explanations, I’m understanding a bit more but still struggling to actually get the data in correctly. So for</div>
    <div></div>
    <div>1) Custom taxonomies – I have used job_listing_type => uk-ihe-jobs (slug) but this actually just creates a new custom field name uk-ihe-jobs. Should I just use the name of the custom field, as opposed to the slug? (basically I want all posts scraped by this rule to be assigned to my existing custom post type whose slug is uk-ihe-jobs and name is “UK International Higher Education Jobs”)

    2)I’ve created the custom shortcodes but am struggling a bit with the crawler, I used the helper but since the information I am scraping is from a table the CLASS doesn’t really help – is there a way to use HTML selectors?

    as an example this is an example page to be scraped – https://www.jobs.ac.<wbr />uk/job/CXO168/senior-<wbr />international-recruitment-<wbr />officer

    And I would like to scrape the following:</div>
    <div>
    _job_salary => /html/body/div[1]/div[1]/div[<wbr />2]/div/div[1]/table/tbody/tr[<wbr />2]/td
    _job_location => /html/body/div[1]/div[1]/div[<wbr />2]/div/div[1]/table/tbody/tr[<wbr />1]/td
    _company_tagline => /html/body/div[1]/div[1]/div[<wbr />2]/div/div[1]/table/tbody/tr[<wbr />4]/td
    _company_twitter => /html/body/div[1]/div[1]/div[<wbr />2]/div/div[1]/table/tbody/tr[<wbr />3]/td
    _job_expires => /html/body/div[1]/div[1]/div[<wbr />2]/div/div[2]/table/tbody/tr[<wbr />2]/td
    _company_name => /html/body/div[1]/div[1]/h3/b/<wbr />span
    _application => /html/body/div[1]/div[1]/div[<wbr />3]/a/@href

    Is there any way I could tweak the above so it actually works and creates the shortcode with the right data in?

    Thanks!  <span style=”color: #888888;”>

    Jose</span></div>
    </div>
    <div class=”yj6qo ajU”></div>
    </div>
    </div>
    </div>
    </div>

Viewing 6 posts - 1 through 6 (of 6 total)