The Author box at the bottom of the page can’t be removed

This topic is: resolved

 

Thank you for contacting me. Please note that I live in the GMT+3 time zone - responses might be delayed by this.

Viewing 3 reply threads
  • Author
    Posts
    • #3251


      teddychu2001
      Participant
      Post count: 9

      Please find below are the links to scrape. I tried many different settings including using the “strip” function but still can’t remove the Author box information at the bottom of the pages after scraped. Can it be removed or not?

      https://candid.technology/spotify-acquires-podz/

      https://candid.technology/duckduckgo-desktop-app-announced/

    • #3252


      Szabi – CodeRevolution
      Keymaster
      Post count: 4556

      Hello,

      First of all, thank you for your purchase.

      Please try using the following settings in the plugin to scrape content from this particular website, without including the author box from the bottom:

      Content Query Type:
      Class

      Content Query String:
      post-content

      Strip HTML Elements by Class:
      has-inline-color has-vivid-cyan-blue-color,saboxplugin-wrap,tve-leads-two-step-trigger tl-2step-trigger-0,sidebar sidebar-after-post-content

       

      Let me know if this worked for you also.

      Regards, Szabi – CodeRevolution.

    • #3253


      teddychu2001
      Participant
      Post count: 9

      Thanks Szabi for your prompt response. Your solution solved the problem of removing the author box. However, it created another problem that the lazy load image HTML tag “data-src’ is not working when using Content Query String “post-content’. (It was working when using Auto Detect in Content Query String)

      How to fix this issue? (Not only remove author box but lazy load image needs working)

      Example URL:

      https://candid.technology/samsung-odyssey-flatscreen-gaming-monitor-g7-g5-g3/

      Thanks again

    • #3254


      Szabi – CodeRevolution
      Keymaster
      Post count: 4556

      I am glad to hear that it worked.

      You can solve the image issue using the following settings:

      Run Regex On Content: 

      (<noscript>)|(</noscript>)

       

      Regards.

Viewing 3 reply threads

The topic ‘The Author box at the bottom of the page can’t be removed’ is closed to new replies.