Mediumomatic: Try To Get Full Article Content option skips content

This topic is: resolved

 

Thank you for contacting me. Please note that I live in the GMT+3 time zone - responses might be delayed by this.

Tagged: ,

Viewing 3 reply threads
  • Author
    Posts
    • #5856


      gerog
      Participant
      Post count: 3

      Hello! I found an issue with parsing with enabled “Try To Get Full Article Content” option. It skips pieces of content 🙂 I tried to explain on screenshots attached.

      So sometimes headers are missing, or content after 3 dots is missing. This what I met from 2 from 3 checked posts.

      If i disable this option everything works fine (the plugin parse full content, just adds unnecessary block of code).

      Attachments:
      You must be logged in to view attached files.
    • #5860


      Szabi – CodeRevolution
      Keymaster
      Post count: 4195

      Hello,

      If you enable the ‘Try To Get Full Article Content’ option, the plugin will scrape the article from Medium and try to automatically detect the readable content from the scraped HTML content. This can be imperfect in some cases, as you pointed it out. I will look into improving this algorithm, however, because Medium does not have a precise structure in their HTML content for posts, this is a bit tricky.

      This feature was added only for those publications which do not have full content added in the response which Medium returns for the initial API call, however, this is not the case for the publication you are using, so, I recommend you uncheck the ‘Try To Get Full Article Content’ checkbox.

      Also, to remove the unwanted part of the page, I recommend you also add in rule settings, the settings from below:

      Run Regex On Content:
      <a([^>]*?)>([^<]*?)<\/a> was originally published in <a href=”https:\/\/blog\.ssv\.network\/”>ssv\.network<\/a>([\s\S]*)

       

      I hope this info helps.

      Regards, Szabi – CodeRevolution.

    • #5861


      gerog
      Participant
      Post count: 3

      Thank you Szabi!

      Everything works! Best support ever! 😀

      I changed regex a little bit and attached screenshot below for those who will met the same issue.

      <hr>

      <a([^>]*?)>([^<]*?)<\/a> was originally published in ([\s\S]*)

       

       

       

      Attachments:
      You must be logged in to view attached files.
    • #5863


      Szabi – CodeRevolution
      Keymaster
      Post count: 4195

      Glad to help! 🙂

Viewing 3 reply threads

The topic ‘Mediumomatic: Try To Get Full Article Content option skips content’ is closed to new replies.