anarcha Is there an option to get the original feed content as it is? I think we've talked about this before, I remember it was something planned.
Not yet, sorry, but still planned
I finally found the problem, seems like a bug with full-text-rss, by default pruning mode is enabled to sanitize the extracted content.
I added 'pruning: no' to the config, and now the full content is extracted:
https://direct.infolibertaire.net/rss_aggregator/full-text-rss/makefulltextfeed.php?force=1&url=https%3A%2F%2Flib.anarcho-copy.org%2Ffeed&max=5&links=preserve&submit=Create+Feed
Sometimes enabling debug mode (&debug=1) in FTR helps to understand what's happening behind the scene:
https://direct.infolibertaire.net/rss_aggregator/full-text-rss/makefulltextfeed.php?force=1&url=https%3A%2F%2Flib.anarcho-copy.org%2Ffeed&max=5&links=preserve&submit=Create+Feed&debug=1
Now I see in the backend, the missing articles are correctly being scraped:
https://www.anarsisthaberler.net/rss_aggregator/get_rss.php?force=1&url=https%3A%2F%2Flib.anarcho-copy.org%2Ffeed
anarcha If we take into account the situation stated in the above item; considering that Anarcho-Copy is an archive site, able to frequently share on the same days, and that the contents only point to one file, I do not see any problem with removing the source from the list.
It's up to you if you want to completely remove this source or not. Too many articles could flood the site and give less visibility to other articles with more content. That was a problem on some other sites too, so this is a valid point.
One solution that we are considering is to add a separate "library" section, but since there are already well-established websites (like AnarchistLibraries) doing an excellent job at it, there is no point in scraping the content. However, we could just index the list of books and integrate it into the platform (with external links)
anarcha I see two posts[1] [2] from Anarcho-Copy posted here what is causing this issue? (Even though I ran the option "Delete and re-import this post", this situation did not improve.)
Not sure about this, maybe I did an error when I was trying to fix the feed. Let's delete the article and see if it gets re-imported by the cron job.
anarcha Thank you, can you email the FTP instructions?
paging @Anarcho_Tech_Team here. Please create an FTP account for our friend to edit the FTR config files. I think it would be helpful to also give access to the RSS aggregator's sources list config files, so they can edit the feed as they seem comfortable with PHP stuff.