The rules for bots

Bots should be clearly prompted by a command
Bots that always post without human intervention are noisy and often unwanted.

Bots should not act in a community without mods from that community being contacted first
Moderators should not have to chase down the bots being used in their community; it should be opt-in. Just ask, be nice.

Bots should minimize the space they take with their messages
A bot’s response should be as small as possible to avoid taking space that could be used by people.
I’m just trying to understand, generally speaking, how the bot works. It appears to me that the bot is looking for posted articles that exceed a certain word count threshold. If it finds these, then it creates a summary and posts this as a comment. Am I understanding this correctly?
It only supports specific news sites; I don’t want to rely on automatic text extraction because it is prone to breaking. Here are the content extractors themselves, one per site. If a post containing a link to any of the supported sites is found anywhere on Lemmy (that the bot can see), it extracts the text and then summarizes it using this. The summary is 6 sentences taken directly from the article, the ones that look most important to the machine learning model it uses. Then it posts that as a comment.
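For anyone who wants to picture the flow, here is a rough sketch of that pipeline in Python. It is not the bot's actual code: the `EXTRACTORS` registry, the `news.example.com` host, and every function name are made up for illustration, and the word-frequency scoring is only a stand-in for whatever machine learning model the bot really uses. What it does show is the shape described above: a per-site extractor lookup, then an extractive summary that keeps up to 6 sentences verbatim.

```python
import re
from collections import Counter
from typing import Callable, Dict, List, Optional
from urllib.parse import urlparse

# Hypothetical registry: hostname -> function turning raw HTML into article text.
EXTRACTORS: Dict[str, Callable[[str], str]] = {}


def extractor(host: str):
    """Register a site-specific extractor for one hostname."""
    def register(fn: Callable[[str], str]) -> Callable[[str], str]:
        EXTRACTORS[host] = fn
        return fn
    return register


@extractor("news.example.com")  # placeholder site, not a real supported one
def extract_example(html: str) -> str:
    # Assumed site rule: the article body lives in plain <p> tags.
    paragraphs = re.findall(r"<p>(.*?)</p>", html, flags=re.S)
    return " ".join(p.strip() for p in paragraphs)


def find_extractor(url: str) -> Optional[Callable[[str], str]]:
    host = urlparse(url).hostname or ""
    if host.startswith("www."):
        host = host[4:]
    return EXTRACTORS.get(host)


def split_sentences(text: str) -> List[str]:
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def summarize(text: str, max_sentences: int = 6) -> List[str]:
    """Pick the highest-scoring sentences, unchanged, in article order.

    Scoring is average word frequency, a simple stand-in for the real model.
    """
    sentences = split_sentences(text)
    freq = Counter(w for w in re.findall(r"[a-z']+", text.lower()) if len(w) > 3)

    def score(sentence: str) -> float:
        tokens = re.findall(r"[a-z']+", sentence.lower())
        return sum(freq[t] for t in tokens) / (len(tokens) or 1)

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])  # restore original order
    return [sentences[i] for i in keep]


def handle_post(url: str, html: str) -> Optional[str]:
    """Return a comment body, or None if the site is unsupported."""
    extract = find_extractor(url)
    if extract is None:
        return None  # unsupported site: the bot stays silent
    summary = summarize(extract(html))
    return "\n".join(summary) if summary else None
```

The point of the registry is that an unsupported link simply yields `None`, so nothing gets posted rather than a garbled summary from a generic extractor.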