neon-mmd / websurfx

:rocket: An open source alternative to searx which provides a modern-looking :sparkles:, lightning-fast :zap:, privacy respecting :disguised_face:, secure :lock: meta search engine
GNU Affero General Public License v3.0
700 stars 91 forks source link

✨ Spam filtering of the aggregated search results #392

Open neon-mmd opened 9 months ago

neon-mmd commented 9 months ago

Description

Implement an algorithm to filter out spam results from the aggregated search results by analyzing the title, url and description of results and checking whether they are related to the search query or are completely unrelated (in other words if they are spam results). The algorithm should be implemented in the aggregator.rs file located in the src/results/ folder under the codebase (websurfx directory) where the algorithm to aggregate search results lie.

Screenshots

No response

Do you want to work on this issue?

None

Additional information

No response

github-actions[bot] commented 9 months ago

The issue has been unlocked and is now ready for dev. If you would like to work on this issue, you can comment to have it assigned to you. You can learn more in our contributing guide https://github.com/neon-mmd/websurfx/blob/rolling/CONTRIBUTING.md

lxrst commented 9 months ago

I'd be interested in working on this

alamin655 commented 9 months ago

Thank you for your willingness to contribute. 😊 I will assign you to the issue. If you have any questions or need further information, please don't hesitate to ask. ❤️


We have a Discord server; feel free to join and share your ideas and ask questions about the project. We would be glad to hear from you.

neon-mmd commented 7 months ago

@lxrst it has been a month so far, any progress on this issue so far? We would like to know :slightly_smiling_face: .

github-actions[bot] commented 5 months ago

Stale issue message

github-actions[bot] commented 2 months ago

Stale issue message