commonsearch / cosr-back

Backend of Common Search. Analyses webpages and sends them to the index.
https://about.commonsearch.org
Apache License 2.0
122 stars 24 forks source link

Add first document-level quality signals #28

Open sylvinus opened 8 years ago

sylvinus commented 8 years ago

We will need to have a model that evaluates many features from documents and gives us a document quality score.

Before doing any machine learning, it would be great to explore the first few features/signals we could include.

A first list of ideas, please add your own!

sylvinus commented 8 years ago

34 may give us other signal ideas

IvRRimum commented 8 years ago

404 pages ?

indolering commented 7 years ago