chnm / serendipomatic

http://serendipomatic.org/

Serendipity machine results filtering #3

Open mialondon opened 11 years ago

mialondon commented 11 years ago

Tasks related to improving the filtering of results returned by the API queries.

mialondon commented 11 years ago

See also https://github.com/chnm/serendipomatic/issues/66 and https://github.com/chnm/serendipomatic/issues/117 for related work.

An example where de-duping a parent item and its parts would help: I tried the lyrics for 'There is a light that never goes out' (long story) and got lots of pages from this one item in the results: http://www.europeana1914-1918.eu/en/contributions/3369. It'd be better to get just the parent item instead of lots of the part/child items (e.g. http://www.europeana.eu/portal/record/2020601/D75A016FA1F513107E98B520F9F9300DF0120357.html?utm_source=api&utm_medium=api&utm_campaign=5QpzDWzoy). We might be able to filter on part/whole in the query, or get the parent ID from the part/child record.
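
A rough sketch of the second option (collapsing part/child records onto their parent after the results come back), in case it helps the discussion. The field names `id` and `parent_id` are made up for illustration; I haven't checked what the API actually returns for part/whole relationships, so treat this as an assumption rather than how the records really look.

```python
def collapse_to_parents(results):
    """Return one result per parent item, keeping first-seen order.

    Assumes each result is a dict with a hypothetical "id" field, and
    that part/child records also carry a hypothetical "parent_id".
    """
    groups = {}  # group key (parent id) -> chosen representative
    for item in results:
        # Children group under their parent; standalone items under themselves.
        key = item.get("parent_id") or item["id"]
        current = groups.get(key)
        # Take the first record seen for a group, but swap in the real
        # parent record if it turns up later in the results.
        if current is None or (item["id"] == key and current["id"] != key):
            groups[key] = item
    return list(groups.values())


results = [
    {"id": "3369/page1", "parent_id": "3369"},
    {"id": "3369/page2", "parent_id": "3369"},
    {"id": "3369"},
    {"id": "something-else"},
]
collapsed = collapse_to_parents(results)
```

If the parent record is not in the results at all, this keeps the first child as a stand-in; its `parent_id` could then be used to look up the parent separately.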