Beit-Hatfutsot / dbs-back

The MoJP RESTfull API server
GNU Affero General Public License v3.0
1 stars 4 forks source link

[task] research a big gap in the number of results between "Clearmash" & "BHP data as available in dev Elasticsearch #193

Open Netush opened 7 years ago

Netush commented 7 years ago

Need to understand why there is a big gap in the number of results between "Clearmash" & "BHP. At movies collection/Type - In BHP the total results is 116 & in Clearmash is 1192 At personalities collection/Type - In BHP the total results is 3312 & in Clearmash is 7373 At places collection/Type - In BHP the total results is 3405 & in Clearmash is 6245 & at Photo units collection/Type - In BHP the total results is 45795 & in clearmash is 60002

OriHoch commented 7 years ago

thanks, I understand this is all in the Elasticsearch data

Will be interesting to compare with the source Clearmash data and the source BHP data and compare those (e.g. log-in to clearmash or BHP and see what you see there)

related issue: Beit-Hatfutsot/mojp-dbs-pipelines#17

OriHoch commented 7 years ago

in latest pipelines version we only sync items allowed to show

now it looks like numbers are better, not exactly the same, but less different..

I guess there will be difference between clearmash data is updated while BHP data is not