maastrichtlawtech / extraction_libraries

Python libraries for extracting from data sources like Rechtspraak, ECHR, Cellar
Apache License 2.0
10 stars 1 forks source link

ECHR data collection query not getting all data? #2

Closed Cloud956 closed 1 year ago

Cloud956 commented 1 year ago

Example situation - all cases from 01/01/2022 to 02/02/2023 in English.

Hudoc search - 4016 results Hudoc query : link%20AND%20(languageisocode:"ENG")%20AND%20(kpdate>="2022-01-01T00:00:00.0Z"%20AND%20kpdate<="2023-02-02T00:00:00.0Z")&select=sharepointid,Rank,ECHRRanking,languagenumber,itemid,docname,doctype,application,appno,conclusion,importance,originatingbody,typedescription,kpdate,kpdateAsText,documentcollectionid,documentcollectionid2,languageisocode,extractedappno,isplaceholder,doctypebranch,respondent,advopidentifier,advopstatus,ecli,appnoparts,sclappnos&sort=&start=0&length=1&rankingModelId=11111111-0000-0000-0000-000000000000)

Our search - 3559 results Our query : link

Difference?

Our case query contains this: (documentcollectionid2:"JUDGMENTS" OR '\ 'documentcollectionid2:"COMMUNICATEDCASES" OR '\ 'documentcollectionid2:"DECISIONS" OR '\ 'documentcollectionid2:"CLIN")

Hudoc search does not contain this. Is this the intended behaviour?

shashankmc commented 1 year ago

@Cloud956 is this issue resolved? If yes, can we close this issue?

Cloud956 commented 1 year ago

Yes, I will close this issue.