chembl / chembl_webservices_2

Source code of the ChEMBL web services.
https://www.ebi.ac.uk/chembl/ws
Other
16 stars 3 forks source link

Consider increasing max_limit in paginator #175

Open hammer opened 5 years ago

hammer commented 5 years ago

How was the value 1000 chosen for https://github.com/chembl/chembl_webservices_2/blob/master/chembl_webservices/core/pagination.py#L24? There does not seem to have been much discussion around this decision: https://github.com/chembl/chembl_webservices_2/issues/15.

I am trying to prepare data for the Open Targets Platform data pipeline using the code in https://github.com/opentargets/platform-input-support. By far the slowest part of this script is the retrieval of data from ChEMBL through the REST interface, in particular from the molecule endpoint. If we could retrieve data with larger page size this script would run much faster.

Alternatively, does the ChEMBL team make individual tables available for download? I've found the ChEMBLdb download but it's over a GB so it takes quite some time to download.