Feature request: progress indicator when deep paging

sckott / habanero

client for Crossref search API

https://habanero.readthedocs.io

MIT License

207 stars 30 forks source link

Feature request: progress indicator when deep paging #77

Closed gorbynet closed 5 years ago

gorbynet commented 6 years ago

When systematically retrieving large data sets, it would be useful to have some way of measuring progress through the data harvest, e.g. show how many records have been retrieved, and how many there are in total, while the retrieval is ongoing.

sckott commented 6 years ago

thanks @gorbynet

I assume you mean with deep paging? looking into it, not done progress bars before in python

sckott commented 6 years ago

maybe https://github.com/weecology/retriever/blob/0112008b710d176fc543be174bcd1205cb1fef1e/retriever/lib/engine.py#L455 using https://pypi.org/project/tqdm/

sckott commented 6 years ago

clint is another option

gorbynet commented 6 years ago

Scott, thanks for picking this up. I'm not sure how it would work (I'm quite inexperienced at Python) but what I meant was that it would be useful to have some way of showing progress when deep paging through a large dataset. That might be a progress bar, or some way of the crossref module feeding back to the calling script so that the script can choose how to reflect that information.

sckott commented 6 years ago

thanks, i'll experiment and ask for your feedback