jannisborn / paperscraper

Tools to scrape publication metadata from pubmed, arxiv, medrxiv and chemrxiv.
MIT License
263 stars 31 forks source link

Adding a time range functionality to extract records from chemrxiv #25

Closed achouhan93 closed 1 year ago

achouhan93 commented 1 year ago

Hi @jannisborn, During the bulk extraction of the chemrxiv articles using chemrxiv functions. The current code used was not using a time range functionality for extraction. With this pull request, a time range functionality is added in the chemrxiv with the begin_date and end_date optional parameters. So if the user wants to extract articles for a specific time frame, then with these changes, they can extract the articles for the specific time frame. Thus, every time the function is executed, if the user specifies the begin_date and end_date, it will extract for a specific time frame; otherwise, it will take launch_date and today_date as the begin and end date. Thus, functionality is added to provide a begin_date and end_date parameter to the chemrxiv scripts for article extraction. Please consider this pull request.

jannisborn commented 1 year ago

@achouhan93 I incorporated those changes plus an example in the README (see #2c54db9)

If you agree, we can merge!

achouhan93 commented 1 year ago

Hi @jannisborn, Thank you so much for taking care of formatting the code using black and adding the logging part. I reviewed the changes, and it looks good to me. I agree with the changes, and we can merge the request.

jannisborn commented 1 year ago

Hey @achouhan93 I released a new version, it should be on pypi soon!

achouhan93 commented 1 year ago

Hello @jannisborn, I can see the new version on pypi😊👍. Thank you so much for accepting the changes.