ourresearch / citeas-api

Get the scholarly citation for any research product: software, preprint, paper, or dataset
https://citeas.org
MIT License
69 stars 4 forks source link

Problems with the CSV Export: , and line breaks #72

Open Finesim97 opened 1 year ago

Finesim97 commented 1 year ago

Hi,

First of all thank you providing this API! I wanted to use the CSV Export to also get the abstract.

The CSV export sometimes contains line breaks and additional ,. The JSON parts in this file also contain ,, but are distinguishable by including a whitespace and using a RegEx as the separator. However, sometimes there are exports, which are not parseable even with those hacks.

If you are interested, I could work on the following PRs:

  1. Automatically strip line breaks
  2. Quoting in the CSV
  3. TSV Export while stripping \t

Have a nice day!