pybliometrics-dev / pybliometrics

Python-based API-Wrapper to access Scopus
https://pybliometrics.readthedocs.io/en/stable/
Other
414 stars 129 forks source link

#338 Unescape unicode #341

Closed nils-herrmann closed 3 months ago

nils-herrmann commented 3 months ago

Unescape unicode in following fields of results: affilname, afid, affiliation-city, affiliation-country. Now thinking about it, should we just unescape affilname @Michael-E-Rose ?

Michael-E-Rose commented 3 months ago

Yes, let's unescape affilname as well.

Could you also format the function https://github.com/pybliometrics-dev/pybliometrics/blob/master/pybliometrics/scopus/utils/parse_content.py#L95 please? Format correctly and write a docstring

nils-herrmann commented 3 months ago

And what about the title and keywords like here. Do we also want to unescape?

Michael-E-Rose commented 3 months ago

Yes, let's also apply this to title, abstract, keywords and index terms as well. All text

Michael-E-Rose commented 3 months ago

Actually, we should also apply to this to the ScopusSearch() class, fields "description", "authkeywords" and "title".