scholarly-python-package / scholarly

Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
https://scholarly.readthedocs.io/
The Unlicense
1.43k stars 307 forks source link

get publication's co-author names and published venue #264

Closed Misty-2409 closed 3 years ago

Misty-2409 commented 3 years ago

using the proxy generator, I have successfully found all the publication titles and their number of citations. But I cannot generate the name of co-authors and venue in addition to the publication's title. To sum up, I need all the publication names, their corresponding venue, co-author names and number of citation. something like:

publications: {title: x author: [a,b,c] venue: q noc:12 }

How to do that? Here is a sample of my code.

`from scholarly import scholarly, ProxyGenerator

pg = ProxyGenerator() pg.Tor_Internal(tor_cmd = "tor") scholarly.use_proxy(pg)

search_query = scholarly.search_author('Seemanta Saha') author_fill = next(search_query) author = scholarly.fill(author_fill)

print(author)

title = [pub['bib']['title'] for pub in author['publications']] number_of_citations = [pub['num_citations'] for pub in author['publications']]`

ipeirotis commented 3 years ago

You need to also use the scholarly.fill command for each publication. Note that you can query for authors without a problem, but once you start filling publications, you will need a proxy service.

Misty-2409 commented 3 years ago

using scholarly.fill command for each publication, I can capture the co-author list, published year etc. But I cannot capture the published venue for conference. I mean to say I can get the name of the journal but cannot get the name of the conference. The only thing that I get is the name of the publisher (IEEE) for a conference. I tried to use a loop for searching each of the publications through search_pubs(). But I cannot get result more than three publications.

Is there any other way?

ipeirotis commented 3 years ago

You will need to use a proxy, most likely.