ropensci / openalexR

Getting bibliographic records from OpenAlex
https://docs.ropensci.org/openalexR/
Other
91 stars 20 forks source link

Input to biblioshiny #123

Open trangdata opened 1 year ago

trangdata commented 1 year ago

@rkrug reported this issue to me via Slack. @massimoaria could you take a look plz? 🌻

Rainer is trying to load the output from oa2bibliometrix to an application called biblioshiny, but it seems like there are a few columns missing. I believe we can fill in to increase the completeness of the data, for example, change the column name doi to DI (but I'm unsure if we need to make it all uppercase). I'm not that familiar with biblioshiny but maybe you can help @massimoaria. Other columns we can try to fill: DE, LA, NR, WC, which are keywords, language, number of cited references, science categories, respectively.

library(openalexR)
#> Thank you for using openalexR!
#> To acknowledge our work, please cite the package by calling
#> `citation("openalexR")`.
flat_snow <- snowball2df(oa_snowball(
  identifier = "W1516819724",
  verbose = TRUE
))
#> Requesting url: https://api.openalex.org/works/W1516819724
#> Collecting all documents citing the target papers...
#> Requesting url: https://api.openalex.org/works?filter=cites%3AW1516819724
#> Getting 1 page of results with a total of 5 records...
#> Collecting all documents cited by the target papers...
#> Requesting url: https://api.openalex.org/works?filter=cited_by%3AW1516819724
#> Getting 1 page of results with a total of 3 records...
to_biblio <- oa2bibliometrix(flat_snow)

sort(names(to_biblio))
#>  [1] "AB"                "AU"                "AU_CO"            
#>  [4] "AU_UN"             "author"            "backward_count"   
#>  [7] "C1"                "cited_by"          "cited_by_api_url" 
#> [10] "citing"            "concepts"          "connection"       
#> [13] "connection_count"  "counts_by_year"    "CR"               
#> [16] "DB"                "doi"               "DT"               
#> [19] "first_page"        "forward_count"     "host_organization"
#> [22] "ID"                "id_oa"             "id_url"           
#> [25] "ids"               "is_oa"             "is_paratext"      
#> [28] "is_retracted"      "issn_l"            "issue"            
#> [31] "J9"                "JI"                "last_page"        
#> [34] "license"           "oa_input"          "pdf_url"          
#> [37] "publication_date"  "PY"                "referenced_works" 
#> [40] "related_works"     "RP"                "SO"               
#> [43] "so_id"             "SR"                "SR_FULL"          
#> [46] "TC"                "TI"                "url"              
#> [49] "version"           "volume"

Created on 2023-07-15 with reprex v2.0.2

biblioshiny

trangdata commented 1 year ago

CC @yjunechoe as well in case you have some insight into this format.

massimoaria commented 1 year ago

Please take a look at this https://github.com/massimoaria/bibliometrix/issues/380