dvmorozov / arxiv

ArxivExpress - arxiv.org client for Android and iOS, ArxivNavigator - interactive arxiv.org metadata visualization. I would appreciate any way of contributing: GitHub issue, email or pull request.
https://dvmorozov.github.io/arxiv/
Other
0 stars 0 forks source link

Graph displaying references hierarchy between articles #92

Open dvmorozov opened 1 year ago

dvmorozov commented 1 year ago

Task

  1. Help find most influential articles in the topic.
  2. Display most influential article in the topic node. ❓

Solution

  1. Implement Python script to compute cumulative number of references through other articles. ✔️
  2. Implement Python script to get list of most influential authorities.
  3. Modify Python script from arxiv-public-datasets to add last updating date.
  4. Implement graph page and JavaScripts to load and visualize data. Use Circle Packing graph.
  5. Add hyperlink opening graph in separate browser tab.

Data

https://www.kaggle.com/datasets/Cornell-University/arxiv https://github.com/mattbierbaum/arxiv-public-datasets/tree/821d543e9ed9d04426e8a76d471cd94ed92ff994

References

https://pypi.org/project/anytree/https://www.delftstack.com/howto/python/trees-in-python/

https://pypi.org/project/treelib/ https://treelib.readthedocs.io/en/latest/

Related

74 #107

dvmorozov commented 1 year ago

Script from arxiv-public-datasets requires python 3.7. Use
sudo docker pull python:3.7.14-bullseye sudo docker run -it -v arxiv-public-datasets-directory:/root python:3.7.14-bullseye /bin/bash