tommv / bibliograph

A tool to create and explore bibliometrics graphs
GNU General Public License v3.0
7 stars 1 forks source link

Fishy extraction from ISI Web of Science #29

Closed tommv closed 2 years ago

tommv commented 2 years ago

Using the attached CSV file test.csv, I am said in the filters page that there are "670 References occurring in at least 2 records". Yet when I generate the network, I obtain a GEXF file with only 93 references. The graph export txt also confirm 670. I know that we remove orphans nodes, but I don't think it's right that there are some many of them.

Also the graph export says that the corpus contains 172 notices (adding up by year break-down) or 176 (adding up by type break-down), but the input CSV has 500 lines. Again, I know that there might me duplicates and badly formatted line, but maybe not so many.

Sorry for bothering and thanks for looking into it.