cjlee112 / spnet

selected papers network web engine
http://thinking.bioinformatics.ucla.edu/2011/07/02/open-peer-review-by-a-selected-papers-network/
GNU General Public License v2.0
40 stars 11 forks source link

Versioned arXiv IDs create duplicate Paper records #93

Closed cjlee112 closed 11 years ago

cjlee112 commented 11 years ago

First noted for 1108.1172 due to indexing a post that included both versioned and unversioned forms of this arXiv ID.

Implemented a test for this bug as test_incoming.test_arxiv_versions().

kim-em commented 11 years ago

Is there a long term plan for merging identifiers? E.g. suppose I cite something by a DOI, and we (perhaps later) learn that that article has also been mentioned on #spnetwork via an arXiv identifier. What's the process?

cjlee112 commented 11 years ago

@semorrison I wrote about how to handle merging of IDs from different domains (e.g. arXiv vs. DOI) here: https://github.com/cjlee112/spnet/wiki/Simple-Feature-Tutorial#deeper-data-model-considerations

The bottom line is it's straightforward to do, and we should do it. Since we're in "team building mode", people told me I should write tutorials showing how to do things like this, rather than just silently doing it... One way or the other, we should implement this soon.

cjlee112 commented 11 years ago

a related duplication issue: math.HO/9404236, see test_incoming.test_arxiv_versions2()