trappedinspacetime / wikiteam

Automatically exported from code.google.com/p/wikiteam
0 stars 0 forks source link

uploader.py should check for duplicates with originalurl #95

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
And upload in the existing item. This would also allow changing the identifier 
format in the future without cluttering the collection with multiple new 
identifiers. However, we'd probably first need to be able to fix incorrect 
metadata.
Old examples:
http://archive.org/details/wiki-enecgpediaorg
http://archive.org/details/wiki-en.ecgpedia.org

Original issue reported on code.google.com by nemow...@gmail.com on 2 Feb 2014 at 5:56

GoogleCodeExporter commented 8 years ago
This *may* be helped by issue 54 because the internetarchive library also has a 
search function, but it's useless if that metadata is not indexed.

Original comment by nemow...@gmail.com on 26 Feb 2014 at 11:32