inspirehep / inspire-next

The INSPIRE repo.
https://inspirehep.net
GNU General Public License v3.0
59 stars 69 forks source link

Matcher: Don't match #3496

Open ksachs opened 6 years ago

ksachs commented 6 years ago

Never propose a match for

ksachs commented 6 years ago

See "same doi, different arxiv ids" and "missing arXiv / legacy" in general on Zulip

michamos commented 6 years ago

See "same doi, different arxiv ids" and "missing arXiv / legacy" in general on Zulip

i.e. https://inspirehep.zulipchat.com/#narrow/stream/122196-general/subject/same.20doi.2C.20different.20arxiv.20ids/near/128309884 and https://inspirehep.zulipchat.com/#narrow/stream/122199-ops/subject/missing.20arXiv.20.2F.20legacy/near/128310108 (links can be found by clicking on the small dropdown arrow that appears when you hover on a message).

ksachs commented 6 years ago

(links can be found by clicking on the small dropdown arrow that appears when you hover on a message).

Thanks, but I still don't see the link. Need another video tutorial

ksachs commented 6 years ago

I'm not sure about this anymore:

In the long term the will be no CNUMs in the incoming feeds. We have this only while going via the DESY spider. So it might not be worth doing something.

Matching of different arXiv IDs: Yes, they are mostly wrong. But today we had a new record 'Accepted by MNRAS'. Matched to a not-that-old arXiv record with a MNRAS pubnote. That's a potential wrong merge to the older arXiv. Maybe a nice way to spot such mistakes.

ksachs commented 6 years ago

Match literature only to non-deleted HEP records.

https://labs.inspirehep.net/holdingpen/1179088 is matched to hidden collection (Fermilab) which are slides.