lil-lab / recnet

A human-driven recommendation system for academic readings.
https://recnet.io
MIT License

arXiv links normalization and HTML detection #298

Closed by yoavartzi 1 month ago

yoavartzi commented 1 month ago

PDF links should normalize to abs links.

HTML links are not detected. They should also be normalized to abs links.

joannechen1223 commented 1 month ago

Updated the regex to https?:\/\/arxiv\.org\/(abs|pdf|html)\/(?<id>[0-9.]+(v[0-9]+)?) to handle the html case.
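
For reference, a minimal sketch of how this regex could drive the normalization to abs links. The helper name `normalizeArxivLink` and its `string | null` return shape are hypothetical and not taken from the RecNet codebase. One assumption worth flagging: because `[0-9.]+` also matches dots, a link ending in `.pdf` would leave a trailing dot in the captured id, so the sketch strips it.

```typescript
// Hypothetical helper, not RecNet's actual code.
// Same pattern as in the comment above, built from a string so it does not
// depend on the compiler's regex-literal target settings.
// Capture group 2 is the named group "id".
const ARXIV_URL = new RegExp(
  "https?:\\/\\/arxiv\\.org\\/(abs|pdf|html)\\/(?<id>[0-9.]+(v[0-9]+)?)"
);

function normalizeArxivLink(link: string): string | null {
  const match = ARXIV_URL.exec(link.trim());
  if (!match) {
    return null; // not an arXiv abs/pdf/html link this pattern recognizes
  }
  // Assumption: drop a trailing dot picked up from suffixes like ".pdf".
  const id = match[2].replace(/\.+$/, "");
  // The version suffix (e.g. v2) is kept when present.
  return `https://arxiv.org/abs/${id}`;
}

console.log(normalizeArxivLink("https://arxiv.org/pdf/2401.12345"));
// prints "https://arxiv.org/abs/2401.12345"
console.log(normalizeArxivLink("https://arxiv.org/html/2401.12345v2"));
// prints "https://arxiv.org/abs/2401.12345v2"
```

Links that do not match, such as non-arXiv URLs, return null so the caller can leave them untouched.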

yoavartzi commented 1 month ago

This also needs a fix on the frontend to take the link, @swh00tw.