KnowledgeCaptureAndDiscovery / somef

SOftware Metadata Extraction Framework: A tool for automatically extracting relevant software information from readme files
MIT License
44 stars 22 forks source link

error in doi recognition #553

Closed Ardark27 closed 10 months ago

Ardark27 commented 1 year ago

error in doi recognition:

repo: https://github.com/ika-rwth-aachen/DEviLOG

cite:

@misc{https://doi.org/10.48550/arxiv.2211.08278,
  doi = {10.48550/ARXIV.2211.08278},  
  url = {https://arxiv.org/abs/2211.08278},
  author = {van Kempen, Raphael and Lampe, Bastian and Reiher, Lennart and Woopen, Timo and Beemelmanns, Till and Eckstein, Lutz},
  title = {Data-Driven Occupancy Grid Mapping using Synthetic and Real-World Data},
  year = {2022},
  copyright = {Creative Commons Attribution Non Commercial Share Alike 4.0 International}
}

result:

"citation": [
        {
            "result": {
                "value": "@misc{https://doi.org/10.48550/arxiv.2211.08278,\n  doi = {10.48550/ARXIV.2211.08278},  \n  url = {https://arxiv.org/abs/2211.08278},\n  author = {van Kempen, Raphael and Lampe, Bastian and Reiher, Lennart and Woopen, Timo and Beemelmanns, Till and Eckstein, Lutz},\n  title = {Data-Driven Occupancy Grid Mapping using Synthetic and Real-World Data},\n  year = {2022},\n  copyright = {Creative Commons Attribution Non Commercial Share Alike 4.0 International}\n}",
                "type": "Text_excerpt",
                "format": "bibtex",
                "doi": "https://doi.org/10.48550/arxiv.2211.08278,\n  doi = {10.48550/ARXIV.2211.08278"
            },
            "confidence": 1,
            "technique": "regular_expression",
            "source": "https://raw.githubusercontent.com/ika-rwth-aachen/DEviLOG/main/README.md"
        }
    ]

doi result should be https://doi.org/10.48550/arxiv.2211.08278 or 10.48550/ARXIV.2211.08278

dgarijo commented 1 year ago

Looks like the regular expression is not working correctly. Will have a look