clearlydefined / crawler

A service that crawls projects and packages for information relevant to ClearlyDefined
MIT License
48 stars 33 forks source link

Source location not properly harvested #446

Open netzdoktor opened 2 years ago

netzdoktor commented 2 years ago

The other day, I triggered a Harvest of a crate: https://clearlydefined.io/definitions/crate/cratesio/-/jsonschema/0.13.2

This happened with other crates as well: For some reason the harvester cannot properly fill the "Source Location" field, leading to a low score. However, crates.io has a clear link to the repository that should just be transferable.

Can someone confirm this/help debugging this?

qtomlinson commented 2 years ago

The reason that the github source location was not auto detected is that the ref/tag for the release was not recognized by clearlydefined. Typically, generic tag patterns like refs/tags/v${version} can be auto detected. For this particular case, the tag for the release is refs/tags/rust-v0.13.2 (see https://github.com/Stranger6667/jsonschema-rs/tags). Failing to recognize this release tag results in not able to find the commit hash for the release, and thus the source location not identified.