fasten-project / fasten

Analyse package dependency networks at the call graph level
https://www.fasten-project.eu
Apache License 2.0
91 stars 28 forks source link

Update Metadata DB schema for ingested artifacts #396

Closed proksch closed 2 years ago

proksch commented 2 years ago

This PR proposes to change the ingestion table to only contain a generic key that can be checked for existence. This allows for a more generic handling of the duplicate detection and can be used as the respective plugin sees fit. For example, Java's PomAnalyzer will use keys like gid:aid:packaging:version-lane, but C/Python might want to use different keys, e.g., if they do not use the lane concept. We could also think about using this key for additional duplication checks later in the pipeline, e.g., by making the pipeline stage part of it.

The adopted schema file can be found in the docker-compose repository.