clearlydefined / crawler

A service that crawls projects and packages for information relevant to ClearlyDefined
MIT License
43 stars 30 forks source link

Exclude .git directory content when calculating package file count #525

Closed qtomlinson closed 2 months ago

qtomlinson commented 7 months ago

This was discovered during sanity test for upgrading crawler image to use node:18-bullseye. The file count for pod/cocoapods/-/SoftButton/0.1.0 was different:

The content of .git directory should be excluded for the package file count.

Task: https://github.com/clearlydefined/crawler/issues/529

qtomlinson commented 7 months ago

@mpcen ready for review

qtomlinson commented 6 months ago

@jeffwilcox @mpcen rebased and ready for review.

qtomlinson commented 6 months ago

@elrayle ready for review

qtomlinson commented 6 months ago

The existing code only exclude .git directory itself, but not its containing files. This fix intends to exclude .git directory and its contents.

qtomlinson commented 3 months ago

@lumaxis @elrayle ready for review