dkpro/dkpro-c4corpus
DKPro C4CorpusTools is a collection of tools for processing the CommonCrawl corpus, including Creative Commons license detection, boilerplate removal, language detection, and near-duplicate removal.
https://dkpro.github.io/dkpro-c4corpus
Apache License 2.0
50 stars, 8 forks
Avoid deploying shaded JAR for hadoop module to repo/Maven central (#39)
Status: Closed (closed by reckart 8 years ago)
reckart commented 8 years ago:
Avoid deploying shaded JAR for hadoop module to repo/Maven central