issues
search
cdimascio
/
essence
Automatically extract the main text content (and more) from an HTML document
Apache License 2.0
116
stars
16
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Wrong content from some sites
#12
Moosheimer
opened
1 year ago
3
demo page link not working
#11
Grienauer
opened
1 year ago
0
Bump jsoup from 1.11.3 to 1.15.3
#10
dependabot[bot]
opened
2 years ago
0
DocumentScorer.kt stopwords.size > 2 seems to be wrong
#9
zaixiaguozhen
opened
2 years ago
0
Bump kotlin-stdlib from 1.3.0 to 1.6.0
#8
dependabot[bot]
opened
2 years ago
0
Bump jsoup from 1.11.3 to 1.14.2
#7
dependabot[bot]
closed
2 years ago
1
Improve core logic to support non space tokenized languages like japenese
#6
mayankpunetha007
opened
3 years ago
0
Unable to parse text from yahoo html
#5
Gillani0
opened
3 years ago
0
Bump junit from 4.12 to 4.13.1
#4
dependabot[bot]
opened
4 years ago
0
docs: add Cleymax as a contributor
#3
allcontributors[bot]
closed
4 years ago
0
Any changes and improvements
#2
Cleymax
closed
4 years ago
2
Update README.md
#1
neroux
closed
4 years ago
0