issues
search
karussell
/
snacktory
Readability clone in Java
461
stars
159
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Bump junit from 4.11 to 4.13.1
#67
dependabot[bot]
opened
4 years ago
0
Added notes about migrating to Crux
#66
chimbori
closed
7 years ago
1
Bad parsing of article from `nytimes`
#65
Hronom
opened
7 years ago
0
Bad parsing of article from `cnbc`
#64
Hronom
opened
7 years ago
0
Extract text, title, etc from url without fetch url (avoid its downloading)
#63
adelaidaram
opened
7 years ago
0
Extract images even if they all have weights exactly zero
#62
todvora
closed
7 years ago
2
Update README.md - add jitpack.io build link
#61
todvora
closed
7 years ago
1
Fix Extraction Issues
#60
abhishekvm
closed
7 years ago
0
Not able to extract content
#59
saketmalpure
opened
7 years ago
1
Please don't cause referrer spam
#58
Flameeyes
closed
7 years ago
3
Changes to handle multiple contiguous article elements
#57
dmorgan-github
closed
7 years ago
0
Converter.detectCharset throws for inputs longer than 2048
#56
LBoraz
opened
7 years ago
0
Crux, an Android-optimized fork of Snacktory, with many issues fixed
#55
chimbori
opened
7 years ago
7
Stack overflow ...
#54
alanlit
opened
8 years ago
0
Make it possible to Increase maxBytes in HtmlFetcher
#53
falmanna
opened
8 years ago
0
dependency via sbt
#52
sebastian-alfers
opened
8 years ago
0
Not working
#51
sahilshekhawat
opened
8 years ago
0
wrong imageUrl in youtube url's
#50
mufumbo
opened
8 years ago
2
NoClassDefFoundError: Could not initialize class de.jetwick.snacktory.HtmlFetcher
#49
danielabar
opened
8 years ago
4
Updated test cases
#48
nzv8fan
closed
8 years ago
2
Fix Issue #42 to improve content identification
#47
nzv8fan
closed
8 years ago
1
String text ignores paragraphs, isn't there a way to get the text in html
#46
Mohamed164
opened
9 years ago
0
Using values og:title and twitter:title before value of <title>
#45
kovcic
closed
9 years ago
4
Fix issue #10 allow users to set a proxy
#44
kinow
closed
9 years ago
7
Allow users to set a proxy
#43
kinow
closed
9 years ago
5
Many websites only extract partial content
#42
rubdottocom
opened
9 years ago
7
Update README.md
#41
jonathansantilli
closed
10 years ago
0
Misspelling in README file
#40
jonathansantilli
closed
10 years ago
0
Make it possible to extract a list of texts
#39
hnrc
closed
10 years ago
4
Unsupported Popular Internet Landmarks
#38
OnlyInAmerica
opened
10 years ago
1
Fetch content from Twitter URLs?
#37
rubdottocom
opened
10 years ago
4
Snacktory on Android? java.beans.Introspector
#36
rubdottocom
closed
10 years ago
6
Constructor jsoup doc
#35
bejean
closed
10 years ago
3
Text content is removed when there is an image in news webpage.
#34
avi20072008
opened
11 years ago
1
determineImageSource and list of images in bestMatchElement
#33
bejean
closed
11 years ago
4
determineImageSource for width or height = 50
#32
bejean
closed
11 years ago
1
h* elements are removed from returned text
#31
bejean
closed
11 years ago
0
Preserve paragraphs?
#30
ondrejmirtes
closed
10 years ago
4
Can't split getText() into paragraphs
#29
liusiqi43
closed
11 years ago
2
Remove noscript elements
#28
dajac
closed
11 years ago
1
Ensure use of slf4j over log4j in Converter class
#27
tjerkw
closed
11 years ago
3
jsoup 1.7.1
#26
dajac
closed
12 years ago
3
Twitter Cards
#25
dajac
closed
12 years ago
1
Provide optional extraction directives
#24
bejean
opened
12 years ago
3
determineImageSource for images without width and height attributes including tests
#23
bejean
closed
12 years ago
15
determineImageSource for images without width and height attributes
#22
bejean
closed
12 years ago
6
determineImageSource for images without width and height attributes
#21
bejean
closed
11 years ago
5
hello
#20
bejean
closed
12 years ago
0
Build fail due to test failed
#19
bejean
closed
12 years ago
5
fix exception in keyword extraction
#18
kireet
closed
12 years ago
1
Next