karussell snacktory issues

karussell / snacktory

Readability clone in Java

461 stars 159 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Bump junit from 4.11 to 4.13.1

#67 dependabot[bot] opened 4 years ago
0
Added notes about migrating to Crux

#66 chimbori closed 7 years ago
1
Bad parsing of article from `nytimes`

#65 Hronom opened 7 years ago
0
Bad parsing of article from `cnbc`

#64 Hronom opened 7 years ago
0
Extract text, title, etc from url without fetch url (avoid its downloading)

#63 adelaidaram opened 7 years ago
0
Extract images even if they all have weights exactly zero

#62 todvora closed 7 years ago
2
Update README.md - add jitpack.io build link

#61 todvora closed 7 years ago
1
Fix Extraction Issues

#60 abhishekvm closed 7 years ago
0
Not able to extract content

#59 saketmalpure opened 7 years ago
1
Please don't cause referrer spam

#58 Flameeyes closed 7 years ago
3
Changes to handle multiple contiguous article elements

#57 dmorgan-github closed 7 years ago
0
Converter.detectCharset throws for inputs longer than 2048

#56 LBoraz opened 7 years ago
0
Crux, an Android-optimized fork of Snacktory, with many issues fixed

#55 chimbori opened 7 years ago
7
Stack overflow ...

#54 alanlit opened 8 years ago
0
Make it possible to Increase maxBytes in HtmlFetcher

#53 falmanna opened 8 years ago
0
dependency via sbt

#52 sebastian-alfers opened 8 years ago
0
Not working

#51 sahilshekhawat opened 8 years ago
0
wrong imageUrl in youtube url's

#50 mufumbo opened 8 years ago
2
NoClassDefFoundError: Could not initialize class de.jetwick.snacktory.HtmlFetcher

#49 danielabar opened 8 years ago
4
Updated test cases

#48 nzv8fan closed 8 years ago
2
Fix Issue #42 to improve content identification

#47 nzv8fan closed 8 years ago
1
String text ignores paragraphs, isn't there a way to get the text in html

#46 Mohamed164 opened 9 years ago
0
Using values og:title and twitter:title before value of <title>

#45 kovcic closed 9 years ago
4
Fix issue #10 allow users to set a proxy

#44 kinow closed 9 years ago
7
Allow users to set a proxy

#43 kinow closed 9 years ago
5
Many websites only extract partial content

#42 rubdottocom opened 9 years ago
7
Update README.md

#41 jonathansantilli closed 10 years ago
0
Misspelling in README file

#40 jonathansantilli closed 10 years ago
0
Make it possible to extract a list of texts

#39 hnrc closed 10 years ago
4
Unsupported Popular Internet Landmarks

#38 OnlyInAmerica opened 10 years ago
1
Fetch content from Twitter URLs?

#37 rubdottocom opened 10 years ago
4
Snacktory on Android? java.beans.Introspector

#36 rubdottocom closed 10 years ago
6
Constructor jsoup doc

#35 bejean closed 10 years ago
3
Text content is removed when there is an image in news webpage.

#34 avi20072008 opened 11 years ago
1
determineImageSource and list of images in bestMatchElement

#33 bejean closed 11 years ago
4
determineImageSource for width or height = 50

#32 bejean closed 11 years ago
1
h* elements are removed from returned text

#31 bejean closed 11 years ago
0
Preserve paragraphs?

#30 ondrejmirtes closed 10 years ago
4
Can't split getText() into paragraphs

#29 liusiqi43 closed 11 years ago
2
Remove noscript elements

#28 dajac closed 11 years ago
1
Ensure use of slf4j over log4j in Converter class

#27 tjerkw closed 11 years ago
3
jsoup 1.7.1

#26 dajac closed 12 years ago
3
Twitter Cards

#25 dajac closed 12 years ago
1
Provide optional extraction directives

#24 bejean opened 12 years ago
3
determineImageSource for images without width and height attributes including tests

#23 bejean closed 12 years ago
15
determineImageSource for images without width and height attributes

#22 bejean closed 12 years ago
6
determineImageSource for images without width and height attributes

#21 bejean closed 11 years ago
5
hello

#20 bejean closed 12 years ago
0
Build fail due to test failed

#19 bejean closed 12 years ago
5
fix exception in keyword extraction

#18 kireet closed 12 years ago
1