issues
search
son-dh
/
boilerpipe
Automatically exported from code.google.com/p/boilerpipe
0
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Program does not terminate for badly formatted/syntactically incorrect HTML input
#67
GoogleCodeExporter
opened
9 years ago
0
[deleted issue]
#66
GoogleCodeExporter
closed
9 years ago
0
BoilerplateBlockFilter ignores labelToKeep
#65
GoogleCodeExporter
opened
9 years ago
0
Never endning loop
#64
GoogleCodeExporter
opened
9 years ago
2
Difference WebApi - Api
#63
GoogleCodeExporter
opened
9 years ago
1
Hotpatched nekohtml classes cause library incompatibilities
#62
GoogleCodeExporter
opened
9 years ago
6
ContentFusion can change the order of document text
#61
GoogleCodeExporter
opened
9 years ago
0
Faulty XML encoding of characters in <script> tags in <head>
#60
GoogleCodeExporter
opened
9 years ago
0
Runtime Error while using boilerpipe in android
#59
GoogleCodeExporter
opened
9 years ago
2
Extract article HTML from given HTML source?
#58
GoogleCodeExporter
opened
9 years ago
1
BoilerPipe for Android
#57
GoogleCodeExporter
opened
9 years ago
9
Output as JSON
#56
GoogleCodeExporter
opened
9 years ago
0
Can not parse NYtimes pages
#55
GoogleCodeExporter
opened
9 years ago
2
Web api codes?
#54
GoogleCodeExporter
opened
9 years ago
0
Incorrect characters in Extractor output
#53
GoogleCodeExporter
opened
9 years ago
4
Please push 1.2 to maven central
#52
GoogleCodeExporter
opened
9 years ago
0
No tag in svn for 1.2?
#51
GoogleCodeExporter
opened
9 years ago
0
StackOverflowError when page includes another <body> part in <noframes>
#50
GoogleCodeExporter
opened
9 years ago
2
Article Image
#49
GoogleCodeExporter
opened
9 years ago
0
hybrid extractor?
#48
GoogleCodeExporter
opened
9 years ago
0
Errors deploying to Android
#47
GoogleCodeExporter
opened
9 years ago
0
Library does not produce same results as http://boilerpipe-web.appspot.com/
#46
GoogleCodeExporter
opened
9 years ago
5
Ignore FORM tags in HTMLHighlighter
#45
GoogleCodeExporter
closed
9 years ago
1
Ignore FORM tags in HTMLHighlighter
#44
GoogleCodeExporter
opened
9 years ago
3
DocumentTitleMatchClassifier should include the « and • characters
#43
GoogleCodeExporter
opened
9 years ago
0
Patch for /trunk/boilerpipe-core/src/main/de/l3s/boilerpipe/filters/heuristics/DocumentTitleMatchClassifier.java
#42
GoogleCodeExporter
closed
9 years ago
1
Title detection: Treat non-breaking space as whitespace
#41
GoogleCodeExporter
closed
9 years ago
6
Patch for /trunk/boilerpipe-core/src/main/de/l3s/boilerpipe/sax/DefaultTagActionMap.java
#40
GoogleCodeExporter
closed
9 years ago
1
Patch for /trunk/boilerpipe-core/src/main/de/l3s/boilerpipe/sax/CommonTagActions.java
#39
GoogleCodeExporter
closed
9 years ago
1
Patch for /trunk/boilerpipe-core/src/main/de/l3s/boilerpipe/sax/BoilerpipeHTMLContentHandler.java
#38
GoogleCodeExporter
closed
9 years ago
2
timeout and fallback strategy for boilerpipe
#37
GoogleCodeExporter
closed
9 years ago
6
ImageExtractor doesn't detect alternative images for Object plugins
#36
GoogleCodeExporter
closed
9 years ago
1
word counting code does not account for & being special html symbol.
#35
GoogleCodeExporter
closed
9 years ago
2
Add 'getInstance' accessor for ImageExtractor
#34
GoogleCodeExporter
closed
9 years ago
2
Bad xml format in html output from Web API
#33
GoogleCodeExporter
opened
9 years ago
1
Documentation - How to output html extract fragement instead of text?
#32
GoogleCodeExporter
closed
9 years ago
4
Support HTML5 elements
#31
GoogleCodeExporter
opened
9 years ago
2
Outputs html instead of plain text for certain urls
#30
GoogleCodeExporter
closed
9 years ago
2
boilerpipe crash
#29
GoogleCodeExporter
closed
9 years ago
1
UTF characters are not handled correctly
#28
GoogleCodeExporter
closed
9 years ago
3
Add 1.2.0 release to maven repository
#27
GoogleCodeExporter
closed
9 years ago
1
Tags missing in output html
#26
GoogleCodeExporter
closed
9 years ago
4
Feature Request - api to return character offsets of non-boilerplate text
#25
GoogleCodeExporter
closed
9 years ago
3
Boilepipe fails (but not web api edition)
#24
GoogleCodeExporter
closed
9 years ago
4
Encoding problem (input is interpreted as Latin-1)
#23
GoogleCodeExporter
closed
9 years ago
2
Page not being parsed correctly <li> the issue.
#22
GoogleCodeExporter
closed
9 years ago
9
Included nekhtml 1.9.9 mising LostText class
#21
GoogleCodeExporter
closed
9 years ago
2
Featurerequest: Run boilerpipe as a command line tool
#20
GoogleCodeExporter
opened
9 years ago
3
Code for Google app-engine?
#19
GoogleCodeExporter
opened
9 years ago
8
Description of different extractors?
#18
GoogleCodeExporter
closed
9 years ago
3
Next