issues
search
misja
/
python-boilerpipe
Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages
Other
539
stars
143
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
user-agent value changed as it was giving HTTP 406 errors for some links.
#59
krishnaupadhyay3
closed
3 years ago
0
Fix simple typo: argment -> argument
#58
timgates42
closed
3 years ago
1
Issue python-boilerpipe on docker
#57
lraghib
opened
4 years ago
1
Changed the googlecode link
#56
surya-iquanti
opened
4 years ago
0
jpype._jclass.java.lang.NoClassDefFoundError: java.lang.NoClassDefFoundError: de/l3s/boilerpipe/sax/ImageExtractor
#55
liyongrui
opened
5 years ago
1
Replaced socket.setdefaulttimeout with urlopen timeout
#53
arisudesu
closed
7 years ago
1
Use of socket.setdefaulttimeout in import-level code
#52
arisudesu
closed
7 years ago
5
Corrected attribute presence conditions
#51
arisudesu
closed
7 years ago
1
Empty html causes exception from Extractor
#50
arisudesu
closed
7 years ago
3
I use python-boilerpipe on win10 but it doesn't work
#49
YoshiPark
opened
7 years ago
0
"not a gzip file" error
#48
haoransh
opened
7 years ago
1
"not a gzip file
#47
haoransh
closed
7 years ago
0
Segmentation fault (core dumped) on import
#46
DeckardSG
opened
7 years ago
7
Error: Process finished with exit code -1073741819 (0xC0000005)
#45
mansoorfayyaz
opened
7 years ago
6
gzip pages decompressor added
#44
mahdi-saberi
opened
7 years ago
0
Issues 42
#43
tuxdna
closed
7 years ago
0
Does anyone use KeepEverythingWithMinKWordsExtractor ?
#42
hugsbrugs
closed
7 years ago
5
kernel crashes
#41
mihir-k
closed
7 years ago
4
Remove charade in favour of chardet
#40
aniav
closed
8 years ago
1
python 3 compatibility
#39
gutfeeling
closed
8 years ago
0
Fix installation - updated setup.py and README.rst
#38
tuxdna
closed
8 years ago
0
change in tgz_url value
#37
ghost
opened
8 years ago
1
Cannot run `python setup.py`
#36
dhruvghulati-zz
closed
8 years ago
2
Use proper request headers when opening URI.
#35
ccgillett
opened
8 years ago
1
Import Error in boilerpipe python 2.7 . JVM isn't starting
#34
ethan-hunt-007
closed
7 years ago
3
konlpy & boilerpipe mutually creates exceptions
#33
e9t
opened
9 years ago
1
Caimany
#32
Caimany
opened
9 years ago
1
fixed getImages RuntimeError when using boilerpipe-1.2.0.jar built…
#31
benpryke
opened
9 years ago
5
Can't find Java runtime even though JPype installed
#30
peterswang
opened
9 years ago
3
java.lang.OutOfMemoryError: Java heap space after multiple getHTML calls
#29
alibozorgkhan
opened
9 years ago
1
some urls will not work with celery
#28
Zman67
opened
9 years ago
2
python-boilerpipe setup.py fails using Python3
#27
rhmccullough
opened
10 years ago
1
Import Error
#26
fccoelho
opened
10 years ago
1
urllib2 headers changed
#25
rshiva
opened
10 years ago
1
Boilerpipe fails to extract certain urls with 406 Error
#24
rshiva
closed
3 years ago
5
Installation via pip fails
#23
JustinGibbons
opened
10 years ago
2
Implementing the JSON method?
#22
mittenchops
opened
10 years ago
2
unable to find module, is path affected by OS language?
#21
iuribeferrari
opened
10 years ago
0
No module named jpype, error
#20
iuribeferrari
closed
10 years ago
2
Encoding Issues - UnicodeDecodeError: 'utf8' codec can't decode byte
#19
jimishjoban
opened
10 years ago
1
UnicodeEncodeError: 'ascii' codec can't encode character u'\xbb' in position 20: ordinal not in range(128)
#18
marcoippolito
opened
10 years ago
1
A fatal error has been detected by the Java Runtime Environment: SIGSEGV (0xb)
#17
rshiva
opened
10 years ago
9
boilerpipe hangs in multiprocessing program
#16
dpatro
closed
10 years ago
1
added getTitle function
#15
kanarinka
opened
10 years ago
2
extractor.getImage raises an exception
#14
Cutuchiqueno
closed
11 years ago
1
LookupError when giving url as one that is already saved on the disk (file:///)
#13
sekon
opened
11 years ago
1
Changes for PyPI version 1.2.0.0
#12
ptwobrussell
closed
11 years ago
1
Update setup.py
#11
marsam
closed
11 years ago
8
Image Extraction does not work
#10
codelucas
closed
11 years ago
4
Chardet -> Charade?
#9
honzajavorek
closed
11 years ago
1
Next