issues
search
alirezamika
/
autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
MIT License
6.24k
stars
654
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
replace fuzzywuzzy with difflib
#36
maxbachmann
closed
3 years ago
1
ERROR: Package 'autoscraper' requires a different Python: 2.7.16 not in '>=3.6'
#35
mechengineermike
closed
3 years ago
4
Training text with extra spaces before and after while predicted text does not
#34
predoctech
closed
3 years ago
6
Need help to get price from the web site.
#33
somniapotato
closed
4 years ago
2
Asynchronous methods for fetching URLs, parsing HTML, and exporting data
#32
tarasivashchuk
closed
3 years ago
10
Scraper can't find requested data even though site is well-structured and consistent
#31
cstrouse
closed
4 years ago
3
build method with wanted_dict does not work.
#30
romain-utelly
closed
4 years ago
3
Update README.md
#29
jasonleonhard
closed
4 years ago
0
Add unique, keep_order and contain_similar_leaves parameters
#28
alirezamika
closed
4 years ago
0
About removing duplicate result
#27
Mervyen
closed
4 years ago
9
Add rule alias
#26
alirezamika
closed
4 years ago
0
Pulling tables would be awesome
#25
craine
closed
4 years ago
11
normalize the html content
#24
alirezamika
closed
4 years ago
0
Nonbreaking spaces lead to surprising behavior
#23
steve-bate
closed
4 years ago
4
Scrapping a private website page
#22
NickGoto
closed
4 years ago
11
Cannot support Chinese
#21
carrie-chris
closed
4 years ago
1
Added metadata field
#20
Narasimha1997
closed
4 years ago
8
Add ability to learn new rules while preserving the previous ones
#19
alirezamika
closed
4 years ago
1
Add support for incremental learning
#18
Narasimha1997
closed
4 years ago
4
Add Docstring
#17
alirezamika
closed
4 years ago
0
Replace filter with list comprehensions
#16
PickNickChock
closed
4 years ago
3
Added docstrings
#15
Narasimha1997
closed
4 years ago
1
unique refactor
#14
elquatro
closed
4 years ago
1
Not very reasonable, a lot of things change
#13
abbabb123
closed
4 years ago
1
Add ability to group results and remove unwanted ones
#12
alirezamika
closed
4 years ago
0
wanted_list presupposes knowledge of page
#11
zbrill
closed
4 years ago
1
Create python-publish.yml
#10
alirezamika
closed
4 years ago
0
Dev
#9
alirezamika
closed
4 years ago
0
Websites that require cookies
#8
denny64
closed
4 years ago
2
add URL with save & load function
#7
imadarsh1001
closed
4 years ago
1
Make the code a bit cleaner
#6
cthulhu-irl
closed
4 years ago
2
Add docs string
#5
Narasimha1997
closed
4 years ago
0
[refactor] optimize unique function
#4
Lulzx
closed
4 years ago
3
add host to default headers
#3
alirezamika
closed
4 years ago
0
bs4.FeatureNotFound Error
#2
redplant0
closed
4 years ago
2
Progression of errors while installing
#1
santiagodemierre
closed
4 years ago
3
Previous