issues
search
lorey
/
mlscraper
🤖 Scrape data from HTML websites automatically by just providing examples
https://pypi.org/project/mlscraper/
1.31k
stars
89
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Module not found error?
#43
Immortalus13
closed
10 months ago
2
Is it possible to handle anti-scraping measures?
#42
omaiyiwa
closed
1 year ago
2
High memory usage with github page as sample
#41
entrptaher
opened
1 year ago
10
Adding question mark to the sample fails
#40
entrptaher
opened
1 year ago
8
Extreme RAM usage
#39
BitGeek29
closed
1 year ago
8
Scraper not found error
#38
soumyabroto
closed
1 year ago
6
Spiegel example from Gist
#37
antonengelhardt
opened
1 year ago
3
è®ç»ƒå‡ºçŽ°é”™è¯¯
#36
liliwen365
closed
1 year ago
1
Does not work with some sites
#35
GyurkanM
closed
1 year ago
6
Installation issues
#34
GyurkanM
closed
1 year ago
0
Bump cryptography from 37.0.2 to 39.0.1 in /requirements
#33
dependabot[bot]
opened
1 year ago
0
Bump wheel from 0.37.1 to 0.38.1 in /requirements
#32
dependabot[bot]
opened
1 year ago
0
Bump certifi from 2022.6.15 to 2022.12.7 in /requirements
#31
dependabot[bot]
opened
1 year ago
0
Example from docs does not work
#30
creatorrr
closed
2 years ago
1
missing mlscraper.html
#29
appsec-airito
closed
2 years ago
6
Feature/test example
#28
leo8198
closed
2 years ago
6
Improve version pinning
#27
lorey
closed
1 year ago
0
how to save the model ?
#26
Tlntin
closed
2 years ago
3
Spiegel authors not scraped if defined as list
#25
lorey
closed
2 years ago
0
Find better selectors
#24
lorey
opened
2 years ago
2
Find and fix issue with github profile pages
#23
lorey
closed
2 years ago
4
Show progress during training
#22
lorey
opened
2 years ago
1
Integer Matching
#21
lorey
opened
2 years ago
0
Match substrings
#20
lorey
opened
2 years ago
1
Feedback
#19
jonashaag
opened
2 years ago
18
Improve errors when no match is found
#18
lorey
closed
2 years ago
0
Re-think relationship between samples and matches
#17
lorey
opened
2 years ago
0
Training Set generation is cumbersome
#16
lorey
opened
2 years ago
0
Fuzzy text matching
#15
lorey
opened
2 years ago
1
Stackoverflow example not working
#14
rish-hyun
closed
2 years ago
2
Bump lxml from 4.5.1 to 4.6.5
#13
dependabot[bot]
closed
2 years ago
1
Bump pip from 19.2.3 to 21.1
#12
dependabot[bot]
closed
2 years ago
1
Bump urllib3 from 1.25.9 to 1.26.5
#11
dependabot[bot]
closed
2 years ago
1
Bump py from 1.8.1 to 1.10.0
#10
dependabot[bot]
closed
2 years ago
1
Bump lxml from 4.5.1 to 4.6.3
#9
dependabot[bot]
closed
2 years ago
1
Bump lxml from 4.5.1 to 4.6.2
#8
dependabot[bot]
closed
3 years ago
1
Split tests by type of test not by extraction methods
#7
lorey
closed
2 years ago
0
Resolve flake8 issues
#6
lorey
closed
2 years ago
0
Enable increasing complexity in rule-based scraper
#5
lorey
opened
4 years ago
1
Create readthedocs
#4
lorey
opened
4 years ago
1
Include fetching in scrapers
#3
lorey
closed
2 years ago
1
Test example code
#2
lorey
opened
4 years ago
2
Separate rule extraction from scrapers
#1
lorey
closed
2 years ago
1