issues
search
entrepreneur-interet-general
/
OpenScraper
An open source webapp for scraping: towards a public service for webscraping
http://www.cis-openscraper.com/
MIT License
92
stars
22
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Bump certifi from 2018.1.18 to 2022.12.7
#88
dependabot[bot]
opened
1 year ago
0
Bump twisted from 17.9.0 to 22.10.0
#87
dependabot[bot]
opened
2 years ago
0
Bump scrapy from 1.5.0 to 2.6.2
#86
dependabot[bot]
opened
2 years ago
0
Bump lxml from 4.1.1 to 4.9.1
#85
dependabot[bot]
opened
2 years ago
0
Bump scrapy from 1.5.0 to 2.6.1
#84
dependabot[bot]
closed
2 years ago
1
Bump twisted from 17.9.0 to 22.4.0
#83
dependabot[bot]
closed
2 years ago
1
Bump pyjwt from 1.6.0 to 2.4.0
#82
dependabot[bot]
opened
2 years ago
0
Bump twisted from 17.9.0 to 22.2.0
#81
dependabot[bot]
closed
2 years ago
1
Bump scrapy from 1.5.0 to 1.8.2
#80
dependabot[bot]
closed
2 years ago
1
Bump twisted from 17.9.0 to 22.1.0
#79
dependabot[bot]
closed
2 years ago
1
Bump lxml from 4.1.1 to 4.6.5
#78
dependabot[bot]
closed
2 years ago
1
Bump babel from 2.5.3 to 2.9.1
#77
dependabot[bot]
opened
3 years ago
0
Bump scrapy-splash from 0.7.2 to 0.8.0
#76
dependabot[bot]
opened
3 years ago
0
Bump scrapy from 1.5.0 to 1.8.1
#75
dependabot[bot]
closed
2 years ago
1
Bump urllib3 from 1.24.1 to 1.26.5
#74
dependabot[bot]
opened
3 years ago
0
Bump lxml from 4.1.1 to 4.6.3
#73
dependabot[bot]
closed
2 years ago
1
Bump pygments from 2.2.0 to 2.7.4
#72
dependabot[bot]
opened
3 years ago
0
Bump jinja2 from 2.10 to 2.11.3
#71
dependabot[bot]
opened
3 years ago
0
Bump lxml from 4.1.1 to 4.6.2
#70
dependabot[bot]
closed
3 years ago
1
Bump cryptography from 2.3.1 to 3.2
#69
dependabot[bot]
opened
4 years ago
0
Bump twisted from 17.9.0 to 19.7.0
#68
dependabot[bot]
closed
2 years ago
1
Fix "TSV" generation
#67
CBalsier
opened
5 years ago
0
added option for chrome headless
#66
retdop
opened
5 years ago
0
Scraper with Selenium config behaves differently with or without headless option
#65
retdop
opened
5 years ago
2
Lien pour installer un environnement virtuel
#64
Tony4469
opened
5 years ago
0
While testing scraper should save the data in separate collection, not erase previous results in main DB
#63
JulienParis
opened
5 years ago
0
Lancer régulièrement les mêmes spiders est un processus douloureux
#62
DavidBruant
opened
5 years ago
2
Modifier masterspider.py risque de casser des spiders existantes
#61
DavidBruant
opened
5 years ago
2
Who maintains OpenScraper?
#60
bzg
opened
5 years ago
0
server in https / lets encrypt
#59
JulienParis
opened
5 years ago
0
Proposition de design alternatif
#58
DavidBruant
opened
5 years ago
9
I cannot see the entire summary of a scrapped page
#57
DavidBruant
opened
5 years ago
0
Typo in field type "adress"
#56
DavidBruant
opened
5 years ago
0
export/import a contributor
#55
DavidBruant
opened
5 years ago
0
export/import data model
#54
DavidBruant
opened
5 years ago
0
ImportError: cannot import name json_util error on install
#53
DavidBruant
closed
5 years ago
3
What is app_scrapnado ?
#52
DavidBruant
closed
5 years ago
1
in install doc, talk about openscraper/config/settings_example.py
#51
DavidBruant
closed
5 years ago
2
better switch when running main
#50
JulienParis
closed
5 years ago
3
better doc for install : more about settings !
#49
JulienParis
closed
5 years ago
3
fix search with or without commas
#48
JulienParis
opened
5 years ago
0
correct collections indexation --> for now all collections and subfields are indexed for research...
#47
JulienParis
opened
5 years ago
2
forgot password routine
#46
JulienParis
opened
6 years ago
0
Password are stored unobfuscated
#45
thibault
opened
6 years ago
1
Fix unicode / binary conversion problems in logging functions
#44
thibault
closed
6 years ago
0
Scraper halts upon meeting a link with unicode characters
#43
thibault
opened
6 years ago
2
vérifier côté OpenScraper capacité à abndonner des requêtes (fermer connexion TCP)
#42
JulienParis
opened
6 years ago
0
add option "RANDOMIZE_DOWNLOAD_DELAY" true/false in the "advanced settings"
#41
JulienParis
closed
6 years ago
1
How to debug spider?
#40
thibault
opened
6 years ago
7
Le re-crawl d'un même site détruit les identifiants des objets scrappés
#39
DavidBruant
opened
6 years ago
1
Next