ContentMine/quickscrape
A scraping command line tool for the modern web
MIT License
259 stars
42 forks
issues
Installation via npm misses tiny-jsonrpc dependency
#103
bcarradini
opened
7 years ago
2
Attempts to locate scraper under --output directory
#102
bcarradini
opened
7 years ago
4
Include source URL in results file
#101
maxpsq
opened
7 years ago
0
Last URL missed
#100
adclose
opened
7 years ago
3
Need Help
#99
itahmid
closed
7 years ago
1
Quickscrape not scraping through my list of 1000 URLs
#98
alexmaina
closed
7 years ago
14
Quickscrape url retrieval failing?
#97
chartgerink
closed
7 years ago
1
installing quickscrape after installing getpapers, breaks getpapers
#96
rossmounce
opened
7 years ago
2
scraper directory uses the user-input output directory
#95
rossmounce
closed
7 years ago
6
Fix logging
#93
tarrow
opened
8 years ago
1
Print which scraper was used
#92
tarrow
opened
8 years ago
0
Mark that Scraping has been attempted
#91
tarrow
opened
8 years ago
0
Migrate from Phantom to Nightmare
#90
tarrow
opened
8 years ago
0
Possible Spooky bug
#89
petermr
opened
8 years ago
0
undefined response object in UrlResolver
#88
petermr
closed
8 years ago
1
Fails on ACS
#87
bjonnh
opened
8 years ago
4
sanitize creation of folder
#86
tarrow
closed
8 years ago
1
Quickscrape fails to properly sanitise filenames
#85
tarrow
closed
8 years ago
0
Added missing comma
#84
larsgw
closed
8 years ago
2
Bleeding edge thresher to test hyphen URL fix (see #79)
#83
blahah
closed
8 years ago
2
Hangs on `Loading resource failed with status=fail`
#82
petermr
opened
8 years ago
2
QS hangs indefinitely
#81
tarrow
closed
8 years ago
6
Hangs on URL
#80
tarrow
closed
8 years ago
1
Issue with URL that contains hyphen
#79
ficolo
opened
8 years ago
14
added CONTRIBUTING.md
#78
chreman
closed
8 years ago
2
duplicate abbreviation `-f` in help
#77
petermr
opened
8 years ago
0
Quickscrape hangs and emits warnings
#76
petermr
opened
8 years ago
1
Download multiple supplemental files (e.g. tables)
#75
petermr
opened
8 years ago
1
Hanging seemingly randomly when downloading a list of URLs
#74
robintw
opened
8 years ago
7
Remove requirement for `-o` with `-n`
#73
petermr
closed
8 years ago
2
Fails when using relative paths - depends on platform?
#72
robintw
opened
8 years ago
0
Added simple implementation of code to skip URLs already processed
#71
robintw
opened
8 years ago
5
Skip already-downloaded articles
#70
robintw
opened
8 years ago
1
possible to just download json file?
#69
muranava
opened
8 years ago
1
switch to wgxpath as xpath lib
#68
blahah
opened
8 years ago
0
scraper directory is wrong
#67
skasberger
closed
8 years ago
1
Fix url list processing
#66
zemanel
opened
9 years ago
2
links with many special characters not properly scraped
#65
chartgerink
closed
8 years ago
5
Passing down headless mode parameter to scraper
#64
lanzer
opened
9 years ago
0
Cannot run Quickscrape in headless mode
#63
lanzer
opened
9 years ago
3
No handler for status code != 200
#62
lanzer
opened
9 years ago
4
Long garbled link breaks quickscrape mkdir
#61
chartgerink
closed
7 years ago
2
v0.4.7 breaks scraping in Windows
#60
chartgerink
closed
8 years ago
2
Review command-line help for clarity
#59
blahah
opened
9 years ago
0
Scraper list option?
#58
chartgerink
closed
9 years ago
5
How to address non-attribute content in quickscrape
#57
petermr
opened
9 years ago
1
Relative path for scraper definition resolved to wrong location
#56
dan2097
opened
9 years ago
0
Quickscrape halts with no error messages after 'processing URL: ...'
#55
dsmurrell
opened
9 years ago
0
DOI resolution gives different results to the direct URL
#54
markmacgillivray
closed
9 years ago
4
same short arg for urllist and ratelimit -r
#53
chreman
closed
9 years ago
1