matthewmueller/x-ray
The next web scraper. See through the <html> noise.
MIT License
5.87k stars, 349 forks
issues
Sort: Newest
Add GitHub Templates
#149
Kikobeats
closed
8 years ago
1
Add GitHub Templates
#148
Kikobeats
closed
8 years ago
0
Enable Greenkeeper Integration
#147
Kikobeats
closed
5 years ago
1
Add services integration
#146
Kikobeats
closed
8 years ago
0
Support x-ray-parse filters
#145
fabien
closed
8 years ago
6
Decode XML/HTML entities
#144
rcdiaz
opened
8 years ago
6
timeout is not a function
#143
rcdiaz
closed
8 years ago
1
Total Refactor, a New Hope
#142
Kikobeats
closed
8 years ago
8
Add node.stream method
#141
Kikobeats
closed
8 years ago
0
Sanitize/Modify data before returning (middleware of sorts)?
#140
ishan-marikar
closed
8 years ago
5
Crawl a web feed by scrolling and waiting for more data to load via ajax
#139
sauravskumar
closed
5 years ago
3
Wait for page that loaded with ajax
#138
NetanelBasal
closed
8 years ago
4
Change to documentation of support for raw HTML
#137
gconnolly
closed
8 years ago
3
Fix issue where callback was called too many times when using composi…
#136
dfdeagle47
closed
5 years ago
2
Issue with composition (crawling another page) + collection
#135
dfdeagle47
opened
8 years ago
6
API Example
#134
MMRandy
closed
8 years ago
6
Warn if constructor is misused
#133
jspri
closed
5 years ago
1
Dynamically build follow url
#132
rahatarmanahmed
closed
5 years ago
2
Cannot read property 'children' of undefined
#131
dgroch
closed
5 years ago
1
Crawling to another site
#130
umpirsky
closed
8 years ago
2
Crawling problem
#129
ibeerepoot
closed
8 years ago
1
Pagination
#128
matt-erhart
closed
8 years ago
0
Append data to file
#127
ludwigfrank
opened
8 years ago
1
Unexpected elements break crawl
#126
deathg0d
closed
8 years ago
2
Cannot Read property 'html' of null
#125
christiansaiki
opened
8 years ago
1
Delay method
#124
heady
closed
5 years ago
2
Added support for <base> tag when generating absolute URLs.
#123
narmontas
closed
8 years ago
1
Workaround for paginate when there is js in the href?
#122
shwaydogg
closed
5 years ago
3
crawl to nested url feature is not functioning
#121
madibalive
closed
8 years ago
11
.delay / .timeout is undefined?
#120
vodp
closed
5 years ago
6
passing through values into xray post scrape function
#119
markgibaud
closed
8 years ago
0
memory leaks
#118
redaready
closed
8 years ago
1
Merge fix nested crawling
#117
Shipow
closed
8 years ago
0
delay not working
#116
jonstuebe
closed
8 years ago
2
Duplicate result of selecting td with classname
#115
klingkie
closed
5 years ago
5
tag only returns the 1st entry of it.
#114
IoDmitri
closed
8 years ago
1
Cannot read property 'is' of null
#113
msokalski
closed
8 years ago
2
Do no error on trying to follow a non-url
#112
0xgeert
closed
8 years ago
5
Nested crawling broken on 'master'. When to merge 'bugfix/nested-crawling'
#111
0xgeert
closed
5 years ago
19
How to get to data not in html elements, but in javascript for instance?
#110
0xgeert
closed
8 years ago
2
Allow plugin for url feeding / pagination
#109
0xgeert
closed
5 years ago
4
Multiple Scoping with one selector
#108
alejodiazg
closed
5 years ago
6
Is there a way to track the progress of a scrape?
#107
kanethal
closed
7 years ago
4
Passing cookies or request headers
#106
patrickarlt
closed
8 years ago
2
how to target .xml pages
#105
kanethal
closed
7 years ago
3
Throttle, Concurrency supported?
#104
thinkloop
closed
8 years ago
2
Unhandled error bug and non-html
#103
kevindeasis
closed
5 years ago
1
Fix potential issue when selector is a function following comments on…
#102
dfdeagle47
closed
8 years ago
1
HTML entities and other odd characters
#101
yusijs
closed
8 years ago
2
Not finding all selectors
#100
ivansabik
closed
8 years ago
3