issues
search
crwlrsoft
/
crawler
Library for Rapid (Web) Crawler and Scraper Development
https://www.crwlr.software/packages/crawler
MIT License
325
stars
11
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Improve message composition
#114
szepeviktor
closed
1 year ago
10
Improve variable names
#113
szepeviktor
closed
1 year ago
1
Ignore special non HTTP links
#112
otsch
closed
1 year ago
0
Fix reading input sitemap in HTTP crawl step
#111
otsch
closed
1 year ago
0
"Microseconds" not found
#110
flanderboy
closed
1 year ago
1
Fatal error: is not a valid URL
#109
flanderboy
closed
1 year ago
5
Limit Pagination Crawler
#108
flanderboy
closed
1 year ago
2
Fix typo in HttpTest
#107
szepeviktor
closed
1 year ago
0
Improve Json step
#106
otsch
closed
1 year ago
0
Fix getting only certain HTML meta data properties
#105
otsch
closed
1 year ago
0
Fix issue with array of arrays in result
#104
otsch
closed
1 year ago
0
sub steps
#103
TheCrealm
closed
1 year ago
4
Add version 1.1 in CHANGELOG.md
#102
otsch
closed
1 year ago
0
New method HttpLoader::cacheOnlyWhereUrl()
#101
otsch
closed
1 year ago
0
Use new crwlr/utils Json class
#100
otsch
closed
1 year ago
0
Use proxy
#99
vladsvd
closed
1 year ago
7
Filter by output key aliases
#98
otsch
closed
1 year ago
0
Negate filter functionality
#97
otsch
closed
1 year ago
0
Crawl step can optionally use canonical links
#96
otsch
closed
1 year ago
0
Get HTTP request parameters from input data
#95
otsch
closed
1 year ago
0
Http::crawl() step respect maxOutputs limit
#94
otsch
closed
1 year ago
0
Links/URLs without fragment part
#93
otsch
closed
1 year ago
0
Remove UTF-8 BOM from beginning of strings
#92
otsch
closed
1 year ago
0
Question about submitting form data
#91
DMGPage
closed
1 year ago
6
Fix duplicate list point with broken link
#90
otsch
closed
1 year ago
0
Paginating links with JavaScript href
#89
pjdevries
closed
1 year ago
2
Upgrade to pest v2
#88
otsch
closed
1 year ago
0
Add Crawler::runAndDump()
#87
otsch
closed
1 year ago
0
Output key aliases for addToResult()
#86
otsch
closed
1 year ago
0
Fix JSON keys with empty string value
#85
otsch
closed
1 year ago
0
Improve CI in four ways
#84
szepeviktor
closed
1 year ago
0
Remove PHP version displaying steps
#83
szepeviktor
closed
1 year ago
0
Upgrade actions/checkout action
#82
szepeviktor
closed
1 year ago
0
Remove deprecated --no-suggest from CI
#81
szepeviktor
closed
1 year ago
0
Speed up CI
#80
szepeviktor
closed
1 year ago
0
Improve fixing JSON having keys without quotes
#79
otsch
closed
1 year ago
0
Extracting <script> tags
#78
dimitardimitrov2
closed
1 year ago
2
PHP 8.0 support
#77
MathiasReker
closed
1 year ago
1
v1.0
#76
otsch
closed
1 year ago
0
How to process extracted values?
#75
HelgeSverre
closed
1 year ago
6
Finish v0.7
#74
otsch
closed
1 year ago
0
Keeping input data uses original input
#73
otsch
closed
1 year ago
0
Base Http step also works with array of URLs
#72
otsch
closed
1 year ago
0
Deprecate the loop feature
#71
otsch
closed
1 year ago
0
Fix JSON object keys without quotes
#70
otsch
closed
1 year ago
0
Fix losing result data to add later
#69
otsch
closed
1 year ago
0
Add option to provide chrome executable name
#68
otsch
closed
1 year ago
0
Minor validation methods improvement
#67
otsch
closed
1 year ago
0
Html/Xml data extraction multiple layers
#66
otsch
closed
1 year ago
0
Improve adding data to final Result objects
#65
otsch
closed
1 year ago
0
Previous
Next