issues
search
meilisearch
/
scrapix
MIT License
21
stars
9
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix some readme typos & clarify some language
#100
klvs
closed
4 months ago
0
Provide option to slow or rate limit requests
#99
klvs
opened
4 months ago
1
Cannot run under Windows (path contains invalid characters)
#98
AXYZE9
opened
4 months ago
1
Update package.json
#97
brunoocasali
closed
5 months ago
0
add configuration option for additional request headers
#96
dardanbujupaj
closed
5 months ago
5
Scrapix Docker image configuration: JSON string parsing and Liquid syntax compatibility
#95
CaroFG
opened
6 months ago
0
`user_agents` in configuration file doesn't change HTTP User-Agent header
#94
TonyRL
opened
7 months ago
0
Add possibility to exclude selectors
#93
CaroFG
opened
7 months ago
0
Increases timeout to avoid MeiliSearchTimeOutError when using auto-embeddings
#92
CaroFG
closed
7 months ago
0
Meilisearch throws a timeout error when indexing with auto-embeddings
#91
CaroFG
closed
7 months ago
0
Document the purpose of hierarchy_radio_lvl
#90
CaroFG
opened
8 months ago
0
Add config file support on docssearch strategy
#89
CaroFG
closed
7 months ago
0
Allow the use of a config file when docssearch strategy is set
#88
CaroFG
closed
7 months ago
0
Fix typo in README.md
#87
tomecko
closed
1 year ago
1
Update readme with docker usage
#86
bidoubiwa
closed
7 months ago
0
remove useless logs
#85
bidoubiwa
closed
1 year ago
0
Add CI to publish to docker hub
#84
bidoubiwa
closed
1 year ago
0
Add new flags
#83
bidoubiwa
closed
1 year ago
0
Update version to v0.1.7
#82
bidoubiwa
closed
1 year ago
0
Change temporary indexname to _crawler_tmp instead of _tmp
#81
bidoubiwa
closed
1 year ago
0
fix(crawler): fix `failed` webhook
#80
brunoocasali
opened
1 year ago
2
Update version to v0.1.6
#79
bidoubiwa
closed
1 year ago
0
Update scrapix to v0.1.5
#78
bidoubiwa
closed
1 year ago
0
Update scrapix to v0.1.4
#77
bidoubiwa
closed
1 year ago
0
Remove sender add console log
#76
bidoubiwa
closed
1 year ago
0
Fix bug where same batches are send multiple times
#75
bidoubiwa
closed
1 year ago
0
Only package src and dist
#74
bidoubiwa
closed
1 year ago
0
Update version of package to v0.1.0
#73
bidoubiwa
closed
1 year ago
0
Empty enqueued url's after crawler to avoid caching
#72
bidoubiwa
closed
1 year ago
0
Prepare project for npm release
#71
bidoubiwa
closed
1 year ago
0
Add webhook_url as a config option
#70
bidoubiwa
closed
1 year ago
0
Make webhook private
#69
bidoubiwa
closed
1 year ago
0
Create AWS lambda function to run scrapix
#68
bidoubiwa
closed
1 year ago
1
Create script to zip and upload scrapix to AWS
#67
bidoubiwa
closed
1 year ago
1
Export webhook
#66
bidoubiwa
closed
1 year ago
0
Build scrapix into a cjs module
#65
bidoubiwa
closed
1 year ago
0
Wait for scrapix server to run during CI tests
#64
bidoubiwa
closed
1 year ago
0
Fix failing scraping when no hierarchy level0 is found
#63
bidoubiwa
closed
1 year ago
0
Add webhooks
#62
qdequele
closed
1 year ago
0
Simplify the way to get npm package version
#61
qdequele
opened
1 year ago
0
Add schema graph
#60
qdequele
closed
1 year ago
1
Add webhooks setting
#59
bidoubiwa
closed
1 year ago
0
Use browserless to run the headless chrome
#58
bidoubiwa
closed
1 year ago
3
Avoid port conflict between server port and playground port
#57
bidoubiwa
closed
1 year ago
0
Throw error when redis server is not answering
#56
bidoubiwa
opened
1 year ago
0
Update the readme with explainations on config file settings
#55
bidoubiwa
closed
1 year ago
0
Add user agent and possibility to provide additional user agents
#54
bidoubiwa
closed
1 year ago
0
Retrieve page titles from meta tags
#53
Strift
opened
1 year ago
3
Ensure same documents are not pushed more than once.
#52
bidoubiwa
opened
1 year ago
1
Improve splitting of documents
#51
bidoubiwa
closed
1 year ago
0
Next