issues
search
typesense
/
typesense-docsearch-scraper
A fork of Algolia's awesome DocSearch Scraper, customized to index data in Typesense (an open source alternative to Algolia)
https://typesense.org/docs/guide/docsearch.html
Other
97
stars
36
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Are only files in sitemap.xml scraped?
#68
FantasticFiasco
closed
3 days ago
1
feat: add support for http_auth_domain spider attribute
#67
nkls-so
closed
1 week ago
2
feat: carry curation rules and synonyms to new collection after scraper runs
#66
tharropoulos
closed
1 week ago
2
Content of Some links cannot be crawled
#65
simmonn
opened
4 months ago
2
Collection which is latest was deleted after scraper completed
#64
ruoqianfengshao
closed
4 months ago
2
Automatically Close Resources
#63
pixeeai
closed
1 week ago
1
Incomplete indexing of a large docusaurus site
#62
KevinMArtio
closed
4 months ago
5
Don't approve this
#61
nascosto
closed
5 months ago
0
Move from circleci to GitHub actions
#60
nascosto
closed
5 months ago
1
Fix link to the guide
#59
teners
closed
5 months ago
1
feat: multi-arch
#58
darkweaver87
opened
5 months ago
2
Provide multi-arch docker image
#57
darkweaver87
opened
5 months ago
3
No option to create non-nested attributes
#56
attila-csaszar
opened
8 months ago
3
Fix start URL with JS test
#55
CodeSandwich
closed
9 months ago
0
Upgrade selenium to 4.15.2
#54
CodeSandwich
opened
9 months ago
1
Selenium test failing on the master branch
#53
CodeSandwich
closed
9 months ago
1
[Feature request] Allow support for a less verbose option
#52
imballinst
opened
10 months ago
1
How can I configure docsearch-scraper to run against a private internal documentation site that requires auth via oauth2?
#51
liberty-wollerman-kr
opened
10 months ago
2
Scrapper not crawling antora site
#50
wanderanimrod
closed
10 months ago
2
Connection was refused by other side running scraper via docker
#49
noghartt
closed
10 months ago
3
Include "Bot" token in default user agent string
#48
Krinkle
closed
11 months ago
1
Delete CHANGELOG.md
#47
Krinkle
closed
11 months ago
0
Allow mapping page addresses
#46
CodeSandwich
opened
11 months ago
0
Possible to send a header with `docker run -i ... typesense/docsearch-scraper` ?
#45
paulrudy
opened
1 year ago
2
Scraper not working on air-gapped machine
#44
MAHDI-ZN
opened
1 year ago
0
How to set 'locale'
#43
kijung-iM
opened
1 year ago
1
Use Port in `start_urls`
#42
JasonWhall
opened
1 year ago
2
Add support for Keycloak. Implements #39
#41
joostdecock
closed
1 year ago
7
How to rank "word Foo" before foobar, fooz, fooquux, etc.
#40
Krinkle
closed
1 year ago
3
[request] Add support for authenticating to Keycloak
#39
joostdecock
closed
1 year ago
2
Improve stop_urls documentation
#38
Krinkle
closed
1 year ago
2
Early position should have positive instead of negative affect on priority
#37
Krinkle
closed
1 year ago
4
Unexpected spaces in snippet around every character
#35
Krinkle
opened
1 year ago
0
Avoid multiple results from the same webpage
#36
Krinkle
closed
1 year ago
3
Which versions of typesense-server is docsearch-scraper compatible with?
#34
Krinkle
closed
1 year ago
1
Allow passing custom collection options
#33
marcospassos
closed
1 year ago
1
Optimize package installation in docker base image
#32
am97
opened
1 year ago
1
Big variation in number of hits when js_render is true
#31
beauchette
opened
1 year ago
1
Error when running docsearch-scraper
#30
arrondev
closed
1 year ago
5
ModuleNotFoundError: No module named 'requests' in scraper 0.4.0
#29
kostis-codefresh
opened
1 year ago
1
Keep previous versions available in Dockerhub
#28
lanegoolsby
closed
1 year ago
10
Unable to install dependencies when using scraper image in CircleCI
#27
lanegoolsby
opened
1 year ago
5
After rerun docsearch-scraper there is status code 404 on delete previous collection
#26
jasiek-net
closed
7 months ago
7
RuntimeError("cannot join thread before it is started")
#25
dtlhlbs
opened
1 year ago
8
#21 Added locale to collection's field
#24
Markeli
closed
1 year ago
0
Instructions for building scraper on Ubuntu 22
#23
Zamiell
closed
1 year ago
1
Sitemap found but not crawled
#22
yves-v
closed
1 year ago
1
Support configurable `locale` on creating one collections's fields.
#21
PupilTong
closed
1 year ago
7
feat: support apple silicon
#20
PupilTong
opened
1 year ago
2
Add docker image for Apple silicon (arm64)?
#19
PupilTong
opened
1 year ago
2
Next