typesense / typesense-docsearch-scraper

A fork of Algolia's awesome DocSearch Scraper, customized to index data in Typesense (an open source alternative to Algolia)
https://typesense.org/docs/guide/docsearch.html
Other
101 stars 36 forks source link

Allow mapping page addresses #46

Open CodeSandwich opened 1 year ago

CodeSandwich commented 1 year ago

Description

I need to index a site in my local environment. The site will be hosted on https://example.com/mySite, but I want to run scraper on my locally built and hosted copy, available under http://localhost. It would be great if the scraper could do the mapping and e.g. scrape content of http://localhost/foo.html and index it as found on https://example.com/mySite/foo.html, so I can use the built index in the TypeSense instance that https://example.com/mySite will be using in its search bar.