BuilderIO / gpt-crawler

Crawl a site to generate knowledge files to create your own custom GPT from a URL
https://www.builder.io/blog/custom-gpt
ISC License
18.15k stars 1.88k forks source link

Rate Limiting, Max Concurrency, Infinite Crawl & Additional Configurations #102

Open cpdata opened 7 months ago

cpdata commented 7 months ago

Initial Improvements

Main Additions

Full Summery

I would like to contribute to this project on a regular basis. I have a lot of Web-scraping, A.I./LLMs, CI/CD, Automation, experience and would like to discuss with the main collaborators and see were I can be of the most use.

cpdata commented 6 months ago

I updated with prettier formatting for the files that failed README.md, src/config.ts, src/core.ts, and config.ts. I also added the formatting for jsdoc/typedoc as recommened by @marcelovicentegc in response to my orginal pull request #102. Additionally, I added .prettierignore file.

steve8708 commented 6 months ago

@marcelovicentegc this look good to you to merge?

marcelovicentegc commented 5 months ago

@marcelovicentegc this look good to you to merge?

Hey @steve8708! Happy new years! One rebase and a few nitpicks ☝️ and it occurs to me that we are good to go 🤗

Ademrobert commented 5 months ago

Please merge this branch ASAP!