BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
https://www.builder.io/blog/custom-gpt
ISC License
18.14k stars
1.88k forks
issues
Crawling more than max number of pages
#118
dcgleason
opened
6 months ago
0
Query on Configuring Multiple Web Pages with Unique config.ts Files
#117
PipeDream941
opened
6 months ago
0
Support for multiple cookies
#116
little873
closed
6 months ago
2
some html like https://www.abc.com/file/abc.pdf couldn't be crawled
#115
cuterv
opened
6 months ago
0
Optional Selector to limit link extraction to be within it
#114
monagjr
opened
6 months ago
0
Json too large for GPT
#113
nexuslux
opened
6 months ago
7
How to limit the hierarchy of pages to be crawled?
#112
wywywy1990
opened
6 months ago
0
Pages to Crawl With Authentication
#111
Atrum023
opened
6 months ago
3
maxpage did not work well
#110
bcharityfi
opened
6 months ago
1
ERROR PlaywrightCrawler
#109
why-you-trust-me
closed
6 months ago
3
waiting for locator('td.content') to be visible
#108
baqi2
opened
6 months ago
0
Why I have no "my gpts" option in openai site
#107
yishibakaien
opened
6 months ago
3
Update config.ts to add maxTokens default
#106
mcoliver
closed
5 months ago
1
Github crawling example request
#105
antoan
closed
6 months ago
1
Issue with wildcard usage in 'match' configuration
#104
dxbmax
opened
6 months ago
0
Module '"../config.js"' has no exported member 'defaultConfig'.
#103
smcfall14
opened
6 months ago
0
Rate Limiting, Max Concurrency, Infinite Crawl & Additional Configurations
#102
cpdata
opened
6 months ago
4
add exclude to the script (my first pull request)
#101
Webstudio88
closed
5 months ago
1
How to limit the crawling speed of a spider
#100
ZhangzheBJUT
opened
7 months ago
1
Crawl picture
#99
heiyejiang
opened
7 months ago
0
Invalid filename
#98
arkeyliu
opened
7 months ago
0
Feat: Multiple Match Pattern Config; Pattern Avoid; Grap Content with innerHTML Compatible
#97
FTAndy
opened
7 months ago
0
Error: Invalid or unsupported zip format. No END header found
#96
jiangsiYang
closed
6 months ago
0
Request to support PDF scraping
#95
Zenpenguin
opened
7 months ago
2
Gpt Assistant Throws An Error
#94
snarasimhan80
opened
7 months ago
0
--tool-tip-position-left, 0)
#93
ahmet-rttr34
opened
7 months ago
0
Crawl local harddrive
#92
marcusdeman
opened
7 months ago
0
Help, why can I only climb to the first page of gitbook
#91
wt195799611
opened
7 months ago
5
Major Refactoring for Enhanced Flexibility and Performance
#90
maxime4000
opened
7 months ago
4
Curate HTML w/ Semantic similarity | JinaAI Embeddings v2 (Small) | Curate HTML to Markdown with JinaAI Embedding Processing for Redundancy Removal
#89
Daethyra
closed
6 months ago
2
chore: add prettier check
#88
kunal00000
closed
7 months ago
5
Mac systems have error with npm start
#87
doudouma
opened
7 months ago
3
A fork and modified version to specialize in crawling source code from Github project
#86
FTAndy
closed
7 months ago
0
How to search all URLs with a certain word in it
#85
nynewco
opened
7 months ago
5
How to search sub-folders e.g. xyz.com/folder/page1, page 2 etc.
#84
nynewco
opened
7 months ago
6
Does the project not support Mac systems?
#83
lijialei001
opened
7 months ago
3
Multiple concurrent crawler with split output. Asking if there is interest in completing my Fork.
#82
maxime4000
opened
7 months ago
0
[FR] Optimization of Data Formatting for Custom GPT
#81
Snowzer91
opened
7 months ago
0
[FR] Multitasking system
#80
Snowzer91
opened
7 months ago
0
[FR] System to pause and resume the task later
#79
Snowzer91
opened
7 months ago
1
[FR] Exclude a list of urls
#78
Snowzer91
opened
7 months ago
1
Size
#77
albrox
opened
7 months ago
3
Adding pagination option
#76
kanehooper
closed
7 months ago
0
Comparison with well-established crawlers
#75
dandv
closed
7 months ago
5
FR: preserve links
#74
dandv
opened
7 months ago
0
FR: remove cruft from links
#73
dandv
opened
7 months ago
0
Fix English in README
#72
dandv
closed
6 months ago
2
Is the selector still required?
#71
dandv
opened
7 months ago
1
Running as CLI (broken link)
#70
oneezy
opened
7 months ago
0
chore(cicd): setup test pipeline
#69
marcelovicentegc
closed
5 months ago
2