devflowinc / philosophize-this

1 stars 2 forks source link

cleanup: strip out chunks with [music] only, and other small chunks during indexing #7

Closed danielsgriffin closed 1 month ago

danielsgriffin commented 2 months ago

Example:

image
skeptrunedev commented 2 months ago

https://github.com/devflowinc/philosophize-this/blob/main/transcript-scrape/bulkCreate.js#L5

To complete this issue, in bulkCreate.js, remove chunks from the array which are less than 5 words.