feat: move CUDA prebuilt binaries to dependency modules to improve npm install times
fix: improve GGUF metadata reading performance
fix: adapt to breaking llama.cpp changes
Pull-Request Checklist
[x] Code is up-to-date with the master branch
[x] npm run format to apply eslint formatting
[x] npm run test passes with this change
[ ] This pull request links relevant issues as Fixes #0000
[ ] There are new or updated unit tests validating the change
[ ] Documentation has been updated to reflect this change
[x] The new commits and pull request title follow conventions explained in pull request guidelines (PRs that do not follow this convention will not be merged)
Description of change
npm install
timesllama.cpp
changesPull-Request Checklist
master
branchnpm run format
to apply eslint formattingnpm run test
passes with this changeFixes #0000