withcatai / node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp. Force a JSON schema on the model output on the generation level
https://withcatai.github.io/node-llama-cpp/
MIT License
736 stars 63 forks source link

feat: flash attention #264

Closed giladgd closed 3 days ago

giladgd commented 3 days ago

Description of change

Pull-Request Checklist

github-actions[bot] commented 3 days ago

:tada: This PR is included in version 3.0.0-beta.37 :tada:

The release is available on:

Your semantic-release bot :package::rocket: