Closed DevinTDHa closed 2 months ago
Changes Missing Coverage | Covered Lines | Changed/Added Lines | % | ||
---|---|---|---|---|---|
src/main/scala/com/johnsnowlabs/nlp/pretrained/ResourceDownloader.scala | 0 | 2 | 0.0% | ||
src/main/scala/com/johnsnowlabs/nlp/HasLlamaCppProperties.scala | 0 | 381 | 0.0% | ||
<!-- | Total: | 0 | 383 | 0.0% | --> |
Totals | |
---|---|
Change from base Build 10678787199: | -1.6% |
Covered Lines: | 8981 |
Relevant Lines: | 14911 |
Description
This PR implements support for llama.cpp in Spark NLP.
llama.cpp is a high-performance C/C++ library designed for running Meta's LLaMA models and other large language models (LLMs) on a variety of hardware platforms.
This will enable users to do inference of LLMs wich a variety of optimizations:
Motivation and Context
Many users will have clusters with many smaller nodes. This will enable these nodes to also perform inference for LLMs with limited memory.
How Has This Been Tested?
Local tests, google colab, databricks
Types of changes
Checklist: