-
### Is your feature request related to a problem? Please describe.
_No response_
### Describe the solution you'd like
$OLLAMA_HOST is the standard way to define your Ollama server URL. It is used b…
-
Layer-Condensed KV Cache for Efficient Inference of Large Language Models
https://arxiv.org/abs/2405.10637
Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet
https://tr…
-
**Application**
If I have a square matrix that is very large and being stored as an `np.memmap` array and I try to construct a BQM with it I often run out of memory _even if_ the actual final BQM isn…
-
Hello,
I am using your TTS as an Android Text-to-Speech (TTS) engine for offline use, but I have encountered an issue with the audio synthesis. It is taking approximately **500 ms** to speak the te…
-
We propose [MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models](https://arxiv.org/pdf/2405.13053). Our proposed MeteoRA (Multiple-Tasks embedded LoRA) is a scalable and efficient framewor…
-
Nice package, I have been looking for methods for heteroskedastic distribution prediction! (see https://github.com/sktime/skpro/issues/7)
More generally, I think `rolch` would fit the scope of skpr…
-
**Please Describe The Problem To Be Solved**
ollama website [ollama](https://ollama.com/)
Ollama get up and running with large language models, locally.
Ollama has gained popularity for its efficie…
-
## What dataset are you requesting?
US residential mortgage perforamance dataset from Fannie Mae [Data Dynamics](https://www.fanniemae.com/data-dynamics).
1. [Here](https://capitalmarkets.fanniemae…
-
### Feature request
Hi, I'm the author of [zhuzilin/ring-flash-attention](https://github.com/zhuzilin/ring-flash-attention).
I wonder if you are interested in integrating context parallel with [zh…
-
aim : this projects aims to detect the parkisons diseases using Machine learning models in an efficient way
objective : uses different Ml Models to predicts the disease using kaggle data set , by ap…