gpt2-inference-performance Search Results

236 results for gpt2-inference-performance

ollama/ollama #6425

optimize numa behavior for large models with GPU and CPU inf…

### What is the issue? My setup is a 4x A100 80GB, 2TB ram, dual intel cpu. Ubuntu server 22.04. On a previous version of ollama, the model llama3.1:405b was loaded in a reasonable amount of second…

fabiounixpi updated 1 month ago, 14 comments

huggingface/swift-transformers #95

Convert OpenELM to float16 Core ML

I converted the models to `float32` using this script: https://gist.github.com/pcuenca/23cd08443460bc90854e2a6f0f575084, but found precision problems when targeting `float16`. It'd be interesting to s…

pcuenca updated 4 months ago, 17 comments

guidance-ai/guidance #794

Temperature effect on Select

Hi there, I have some doubts about the process behind the `select` method. 1. Is there any detailed explanation about what happens under the hood while using `select` and `gen`? I mean, can `s…

luciolcv updated 4 months ago, 9 comments

irthomasthomas/undecidability #643

I finally got perfect labels (classification task) via promp…

- [ ] [I finally got perfect labels (classification task) via prompting : r/LocalLLaMA](https://www.reddit.com/r/LocalLLaMA/comments/1amvfua/i_finally_got_perfect_labels_classification_task/) # TIT…

irthomasthomas updated 7 months ago, 1 comment

UChicago-Computational-Content-Analysis/Readings-Responses-2023 #27

6. Prediction & Causal Inference - oritenting

Post questions here for this week's oritenting readings: Veitch, Victor, Dhanya Sridhar & David M. Blei. 2020. “Adapting Text Embeddings for Causal Inference.” Proceedings of the 36th Conference on Un…

JunsolKim updated 2 years ago, 23 comments

google/uis-rnn #50

uis-rnn can't work for long utterances dataset?

## Describe the question In Diarization task, i train on AMI train-dev set and ICSI corpus , i test on AMI test set. Both datasets include audios of 3-5 speakers in 50-70 minutes. My d embedding tra…

wrongbattery updated 3 years ago, 19 comments

ggerganov/ggml #59

Text to speech models in GGML?

@ggerganov do you have any interest in producing more models in GGML format? I'm now convinced your approach of zero dependency, no memory allocation cpu-first ideaology will make it accessible to…

simplejackcoder updated 3 months ago, 93 comments

apache/singa #700

Expand the model zoo (example model set)

SINGA has multiple example models at http://singa.apache.org/docs/examples/ Some are implemented from scratch and some are converted from ONNX, which has a bigger model zoo https://github.com/onnx/m…

nudles updated 4 years ago, 39 comments

anijain2305/torchdynamo_dashboard #3

one-off runs

(next 2 comments are for max-autotune, warm start run) AMP RUN ~~~ +------------------------+------------+-------------+-------------+ | Compiler | torchbench | huggingface | tim…

anijain2305 updated 1 year ago, 12 comments

salesforce/simpletod #24

Shouldn't context be masked during training?

If I understand correctly the idea should be that model generate belief states, dbsearch results, action and response conditioned on some dialog context. Then shouldn't we mask the context in between …

yuanzhaoz updated 3 years ago, 8 comments
