**Closed** — everettVT closed this issue 3 months ago
Never mind, I found your GPU specs in the paper. Thanks again!
> **Action generator.** We use the provided hyperparameter configuration and fine-tune Llama-2-7B and 13B across 2 NVIDIA A100 80GB GPUs, and Llama-3-8B across 4 NVIDIA A100 80GB GPUs.
>
> **Code generator.** We use the provided hyperparameter configuration and fine-tune CodeTulu-7B and DeepSeekCoder-7B-Instruct-v1.5 across 2 NVIDIA A100 80GB GPUs, and Llama-3-8B across 4 NVIDIA A100 80GB GPUs.
>
> **Math reasoner.** We use the provided hyperparameter configuration and fine-tune Tulu-2-7B and DeepSeekMath-7B-Instruct across 2 NVIDIA A100 80GB GPUs, and Llama-3-8B across 4 NVIDIA A100 80GB GPUs.
>
> **Query generator.** We use the provided hyperparameter configuration and fine-tune Llama-2-7B across 2 NVIDIA A100 80GB GPUs, and Llama-3-8B across 4 NVIDIA A100 80GB GPUs.
It seems to me that Husky would be a perfect fit for my Triton Inference Server, given its model management and concurrent model execution. I am building a personal knowledge agent and would like to benchmark Husky against proprietary models on tool use and reasoning. Do you have compute recommendations for my use case?
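For anyone trying the same setup: a minimal Triton model configuration for serving one of the Husky modules could look something like the sketch below. This is just my guess at a starting point, not anything from the paper — the model name, batch size, instance count, and GPU indices are all hypothetical placeholders, and the real input/output tensor definitions depend on which backend (e.g. a Python backend wrapping the HF checkpoint) you use.

```
# config.pbtxt — hypothetical sketch for serving a Husky module on Triton
# All values below are illustrative assumptions, not from the Husky paper.
name: "husky_action_generator"   # placeholder model name
backend: "python"                # assumes a Python backend wrapping the checkpoint
max_batch_size: 8                # illustrative; tune for your GPU memory

instance_group [
  {
    count: 1         # one execution instance of this module
    kind: KIND_GPU
    gpus: [ 0, 1 ]   # mirrors the paper's 2x A100 80GB setup for the 7B models
  }
]
```

Each Husky module (action generator, code generator, math reasoner, query generator) would get its own model directory and config, and Triton's model management would let them load and serve concurrently.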
@danieljkim0118 PS: Excellent work. It's great to see Meta, UW, and AI2 working together. As a fellow Seattle-area native, it's encouraging to see this kind of development happening locally.