efficient-training Search Results

1000+ results for efficient-training


SakanaAI/AI-Scientist #116

The process has been stuck at the retrieval phase for about …

(AI_Scientist) root@intern-studio-50102651:~/AI-Scientist# python launch_scientist.py --model "gpt-4o-2024-05-13" --experiment nanoGPT --num-ideas 1 Using GPUs: [0] Using OpenAI API with model gpt-4…

Wuyuhang11 updated 2 months ago
2
dmlc/xgboost #11000

[RFC] High-level interface for external memory.

This document concerns the design for the future high-level external memory interface for XGBoost. The closest existing examples are data loaders in deep learning libraries, and there's no standardize…

trivialfis updated 16 hours ago
2
NifTK/NiftyNet #185

Ensure efficient IO when training on large sets of 2D images

As discussed with @atbenmurray and previously with @wyli and @luiscarlosgph, this is a follow up of [cmiclab issue #205](https://cmiclab.cs.ucl.ac.uk/CMIC/NiftyNet/issues/205). We now have support for…

tvercaut updated 5 years ago
2
chemprop/chemprop #1078

[v2 FEATURE]: Add an option to ignore chirality in input SMI…

I'm a senior scientist at Merck, which is part of the MLPDS Consortium. We would like a new feature to optionally ignore chirality. **Is your feature request related to a problem? Please describe.*…

yunsiechung updated 1 month ago
2
rohitinu6/Stock-Price-Prediction #15

To Improve Model fastness we can use LightGBM and CatBoost

LightGBM: Efficiency: LightGBM is designed to be highly efficient and can handle large datasets with faster training times. Accuracy: It often provides better accuracy compared to other gradient b…

praveenarjun updated 2 months ago
3
cocktailpeanut/fluxgym #198

Switching to cosine with warmup as default setting

@cocktailpeanut as evoked in another thread --optimizer_args "relative_step=False" "scale_parameter=False" "warmup_init=False" --lr_scheduler constant_with_warmup **THIS SETTING IS ABSOLUTE C…

Tablaski updated 2 weeks ago
9
pytorch/torchtune #1487

How to disable Checkpointing for Full tuning or PEFT runs?

I am trying to run single GPU to multinode distributed fine tuning for Llama3-70B and Llama3 8B Models. Below is my training configuration: SFT (Llama3 8B & 70B) Epochs: 3 Gradient Accumulatio…

premmotgi updated 2 months ago
4
AkihikoWatanabe/paper_notes #1523

Understanding LLMs: A Comprehensive Overview from Training t…

# URL - https://arxiv.org/abs/2401.02038 # Authors - Yiheng Liu - Hao He - Tianle Han - Xu Zhang - Mengyuan Liu - Jiaming Tian - Yutong Zhang - Jiaqi Wang - Xiaohui Gao - Tianyang …

AkihikoWatanabe updated 2 weeks ago
3
NVIDIA/NeMo-Curator #335

Faster/More efficient duplicate removal for exact/fuzzy dedu…

**Is your feature request related to a problem? Please describe.** The current deduplication examples suggest `compute` on the list of duplicate documents produced via exact/fuzzy deduplication and us…

ayushdg updated 1 month ago
1
TheAlgorithms/Python #12322

Add Radial Basis Function Neural Network (RBFNN)

### Feature description Radial Basis Function Neural Networks (RBFNNs) are a type of neural network that combines elements of clustering and function approximation, making them powerful for both regr…

JeninaAngelin updated 1 month ago
2
