-
As a potential user of this project, I am interested in training 3D Gaussian splatting models on my own dataset. However, I am using a moderately capable graphics card, and I want to avoid long traini…
-
LightGBM:
Efficiency: LightGBM is designed to be highly efficient and handles large datasets with fast training times.
Accuracy: It often delivers better accuracy than other gradient b…
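As a minimal sketch of these trade-offs (the dataset is synthetic and the hyperparameter values are illustrative, not recommendations):

```python
import lightgbm as lgb
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

# Synthetic stand-in for a large tabular dataset.
X, y = make_classification(n_samples=100_000, n_features=50, random_state=0)
X_train, X_valid, y_train, y_valid = train_test_split(X, y, random_state=0)

# Histogram-based tree building is what keeps training fast on large data;
# num_leaves and learning_rate are the usual speed/accuracy knobs.
model = lgb.LGBMClassifier(n_estimators=500, num_leaves=63, learning_rate=0.05)
model.fit(
    X_train, y_train,
    eval_set=[(X_valid, y_valid)],
    callbacks=[lgb.early_stopping(stopping_rounds=50)],  # stop once validation loss plateaus
)
print(model.best_iteration_)
```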
-
🔍 Problem Description:
The flight delay prediction model aims to predict whether a flight will be delayed based on factors such as airline, origin, destination, departure time, and day of the week. This help…
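A minimal sketch of such a model, assuming a hypothetical schema with exactly these columns (the column names and toy rows are illustrative):

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder

# Hypothetical schema matching the features named above.
df = pd.DataFrame({
    "airline": ["AA", "DL", "UA", "AA"],
    "origin": ["JFK", "ATL", "ORD", "LAX"],
    "destination": ["LAX", "ORD", "JFK", "ATL"],
    "departure_hour": [8, 17, 21, 6],
    "day_of_week": [1, 5, 7, 3],
    "delayed": [0, 1, 1, 0],  # target: 1 if the flight was delayed
})

categorical = ["airline", "origin", "destination"]
numeric = ["departure_hour", "day_of_week"]

pipeline = Pipeline([
    ("encode", ColumnTransformer(
        [("cat", OneHotEncoder(handle_unknown="ignore"), categorical)],
        remainder="passthrough",  # numeric columns pass through unchanged
    )),
    ("clf", GradientBoostingClassifier()),
])
pipeline.fit(df[categorical + numeric], df["delayed"])
```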
-
LINK TO GRAYSCALE MNIST: https://github.com/Seqaeon/MNIST_streamlit
Our weightless neural network framework, running on MNIST and MNIST-grayscale, already achieves strong results in terms of traini…
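For context, a weightless model replaces trained weights with RAM-node lookups over binarized inputs. Below is a generic WiSARD-style sketch of the idea, not this framework's actual API:

```python
import numpy as np

class WiSARD:
    """Minimal weightless classifier: each class owns RAM nodes indexed
    by tuples of binarized pixels; training just sets bits."""

    def __init__(self, n_inputs, tuple_size=8, n_classes=10, seed=0):
        rng = np.random.default_rng(seed)
        self.mapping = rng.permutation(n_inputs)  # random pixel-to-tuple wiring
        self.tuple_size = tuple_size
        self.n_tuples = n_inputs // tuple_size
        self.rams = [[{} for _ in range(self.n_tuples)] for _ in range(n_classes)]

    def _addresses(self, x_bits):
        shuffled = x_bits[self.mapping][: self.n_tuples * self.tuple_size]
        tuples = shuffled.reshape(self.n_tuples, self.tuple_size)
        # Pack each tuple of bits into an integer RAM address.
        return [int("".join(map(str, t)), 2) for t in tuples]

    def train(self, x_bits, label):
        for ram, addr in zip(self.rams[label], self._addresses(x_bits)):
            ram[addr] = 1  # write a single bit; no gradients, no weights

    def predict(self, x_bits):
        scores = [
            sum(ram.get(addr, 0) for ram, addr in zip(rams, self._addresses(x_bits)))
            for rams in self.rams
        ]
        return int(np.argmax(scores))

# Usage on a binarized 28x28 image (pixels thresholded to 0/1):
model = WiSARD(n_inputs=784)
model.train(np.random.default_rng(1).integers(0, 2, 784), label=3)
print(model.predict(np.random.default_rng(1).integers(0, 2, 784)))  # -> 3
```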
-
[Meta engineering blog post](https://engineering.fb.com/2024/06/12/data-infrastructure/training-large-language-models-at-scale-meta/)
- Meta requires massive computational power to train large lang…
-
### Motivation
Is it possible to apply Mixed Preference Optimization to the 76B InternVL model, as was done for the 8B model?
### Related resources
_No response_
### Additional context
_No response_
-
Hi,
I am trying to fine-tune a Llama model with a large context size, and I found that to efficiently shard activations across multiple GPUs, I need to use Torchtitan. Here are some questions relat…
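For readers unfamiliar with the idea, context parallelism shards activations along the sequence dimension so each GPU holds only a slice of a long context. The sketch below illustrates that with plain torch.distributed collectives; it is not Torchtitan's API, and the shapes are hypothetical:

```python
import torch
import torch.distributed as dist

# Illustrative sequence-dimension sharding with plain collectives; not
# Torchtitan's API. activations: [batch, seq_len, hidden].

def shard_sequence(activations: torch.Tensor, rank: int, world_size: int) -> torch.Tensor:
    # Assumes seq_len is divisible by world_size.
    return activations.chunk(world_size, dim=1)[rank].contiguous()

def gather_sequence(local: torch.Tensor, world_size: int) -> torch.Tensor:
    # Reassemble the full sequence where a layer (e.g. attention) needs it.
    gathered = [torch.empty_like(local) for _ in range(world_size)]
    dist.all_gather(gathered, local)
    return torch.cat(gathered, dim=1)

if __name__ == "__main__":
    dist.init_process_group("nccl")  # expects a torchrun launch
    rank, world = dist.get_rank(), dist.get_world_size()
    x = torch.randn(1, 8192, 4096, device=f"cuda:{rank}")
    local = shard_sequence(x, rank, world)   # each GPU keeps 8192 / world tokens
    full = gather_sequence(local, world)     # all-gather before full-sequence ops
```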
-
(AI_Scientist) root@intern-studio-50102651:~/AI-Scientist# python launch_scientist.py --model "gpt-4o-2024-05-13" --experiment nanoGPT --num-ideas 1
Using GPUs: [0]
Using OpenAI API with model gpt-4…
-
I'm interested in having support for [cost-efficient gradient boosting](https://dl.acm.org/doi/pdf/10.5555/3294771.3294919) in XGBoost. Glossing over non-essential details, CEGB is the application of …
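For comparison, LightGBM already exposes the paper's penalties as parameters (cegb_tradeoff, cegb_penalty_split, cegb_penalty_feature_lazy, cegb_penalty_feature_coupled), which suggests the shape such an XGBoost feature could take. A minimal sketch with synthetic data:

```python
import numpy as np
import lightgbm as lgb

X = np.random.rand(1_000, 10)
y = (X[:, 0] > 0.5).astype(int)

params = {
    "objective": "binary",
    "cegb_tradeoff": 0.5,        # weight of prediction cost relative to the loss
    "cegb_penalty_split": 0.01,  # fixed cost charged for every split
    # Per-feature acquisition costs: "lazy" is charged per data point that
    # actually evaluates the feature, "coupled" once for the whole model.
    "cegb_penalty_feature_lazy": [1.0] * X.shape[1],
    "cegb_penalty_feature_coupled": [1.0] * X.shape[1],
}
booster = lgb.train(params, lgb.Dataset(X, y), num_boost_round=50)
```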
-
### Feature request
PyTorch XLA/PJRT TPU support for bitsandbytes
### Motivation
Would allow for faster and more memory-efficient training of models on TPUs.
### Your contribution
Happy to prov…
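For context, this is what the CUDA-only 8-bit optimizer path looks like today; the request is to make the equivalent work under PyTorch XLA/PJRT on TPUs:

```python
import torch
import bitsandbytes as bnb

model = torch.nn.Linear(4096, 4096).cuda()  # CUDA is the only supported backend today

# 8-bit optimizer states shrink Adam's memory footprint roughly 4x vs. fp32,
# which is the saving this request would bring to TPUs.
optimizer = bnb.optim.Adam8bit(model.parameters(), lr=1e-4)

loss = model(torch.randn(8, 4096, device="cuda")).sum()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```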