-
I'm writing to share [my GPU-boosted implementation of PhenoGraph](https://gitlab.com/eburling/grapheno). Instead of using the CPU-bound libraries numpy, scipy.sparse, and sklearn as in the legacy imp…
-
### Bug description
When running multi-node/multi-GPU training with different number of GPUs on each node, `Fabric` `ddp` and `fsdp` will have an incorrect `num_replicas` in `distributed_sampler_kwar…
-
Hi,
I've implemented a clinical entity extraction pipeline using DSPy for processing patient notes. The pipeline extracts various entities (drugs, diseases, procedures, lab tests) and performs cond…
-
When dealing with large datasets and memory constraints, one popular clustering algorithm that can be effective is the DBSCAN (Density-Based Spatial Clustering of Applications with Noise) algorithm. D…
-
https://github.com/dask/dask-ml/blob/d5801584d092d8f13f1b38aaf4da5dc3caa6a213/dask_ml/datasets.py#L332 isn't great, especially in settings like Hyperband #221, that are using the distributed scheduler…
-
The plan for the next major release of tsinfer is to integrate with two upstream projects to improve tsinfer's scalability. There are two major parts to this:
1. Replace our custom data file format…
-
I'm fine tunning Llama-2 13-b model with jsonl file it fails. I've tried with 7b model and I've enabled billing also.
```python
DESTINATION_MODEL_NAME = 'deepakkumar07-debug/llama-midjournery'
…
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/axolotl-ai-cloud/axolotl/labels/bug) didn't find any similar reports.
### Exp…
-
# Research and compile a list of health related datasets with public/open licenses to store on decentralized storage
**Motivation/ background / user story:**
Open data is used by many stakeholde…
-
I run the training like below, but throught out an Erro: cuDNN error: CUDNN_STATUS_NOT_INITIALIZED .
torchrun --num_processes 1 train_network.py \
--pretrained_model_name_or_path=/aigc2/liutl/m…