-
### 🚀 The feature, motivation and pitch
It'd be great to have a fused linear and cross-entropy function in PyTorch, for example, `torch.nn.functional.linear_cross_entropy`. This function acts as a fu…
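To make the request concrete, here is a minimal pure-Python sketch of one common reading of "fused": the projection and the loss are computed chunk by chunk, so the full logits matrix is never held in memory at once. This is an assumption about the intent, not the proposed PyTorch implementation. `chunked_linear_cross_entropy` and `chunk_size` are hypothetical names, and a real kernel would also fuse the backward pass.

```python
import math

def linear(x, weight):
    """Unfused projection: weight has shape [classes][features]."""
    return [[sum(xi * wi for xi, wi in zip(row, w)) for w in weight] for row in x]

def cross_entropy(logits, targets):
    """Mean negative log-likelihood, using a numerically stable log-sum-exp."""
    total = 0.0
    for row, t in zip(logits, targets):
        m = max(row)
        lse = m + math.log(sum(math.exp(v - m) for v in row))
        total += lse - row[t]
    return total / len(targets)

def chunked_linear_cross_entropy(x, weight, targets, chunk_size=2):
    """Hypothetical fused-style variant: at most chunk_size rows of logits exist at a time."""
    total = 0.0
    for start in range(0, len(x), chunk_size):
        xs = x[start:start + chunk_size]
        ts = targets[start:start + chunk_size]
        logits = linear(xs, weight)  # only this chunk's logits are materialized
        total += cross_entropy(logits, ts) * len(ts)  # undo the per-chunk mean
    return total / len(targets)

# Toy data: 3 samples, 2 features, 3 classes.
x = [[0.5, -1.0], [1.5, 2.0], [0.0, 0.25]]
weight = [[1.0, 0.0], [0.0, 1.0], [-1.0, 1.0]]
targets = [0, 2, 1]

unfused = cross_entropy(linear(x, weight), targets)
fused = chunked_linear_cross_entropy(x, weight, targets)
```

Both paths should give the same loss up to floating-point rounding. The difference is peak memory, which matters when the number of classes (the vocabulary size) is large.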
-
As title. Many thanks for your answer.
-
Hello,
could we please have 13B and 7B models with the updated architecture that includes grouped-query attention? A lot of people are running these models on machines with low memory and this woul…
-
Describe the bug
Times out when trying to finalize a large existing data model
To Reproduce
Steps to reproduce the behavior:
Go to : https://oxnetdwa02.oxnet.nhs.uk/mauro-data-mapper/#/home
Fin…
-
Basically, as the number of entity kinds and relationships in a given data model increases, it will become very difficult to get a good understanding of what is what. The graph itself will help out, b…
-
# URL
- https://arxiv.org/abs/2309.15427
# Affiliations
- Yijun Tian, N/A
- Huan Song, N/A
- Zichen Wang, N/A
- Haozhu Wang, N/A
- Ziqing Hu, N/A
- Fang Wang, N/A
- Nitesh V. Chawla, N/A…
-
> bash run.sh
# Below is the content of the run script
#!/usr/bin/env bash
export OMP_NUM_THREADS=2
torchrun --nproc_per_node 2 \
-m .run \
--output_dir ../models/bge-large-zh-medical-v2 \
--model_name_or_path ../BAAI/bge-…
-
We propose [MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models](https://arxiv.org/pdf/2405.13053). Our proposed MeteoRA (Multiple-Tasks embedded LoRA) is a scalable and efficient framewor…
-
With the recent advent of large models (take Llama 3.1 405b, for example!), distributed inference support is a must! We currently support naive device mapping, which works by allowing a combination of…
-
"Feature Request: Enhance Project with Support for Additional Large Language Models (LLMs) - Including Local AI Assistants
I've been utilizing your project, and it's truly impressive! I wanted to p…