issues
search
alexriggio
/
BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
Apache License 2.0
45
stars
6
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
CUDA out of memory
#4
ZZZsleepyheadZZZ
opened
5 months ago
1
THANK YOU; Q-LORA for vision language models
#3
Abhiram-kandiyana
opened
5 months ago
1
AssertionError: mismatched keys
#2
martijnsiepel01
closed
7 months ago
2
Mismatched keys in custom implemetation
#1
JAYANTH-MOHAN
closed
8 months ago
2