issues
search
databricks
/
dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
https://www.databricks.com/
Other
2.47k
stars
231
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update README.md
#33
computerscienceiscool
opened
2 months ago
0
Transformers Key Error : 'dbrx'
#32
vionwinnie
opened
2 months ago
0
Update README.md
#31
computerscienceiscool
opened
2 months ago
0
Include llama.cpp and llama_index example
#30
dennyglee
closed
2 months ago
0
Fine-tune dbrx-instruct on a single VM with 8 H100s
#29
hosseinsarshar
opened
2 months ago
1
Bad performance on PrOntoQA benchmark
#28
huskydoge
opened
2 months ago
1
Stuck on the output "Setting `pad_token_id` to `eos_token_id`:100257 for open-end generation." more then 10 mins
#27
wbgentleman
closed
2 months ago
2
HumanEval
#26
eyuansu62
opened
2 months ago
0
Does the tokenizer of this model have a network to load successfully?
#25
Rclurn
closed
2 months ago
1
How to get hands on experience as a newbie
#24
simkimsia
closed
3 months ago
1
Why pretrainModel class "_supports_sdpa" is False?
#23
duguwanglong
opened
3 months ago
0
Bump torch to 2.1.0
#22
hanlint
closed
3 months ago
1
`convert_ids_to_tokens` not working as expected.
#21
jcao-ai
closed
3 months ago
1
I have encountered a problem:LayerNorm.__init__() got an unexpected keyword argument 'bias'
#20
gyh123wqe
closed
3 months ago
1
doc - Fixed doc formatting: Added code documentation
#19
TtesseractT
closed
2 months ago
1
docker compose for GPU
#18
joecryptotoo
opened
3 months ago
0
Update README.md
#17
eltociear
closed
3 months ago
0
Missing tokenizer when use vllm
#16
matrixssy
opened
3 months ago
4
Quantized distilled version
#15
florian-hoenicke
closed
3 months ago
1
Silu or Glu activation?
#14
jcao-ai
closed
3 months ago
1
Add finetuning yaml links
#13
hanlint
closed
3 months ago
0
Fine Tuning?
#12
aiyinyuedejustin
closed
3 months ago
1
Is DBRX, the most powerful open-source LLM yet?
#11
ateeq-pk
closed
3 months ago
1
Real Performance versus llama-70B?
#10
JadeRay
closed
3 months ago
1
How inference efficiency is measured
#9
FC-Li
opened
3 months ago
11
What's the optimal parallel strategy using TensorRT-LLM?
#8
iteratorlee
opened
3 months ago
2
How to use API?
#7
AlyssaZyt
closed
3 months ago
1
training slow
#6
jiangix-paper
closed
3 months ago
1
Loading over multiple gpus in 8bit and 4bit with transformers loader
#5
RandomInternetPreson
opened
3 months ago
4
Update vLLM instructions
#4
bandish-shah
closed
3 months ago
0
generate.py : tiktoken.py throws Encoding import error
#3
cpumaxx
closed
3 months ago
1
fix typo in quick start
#2
asnelling
closed
3 months ago
0
Update PR links for inference libraries
#1
megha95
closed
3 months ago
0