-
### Feature request
I see [llama](https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/modeling_llama.py#L829-L835) will remove tuple past key values in 4.43.
### Moti…
-
### Type of Controller
Xbox One
### OS Version
13.2.1 (22D68)
### Driver Version
0.16.11_Notarized
### Connection Method
Wired
### Device Name and Info
(If you don't know this informa…
Heyl0 updated
4 months ago
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
When having the option "Use cross attention optimizations…
-
For the [workspace -l] command, the user can type anything after the '-l'. It will be better to detect incorrect user inputs and notify the users
![Snipaste_2020-11-13_16-38-07.png](https://raw.githu…
-
the code is as follows. It is from https://www.tensorflow.org/guide/keras/masking_and_padding
import numpy as np
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import…
-
The dump is quite large, but here's the initial error:
```julia
error in running finalizer: MethodError(f=Base.delete!, args=(SVGMakie.Screen(scene=Makie.Scene(parent=nothing, events=Makie.Events(wi…
-
We appreciate you go through Apollo documentations and search previous issues before creating an new one. If neither of the sources helped you with your issues, please report the issue using the follo…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
微调报错 ValueError: None is not in list
$ sh train.sh
08/16/2023 16:29:49 - WARNING - __main__…
-
Hello, I have a question.
Currently, I am trying to use multiple RTSP streams. However, when I run several streams, the speed becomes very slow, making it practically unusable.
I am wondering if…
-
HELLO !
I'm writting a program that works without pathos but it uses 1 processor / 16.
I found a prime number with 40000 digits but it takes 1 week to get the result (only 1 processor)
My goal is t…