-
We need tests to make sure our implementation gives the same results as NY's version.
-
Bad documentation. Not very long errors
Detecting toxicity in outputs generated by Large Language Models (LLMs) is crucial for ensuring that these models produce safe, respectful, and appropriate con…
-
**Keras version:** 3.5.0
**Backend:** TensorFlow 2.17.0
I encountered a strange bug when working with the GRU layer. If you create a simple model with a GRU layer and set `recurrent_dropout=0.5`…
-
It would be great to have the chart show points (8760) instead of bars, like the one below:
![image](https://user-images.githubusercontent.com/67355063/168959441-e03dc719-cccd-4354-bd78-9faea63ada7…
-
@lukaszkaiser
This is to illustrate what I discussed on Gitter.
Working with WMT EN-FR, I have observed the following.
You can replicate the paper results with "transformer -base" with 4 GP…
-
1/25 (Wed), 10:00
1. 11.3.1
2. 11.3.2
3. 11.3.3-11.3.4
![image](https://user-images.githubusercontent.com/50584633/213716658-dae86ea5-7a11-43af-bbde-5d3c01e5adf3.png)
-
### Bug Description
Hello, I have an issue while using TimeSeriesForecaster to forecast my data.
### Bug Reproduction
Code for reproducing the bug:
```py
import pandas as pd
import matplotlib.p…
-
Dear teacher @KienTrann,
We have now finished writing the first version of the Abstract and Chapter 1: Introduction.
Could you please look it over and give us your comments?
In addition, I would also like to ask…
-
Using the https://github.com/bmaltais/kohya_ss/tree/sd3-flux.1 branch, I keep getting access violation errors.
The system has 16 GB of RAM and an RTX 4070 Ti Super.
```
Traceback (most recent call last)…