-
Hi,
I've added LayoutLMv2 and LayoutXLM to HuggingFace Transformers. I've also created several notebooks to fine-tune the model on custom data, as well as to use it for inference. Demo notebooks ca…
-
The keras package as a number of helper functions for doing data cleaning for neural networks, including pad_sequences, and texts_to_sequences. These functions seems to pass the R data to python which…
-
**Describe the bug**
Hello, I'm trying to run SCENIC+ using SnakeMake in a linux machine (centos 9), on the tutorial dataset.
I ran scATAC-seq preprocessing in python (using pycistopic, using the tu…
-
# Task Name
Animal Sound Classification
## Task Objective
Experts try to comprehend animal sound and the distinctions between them. This task aims to classify animal sound into different cate…
-
The issue here is that iterative_train_test_split is defined as
```
def iterative_train_test_split(
X: np.ndarray,
y: np.ndarray,
...
)
```
but somehow expects X and y to have a …
-
**Describe the bug**
Fail when input datasets have bool columns with many missing values.
**To Reproduce**
Steps to reproduce the behavior:
1. Show your code calling `generate_code()`.
scr…
-
# Task Name
Audio Tagging on Multiple Datasets
## Task Objective
This task is a variation of "Audio Tagging on AudioSet" before. For details of the original task, please refer to https://gith…
-
Great work on source code generation.
The details of the preprocessing of texts (naturel languges) and source code are missing from the paper. Would you kindly let me known what kind of preprossing h…
-
According to [doc](https://fluxml.ai/Flux.jl/stable/models/layers/#Convolution-and-Pooling-Layers-1)
> Data should be stored in WHCN order (width, height, # channels, batch size). In other words, a…
-
Link: https://huggingface.co/datasets/The-OpenROAD-Project/ORAssistant_RAG_Dataset
Goal: Do not spend any more time building/preprocessing the data.
- [x] `build_docs.py` should be used for gen…