-
- [x] The "text" field is often empty in the jsonl generated by the Wikipedia preprocessing script
- [ ] Verify the behavior of the Abeja tokenizer
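
One way to quantify the empty-field item above is to count the affected records directly. This is a minimal sketch, not the project's own tooling: the helper name `count_empty_text` and the sample records are hypothetical, and it assumes each JSONL line is a JSON object whose "text" field counts as empty when it is missing, not a string, or whitespace-only.

```python
import json
from typing import Iterable


def count_empty_text(lines: Iterable[str], field: str = "text") -> tuple[int, int]:
    """Return (empty, total) for an iterable of JSONL lines.

    Assumption: a record is "empty" when the field is missing,
    is not a string, or contains only whitespace.
    """
    empty = total = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        record = json.loads(line)
        total += 1
        value = record.get(field)
        if not isinstance(value, str) or not value.strip():
            empty += 1
    return empty, total


# Hypothetical sample records, for illustration only.
sample = [
    '{"id": 1, "text": "Tokyo is the capital of Japan."}',
    '{"id": 2, "text": ""}',
    '{"id": 3, "text": "   "}',
    '{"id": 4}',
]
empty, total = count_empty_text(sample)
print(f"{empty} of {total} records have an empty {'text'!r} field")
```

For a real dump, the same function can be passed an open file handle instead of the sample list, since it only iterates over lines.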
-
Ocrs does not currently apply any perspective correction to extracted text lines before applying recognition. The recognition model is trained to handle skewed and rotated inputs, but this only works …
-
It seems that the Spanish « opening and closing » double quotation marks are removed from the training data.
A lot of pre- and post-processing is required to support them:
I did a find and replace to ch…
-
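The introduction mentions that this project supports the seq2seq_text task:
seq2seq_text: Sequence to Sequence text generation problem
Could you explain how the function for the data input part (data_preprocessing) should be written? Thanks!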
-
For custom properties that make use of the 9 SGF data types a nice idea would be to expose the internal value type descriptor system in a simplified fashion. The main problem with this approach is tha…
-
Hi,
Does PreProcessing take a binary edgelist or an edgelist in text format?
I am passing an edgelist in text format and it is failing since it's trying to find the number of edges from the file size and…
-
When using yolov8_datagen.py, the YOLOv8 code cannot find the image paths. Please check the error.
-
**The bug**
I simply try to identify a noisy channel in the databrowser, right after loading the data.
`cfg = [];
cfg.dataset = 'subj_1.eeg';
cfg.continu…
-
**Describe the bug**
I have successfully installed the Tuva Project demo and used dbt build to load the demo data into Snowflake. When I try to do this with a local Postgres DB, however, I get the fo…
-
```
Integrate AraNLP.
---
https://sites.google.com/site/mahajalthobaiti/resources
AraNLP library is a Java-based toolkit for the processing of Arabic text. It
supports the most important preproces…