liyongsea/parallel_corpus_mnbvc
parallel corpus dataset from the mnbvc project
Apache License 2.0
7
stars
5
forks
issues
Update README.md
#27
liyongsea
closed
1 year ago
0
RuleBasedDetector
#26
voidf
closed
1 year ago
1
Create Readme for Textsegmenter
#25
liyongsea
closed
1 year ago
0
Rule-based detector and script for gpt segmentation
#24
voidf
closed
1 year ago
0
add: OfflineGptDetector
#23
voidf
closed
1 year ago
0
Create simple evaluation framework and val dataset
#22
liyongsea
closed
1 year ago
2
New Artificial Segmentation Detector
#21
Wzixiao
closed
1 year ago
0
Text Segmenter and evaluation
#20
liyongsea
closed
1 year ago
0
Manually reverse-engineered dataset
#19
liyongsea
closed
10 months ago
3
ChatGPT-synthesized segmentation dataset
#18
liyongsea
closed
1 year ago
3
Extract text
#17
Wzixiao
opened
1 year ago
0
Restructure the repo
#16
liyongsea
closed
7 months ago
0
[UN dataset] Garbled Arabic text
#15
liyongsea
closed
7 months ago
0
Download sitemap pdf
#14
Wzixiao
closed
1 year ago
0
Get pdf
#13
Wzixiao
closed
1 year ago
0
Download pdf
#12
Wzixiao
closed
1 year ago
0
Download pdf
#11
Wzixiao
closed
1 year ago
0
Download PDFs from after the year 2000
#10
Wzixiao
closed
1 year ago
0
make pdf information datasets and upload
#9
Wzixiao
closed
1 year ago
0
Download sitemap url resources and parse
#8
Wzixiao
closed
1 year ago
0
Translate pdf to text
#7
Wzixiao
closed
1 year ago
0
feat[en]: rule-based English paragraph join
#6
voidf
closed
6 months ago
2
Translate pdf to text
#5
Wzixiao
closed
1 year ago
0
[UN corpus] Download PDFs from the UN Digital Library, convert them to text, and upload to Hugging Face
#4
liyongsea
closed
10 months ago
5
[Alignment] Propose alignment algorithm draft
#3
liyongsea
closed
1 year ago
11
Get pdf
#2
Wzixiao
closed
1 year ago
0
add get_news_content.py
#1
Wzixiao
closed
1 year ago
0