issues
search
liyongsea
/
parallel_corpus_mnbvc
parallel corpus dataset from the mnbvc project
Apache License 2.0
11
stars
6
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
【游戏语料】Metro系列
#93
voidf
opened
6 days ago
0
【平行语料来源探索】歌词语料
#92
voidf
opened
2 weeks ago
0
【联合国语料】数据review - 管线重跑2023
#91
voidf
opened
2 weeks ago
1
【语料格式升级】老语料升级成新的语料
#90
voidf
opened
2 weeks ago
1
【游戏语料】如龙系列(需要人做繁中转简中)
#89
voidf
opened
2 weeks ago
9
【联合国语料】论文相关工作
#88
voidf
opened
2 weeks ago
0
【联合国语料】docx表格检测工具 - xml形式里面能不能对齐
#87
voidf
opened
1 month ago
2
Readme patch
#86
voidf
opened
1 month ago
2
Readme update
#85
Function-Samuel
closed
1 month ago
0
更新平行语料的后处理脚本
#84
voidf
closed
1 month ago
0
【游戏语料】【持续更新】米厂全家桶(原、鸣、星)
#83
voidf
opened
1 month ago
1
【游戏语料】【占坑待有买游戏的人提供一份游戏】无主之地2、3
#82
voidf
opened
2 months ago
0
【游戏语料】【已传中转站,处理完毕,待统一格式转换】魔女之泉R、魔女之泉3(steam版本)
#81
voidf
opened
2 months ago
3
【游戏语料】【已传中转站,可以先看下值不值得收录】lb
#80
voidf
opened
2 months ago
1
【游戏语料】【GTA5、GTA4、大表哥2已传中转站,待处理】【马克思佩恩可解但缺游戏本体】
#79
voidf
opened
2 months ago
2
【术语库】SDL Trados微软术语库
#78
voidf
closed
2 months ago
2
【平行语料来源探索】字幕语料
#77
voidf
opened
4 months ago
6
【Linux中国官方数据集】
#76
voidf
opened
4 months ago
4
【游戏语料】【已完成处理,待收录】黑帝斯
#75
voidf
closed
2 months ago
2
【游戏语料】【已传中转站,处理完毕,待收录】底特律:变人
#74
voidf
closed
1 month ago
5
【游戏语料】【已传中转站,待整理】杀戮尖塔
#73
voidf
opened
4 months ago
3
【游戏语料】【环和狼已发布,魂3等收录,魂12暂无游戏源文件】老头环、只狼、魂123
#72
voidf
opened
4 months ago
4
remove duplicated it_text
#71
voidf
closed
5 months ago
0
首页的json格式在本地显示错误?修改了下
#70
Function-Samuel
closed
1 month ago
1
update schema clarfication
#69
voidf
closed
5 months ago
0
update readme
#68
Wzixiao
closed
5 months ago
0
【游戏语料】【已传中转站,处理完毕,待收录】赛博朋克2077
#67
voidf
opened
6 months ago
5
【游戏语料】【已收录】博德之门3
#66
voidf
closed
4 months ago
2
rename schema
#65
voidf
closed
5 months ago
0
Change the json structure of the readme
#64
Wzixiao
closed
7 months ago
0
corpus-format-patch
#63
Wzixiao
closed
7 months ago
0
联合国6国语料的整条管线接入,爬虫重做
#62
voidf
opened
8 months ago
1
fix typo
#61
voidf
closed
8 months ago
0
更新README里描述的平行语料格式
#60
voidf
closed
8 months ago
0
merge main to doc2docx
#59
voidf
closed
10 months ago
0
Update README.md
#58
liyongsea
closed
10 months ago
0
Add AoPS web Crawler and use help.
#57
Leozw12
closed
11 months ago
0
大使馆语料转化成可发布的对齐格式
#56
liyongsea
closed
10 months ago
0
docx -> txt 需求
#55
voidf
closed
10 months ago
1
doc -> docx
#54
voidf
closed
8 months ago
1
Add U.S.Embassy crawler script
#53
Leozw12
closed
10 months ago
0
Add ChinaDaily crawler script.
#52
Leozw12
closed
1 year ago
0
Crawler script for China Daily.
#51
Leozw12
closed
1 year ago
0
download UN files
#50
cheng780
closed
10 months ago
1
crawl the link to the doc document of the UN
#49
liyongsea
closed
10 months ago
1
[UN corpus] 下载联合国documents(doc)
#48
Wzixiao
closed
10 months ago
5
Retrain bert and lstm as detector
#47
liyongsea
opened
1 year ago
0
Add setup function and upload pypi
#46
Wzixiao
opened
1 year ago
0
把mnbvc做成一个可以用pip install 的形式
#45
liyongsea
closed
1 year ago
0
correct preprocess statistic notebook
#44
voidf
closed
10 months ago
0
Next