issues
search
liyongsea
/
parallel_corpus_mnbvc
parallel corpus dataset from the mnbvc project
Apache License 2.0
7
stars
5
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
【游戏语料】底特律:变人
#74
voidf
opened
1 day ago
0
【游戏语料】杀戮尖塔
#73
voidf
opened
1 day ago
1
【游戏语料】老头环
#72
voidf
opened
1 day ago
0
remove duplicated it_text
#71
voidf
closed
2 weeks ago
0
首页的json格式在本地显示错误?修改了下
#70
Function-Samuel
opened
1 month ago
1
update schema clarfication
#69
voidf
closed
1 month ago
0
update readme
#68
Wzixiao
closed
1 month ago
0
【游戏语料】赛博朋克2077
#67
voidf
opened
1 month ago
1
【游戏语料】博德之门3
#66
voidf
opened
1 month ago
1
rename schema
#65
voidf
closed
1 month ago
0
Change the json structure of the readme
#64
Wzixiao
closed
2 months ago
0
corpus-format-patch
#63
Wzixiao
closed
3 months ago
0
联合国6国语料的整条管线接入,爬虫重做
#62
voidf
opened
3 months ago
1
fix typo
#61
voidf
closed
4 months ago
0
更新README里描述的平行语料格式
#60
voidf
closed
4 months ago
0
merge main to doc2docx
#59
voidf
closed
6 months ago
0
Update README.md
#58
liyongsea
closed
6 months ago
0
Add AoPS web Crawler and use help.
#57
Leozw12
closed
6 months ago
0
大使馆语料转化成可发布的对齐格式
#56
liyongsea
closed
6 months ago
0
docx -> txt 需求
#55
voidf
closed
6 months ago
1
doc -> docx
#54
voidf
closed
4 months ago
1
Add U.S.Embassy crawler script
#53
Leozw12
closed
6 months ago
0
Add ChinaDaily crawler script.
#52
Leozw12
closed
10 months ago
0
Crawler script for China Daily.
#51
Leozw12
closed
10 months ago
0
download UN files
#50
cheng780
closed
6 months ago
1
crawl the link to the doc document of the UN
#49
liyongsea
closed
6 months ago
1
[UN corpus] 下载联合国documents(doc)
#48
Wzixiao
closed
6 months ago
5
Retrain bert and lstm as detector
#47
liyongsea
opened
11 months ago
0
Add setup function and upload pypi
#46
Wzixiao
opened
11 months ago
0
把mnbvc做成一个可以用pip install 的形式
#45
liyongsea
closed
9 months ago
0
correct preprocess statistic notebook
#44
voidf
closed
6 months ago
0
Refactor preprocess script
#43
voidf
closed
1 year ago
2
Alignment Scheme
#42
Wzixiao
opened
1 year ago
0
preprocess script
#41
voidf
closed
1 year ago
0
fix: make sure script can run
#40
voidf
closed
1 year ago
0
feat: add single file request script
#39
voidf
closed
1 year ago
0
训练自己的成段的模型
#38
liyongsea
closed
9 months ago
0
中英文对齐的探索
#37
liyongsea
closed
6 months ago
2
使用一下新的16K的chatgpt
#36
liyongsea
closed
9 months ago
0
GPT并行合成数据(跑97%accuracy的版本)
#35
liyongsea
closed
9 months ago
7
Create production script
#34
Wzixiao
closed
1 year ago
1
Found a pandas lib bug and tried to fix it
#33
Wzixiao
closed
1 year ago
1
add eval output and display
#32
liyongsea
closed
1 year ago
0
GPT Batch Sequential Detector
#31
voidf
closed
1 year ago
1
Error investigation
#30
liyongsea
closed
1 year ago
4
Add fuzzy compare at 'gptbatchdetector' branch
#29
Wzixiao
closed
1 year ago
0
Batch Detector and GPT linebreak detection
#28
liyongsea
closed
1 year ago
0
Update README.md
#27
liyongsea
closed
1 year ago
0
RuleBasedDetector
#26
voidf
closed
1 year ago
1
Create Readme for Textsegmenter
#25
liyongsea
closed
1 year ago
0
Next