issues
search
hlp-ai
/
mt-data
MT Data
Apache License 2.0
1
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
新建build_vec_index_tsv()函数
#17
CyberRambler
closed
1 year ago
0
实现score_tsv_margin()方法,新建程序score_margin_test.py进行测试
#16
CyberRambler
closed
1 year ago
0
重做built_vec_index方法,分批处理大文件句子集
#15
CyberRambler
closed
1 year ago
0
更新了build_seg_vec_index.py,解决了分批建索引覆盖前一个的问题,把嵌入向量和建索引分开进行,都汇报进度
#14
CyberRambler
closed
1 year ago
0
MT-Data开发更新
#13
CyberRambler
closed
1 year ago
0
data开发更新:完善边缘分数打分程序
#12
CyberRambler
closed
1 year ago
0
更新了 url_language.py 中的语言类代码
#11
CyberRambler
closed
1 year ago
0
增加了 SentenceVectorizationLaBSE_2 类
#10
CyberRambler
closed
1 year ago
0
计算向量余弦相似度改用批处理,测试计算单个句对边缘值耗时约为0.19 ms
#9
CyberRambler
closed
1 year ago
0
实现sentence_vector中的三个类
#8
CyberRambler
closed
1 year ago
0
Add class SqliteLangStat
#7
Tsangski
closed
1 year ago
0
url配对功能实现
#6
CyberRambler
closed
1 year ago
0
Focused crawler with Python
#5
hlp-ai
closed
1 year ago
1
Extract sentences from HTML page
#4
hlp-ai
closed
1 year ago
1
Align sentendes of a Web site based on cross-lingual sentence embeddings
#3
hlp-ai
opened
1 year ago
1
From CommonCrawl WET files, for each web page count the lengths of texts of different languages
#2
hlp-ai
opened
1 year ago
1
Web page alignment based on URL patterns
#1
hlp-ai
opened
1 year ago
0