VIPL-SLP / VAC_CSLR

Visual Alignment Constraint for Continuous Sign Language Recognition. ( ICCV 2021)
https://openaccess.thecvf.com/content/ICCV2021/html/Min_Visual_Alignment_Constraint_for_Continuous_Sign_Language_Recognition_ICCV_2021_paper.html
Apache License 2.0
116 stars 19 forks source link

请问ctcdecode初始化所用的vocab为什么能用chr(20000-21296)生成呢? #12

Closed blankspark closed 2 years ago

blankspark commented 2 years ago

您的工作非常出色! 在ctcdecode的文档中,vocab要用待解码的字典来初始化,为什么代码实现用chr(20000+(0~1296))就可以实现呢?20000这个数字是特定的吗? 另外,您的论文中图5给出了模型生成标签与ground_truth和视频的对齐效果,但是我通过ctcdecode只能生成标签但无法用于对齐标注,请问这部分工作是需要额外的代码实现吗? 期待您的答复!

ycmin95 commented 2 years ago

Thanks for your attention to our work, the choice of init number (e.g., 20000) does not change the recognition results. As we only use the decoded indexes for predicition. It seems like vocabulary in ctcdecode can help word segmentation (space_id), which does not exist in current CSLR dataset.

Similar issue about the alignment results.

blankspark commented 2 years ago

感谢您的回复!我将尝试实现