yuka-friends / Windrecorder

Windrecorder is a memory search app by records everything on your screen in small size, to let you rewind what you have seen, query through OCR text or image description, and get activity statistics.
GNU General Public License v2.0
2.69k stars 109 forks source link

feat: Recognize multiple OCR languages ​​simultaneously #186

Open Antonoko opened 2 weeks ago

Antonoko commented 2 weeks ago

Current situation: Only one language can be specified for Windows Media OCR to recognize. In the case of multiple languages, all texts cannot be recognized. This may cause inconvenience for multilingual users or learners.

Goal: Support users to configure multiple OCR languages ​​for simultaneous recognition.

Specific implementation: Complete the integration of multilingual recognition results by executing Windows Media OCR multiple times.

User voice: https://github.com/yuka-friends/Windrecorder/discussions/168

UI reference: image

Steven52065 commented 2 weeks ago

我觉得与其多次执行识别多种语言不如换用相对更通用的OCR程序, 识别英文没有尝试,但是识别中文的准确度稍微是有点低了,换用更准确的OCR方案直接将不同的语言一起识别出来也可能省去多次执行并整合到一起的步骤吧。
就像这个效果:https://github.com/yuka-friends/Windrecorder/issues/18#issuecomment-1874195557


Translation: I think it is better to use a relatively more general OCR program instead of executing multiple language recognition multiple times. I haven't tried recognizing English, but the accuracy of recognizing Chinese is a bit low. Using a more accurate OCR solution to directly recognize different languages ​​together may also save the steps of multiple executions and integration.
Like this effect:https://github.com/yuka-friends/Windrecorder/issues/18#issuecomment-1874195557

Antonoko commented 2 weeks ago

我觉得与其多次执行识别多种语言不如换用相对更通用的OCR程序, 识别英文没有尝试,但是识别中文的准确度稍微是有点低了,换用更准确的OCR方案直接将不同的语言一起识别出来也可能省去多次执行并整合到一起的步骤吧。

俺觉得可以,这个应该会和 OCR 改造一起做(https://github.com/yuka-friends/Windrecorder/pull/146 )。不过因为 Windows 自带的 OCR 是最轻量节能和基础的方案,所以也会针对性地优化(同时考虑到对语言支持也是最全面的,用户可以任意选择微软支持的语言包来组合识别