microsoft / PowerToys

Windows system utilities to maximize productivity
MIT License
111k stars 6.53k forks source link

Vietnamese OCR #26007

Closed quochungtule closed 1 year ago

quochungtule commented 1 year ago

Description of the new feature / enhancement

I love the tool, it so great. But it would better to support Vietnamese letters in the text extractor add on. Thank you

Scenario when this would be used?

all financial statements in Vietnam are all paper scan version, It would be great to support Vietnamese languague

Supporting information

No response

NguyenASang commented 1 year ago

@quochungtule Basically, this tool uses Window API to extract text (read more here) so it would be really hard and complicated to add since we have to do it from scratch.

crutkas commented 1 year ago

https://learn.microsoft.com/en-us/windows/powertoys/text-extractor#supported-languages

https://learn.microsoft.com/en-us/windows/powertoys/text-extractor#how-to-install-an-ocr-language-pack

phuhuyhnm commented 10 months ago

Currently it does not support Vietnamese. You can use the PowerShell tool with administrator rights and execute the following command to see the list:

Get-WindowsCapability -Online | Where-Object { $_.Name -Like 'Language.OCR*' }

I recommend using the alternative application Capture2Text with Vietnamese support package for your purposes.