zyddnys / manga-image-translator

Translate manga/image (one-click translation of text in all kinds of images) https://cotrans.touhou.ai/
GNU General Public License v3.0

Performance Settings #106


KSimulation commented 1 year ago

Thanks for putting together such a cool project!

I wonder what performance settings I can change to boost the translation speed with a higher-end GPU or even multiple GPUs. I was trying it on an RTX3090 and noticed that the GPU usage is always below 30-40%.

I have played around with some settings, but they didn't seem to do the trick. Since the pipeline is quite long and involves multiple models, could you point out some performance settings that can speed up the translation?

Thanks in advance.

thatDudo commented 1 year ago

There is not much you can do. Obviously you can choose the bigger translator models if you use offline translators, but otherwise I'm not sure what might cause the bottleneck. Maybe the CPU or disk read/write speed? Perhaps in the future a mode could be implemented wherein multiple manga images are translated concurrently.
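A minimal sketch of what such a concurrent mode could look like, assuming a hypothetical `translate_image` function standing in for the real per-image pipeline:

```python
from concurrent.futures import ThreadPoolExecutor

def translate_image(path: str) -> str:
    """Placeholder for the real per-image pipeline
    (detection -> OCR -> translation -> inpainting -> rendering)."""
    return f"{path}: translated"

def translate_batch(paths, max_workers=4):
    # Overlap the per-image pipelines; stages that release the GIL
    # (network I/O, native GPU ops) can then actually run in parallel.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(translate_image, paths))

results = translate_batch(["001.png", "002.png", "003.png"])
```

Thread-based concurrency mainly helps if the heavy stages drop the GIL; if a pure-Python stage dominates, a process pool would be the fallback.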

KSimulation commented 1 year ago

When running a batch job, the CPU (13900K) usage is often below 20% and the disk read/write speed is below 30 MB/s. I think we need to figure out a way to do more parallel processing at each stage.

rspreet92 commented 1 year ago

@dmMaze has implemented pretty good batch processing in BallonsTranslator.

zyddnys commented 1 year ago

Running tasks in batches is possible, but it will not fully utilize a high-end GPU, as the bottleneck is going to be the CPU.

thatDudo commented 1 year ago

Perhaps what is currently most impactful is single-thread performance, which would explain the ~20% CPU usage. So making the batch translation multi-threaded might be well advised.
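Before threading anything, it might be worth confirming where the time actually goes. A quick `cProfile` run over one image would show whether a single Python-level stage dominates; `translate_image` here is a stand-in for the real entry point:

```python
import cProfile
import io
import pstats

def translate_image(path: str) -> str:
    # Stand-in for the real pipeline entry point.
    return f"{path}: translated"

profiler = cProfile.Profile()
profiler.enable()
translate_image("sample.png")
profiler.disable()

# Print the ten functions with the largest cumulative time.
buf = io.StringIO()
pstats.Stats(profiler, stream=buf).sort_stats("cumulative").print_stats(10)
report = buf.getvalue()
```

If one function accounts for most of the cumulative time and it is pure Python, threads alone won't help much because of the GIL.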

ryanolee commented 1 year ago

Translation is normally limited by the translation service you are using. If you are not using the offline modes, everything gets sent over the network to an external API, meaning processing is entirely dependent on the external service returning before things can continue. You will only see performance gains (at least for the actual text translation) from a larger GPU when specifically using the offline translation models :eyes:
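To illustrate the point: when the bottleneck is API round-trip latency rather than compute, issuing the requests concurrently hides most of the wait. This sketch simulates the network delay with `asyncio.sleep`; `fake_translate` stands in for a real API call:

```python
import asyncio
import time

async def fake_translate(text: str, latency: float = 0.1) -> str:
    # Simulated external translation API: the sleep models the
    # network round trip, not any local computation.
    await asyncio.sleep(latency)
    return text.upper()

async def translate_all(texts):
    # All requests are in flight at once, so total wall time is
    # roughly one round trip instead of len(texts) round trips.
    return await asyncio.gather(*(fake_translate(t) for t in texts))

start = time.perf_counter()
translated = asyncio.run(translate_all(["hello", "world", "manga"]))
elapsed = time.perf_counter() - start
```

With three sequential calls this would take ~0.3 s; concurrently it finishes in roughly one 0.1 s round trip. No amount of GPU helps here.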

singersbalm commented 1 year ago

You could increase performance with a JIT or a compiler like Cython.