Open photkey opened 9 months ago
I thought about it again, and perhaps a better approach would be to add a parameter that exports only the JSON. In this case, we would only need to read the text once. When encountering text that needs to be read at an accelerated pace, there would be no need to read it again. Additionally, there would be no need to merge all the audio files in the final step. Instead, we would only export a JSON file that contains the rate information. This would make the process more efficient for this particular use case
In actual use, sometimes certain segments are read too quickly, making it difficult to hear clearly. Therefore, it is requested to export the original srt file in JSON format along with the audio file, which includes the actual reading speed for each text segment. With this JSON file, we can achieve better reading effects by re-editing the video or re-editing the srt text.
srt:
json: In the following example snippets,
rate
represents the actual reading speed.