Makememo / MemoAI

MemoAI Video to translated text, subtitles and notes made easy.
https://memo.ac
502 stars 6 forks source link

Medium size model (English Only) generates random timeline, and no subtitle in the player #52

Closed yiqiaowang-arch closed 1 year ago

yiqiaowang-arch commented 1 year ago

Describe the bug As described in the title, Memo AI v1.0.10 behaves unexpected after creating subtitle for a 90-min long video.

394 00:44:44,000 --> 00:00:03,430 You see something like this here for the Swiss rivers.

395 00:44:48,000 --> 00:00:10,630 Technical operational constraints like ramping rates and so, and policy and taxes.



**To Reproduce**

1. Video with clear human voice in English uploaded
2. Prompt: This is a lecture in Energy System Analysis course in ETH Zurich, specifically, a lecture about bottom-up energy optimization. 
3. video playing while Memo AI transcribing
4. See error

**Expected behavior**
All subtitles should be in order and the next sentence should always have a timestamp later than the previous sentence.

**Screenshots**
![image](https://github.com/Makememo/MemoAI/assets/28997207/f1b524cd-fed2-4140-b65a-e3f6d7a5b049)

**Environment:**
 - OS: Windows 11 Pro
 - Memo AI v1.0.10
 - GPU enabled (RTX 3060 Laptop, memory=6GB)
Makememo commented 1 year ago

Thank you for your feedback. We will check the problem and fix it in the next version.

yiqiaowang-arch commented 1 year ago

Update: after regenerating using prompt: This is a lecture in Energy System Analysis course about bottom-up energy optimization. Each identification waits for the discussion to end instead of breaking in the middle. The timeline issue disappears.

Makememo commented 1 year ago

Update: after regenerating using prompt:

This is a lecture in Energy System Analysis course about bottom-up energy optimization. Each identification waits for the discussion to end instead of breaking in the middle.

The timeline issue disappears.

Can you share the original video with us? Let's find this problem.

yiqiaowang-arch commented 1 year ago

Update: after regenerating using prompt: This is a lecture in Energy System Analysis course about bottom-up energy optimization. Each identification waits for the discussion to end instead of breaking in the middle. The timeline issue disappears.

Can you share the original video with us? Let's find this problem.

Here's the video link in OneDrive: https://1drv.ms/v/s!Ai_5pui2MokRk_IQ2Sjg9EzuCJ6W5Q?e=G3w36g Hope it helps!

Makememo commented 1 year ago

We have made some fix, please see if there is an error timeline. https://memo.ac/releases.html

yiqiaowang-arch commented 1 year ago

Unfortunately I still have wrong timelines. This time I used the prompt: "Mathematical Optimization Course. Please add punctuation." image I will try again using my previous prompt and see if this time it works.

Makememo commented 1 year ago

@yiqiaowang-arch Is there a problem opening VAD? image

Makememo commented 1 year ago

And, what is the transcribe model?

yiqiaowang-arch commented 1 year ago

After the second try, this time the problem remains. image

In my case VAD is not enabled. I am using Medium.en 1.53 GB in all these cases. I will try again using VAD with your VAD settings in the screenshot.

yiqiaowang-arch commented 1 year ago

export_log.zip After VAD enabled I am not able to see the transcripted text anymore. Here's the log file for you to check.

Makememo commented 1 year ago

@yiqiaowang-arch Can you refer to this setting and try again. 0.2 or 0.3 2023-11-04 at 21 57 42@2x

yiqiaowang-arch commented 1 year ago

I have set this value to exactly as shown in your previous screenshot (mode=lenient, threshold=0.3) and it doesn't output anything.

Makememo commented 1 year ago

I have set this value to exactly as shown in your previous screenshot (mode=lenient, threshold=0.3) and it doesn't output anything.

Can you send me log? and try 0.2. 2023-11-04 at 23 18 50@2x

yiqiaowang-arch commented 12 months ago

Here's the log file with VAD threshold=0.3. I'll try 0.2 now.

2023-11-05.log

Update: 0.2 doesn't work as well. Log file attached below: 2023-11-05_03.log

Makememo commented 12 months ago

The new version is being tested and will be released recently. Do you have zoom? I want to see the problem remotely.

Please close VAD first.

yiqiaowang-arch commented 12 months ago

Yes, I am happy to share my experience via zoom. I will be available during the weekends, if that also works for you. I will first try to update to the newest version and see if the problem remains.

yiqiaowang-arch commented 11 months ago

Please contact via my email, in case you want me to show how the software behaves.