Support Large v2 - Githubissues

anotherjesse commented 1 year ago

https://github.com/openai/whisper/commit/4179ed2475cc84cba66868b516232ef1b74dacdf

OpenAI seems to have shipped a new version of the large model.

anotherjesse commented 1 year ago

The "large-v2" model is trained for more epochs with regularization and shows improved performance compared to the previous large.

It has the same architecture as the original large model.

When load_model("large") is called, the "large-v2" model will be loaded.

We will soon update the paper regarding this new model.

chenxwh commented 1 year ago

just pushed a version supporting this :)

R-Gallagher commented 1 year ago

Hi @chenxwh ,

Thanks for all your great work on this!

Just wanted to let you know that this new change is causing failures for models requesting the original "large" model that are set to use the latest model version.

Failing:

model = replicate.models.get("openai/whisper")
prediction = replicate.predictions.create(
        input={
                "audio": signed_audio_storage_link, 
                "model": "large", 
                "translate":False
            }, 
            version=model.versions.list()[0] # select latest model version
            )

Fix:

model = replicate.models.get("openai/whisper")
prediction = replicate.predictions.create(
        input={
                "audio": signed_audio_storage_link, 
                "model": "large-v2", 
                "translate":False
            }, 
            version=model.versions.list()[0] # select latest model version
            )

chenxwh commented 1 year ago

yes now it is set to use large-v1 for the original and large-v2 for the new weights

R-Gallagher commented 1 year ago

Ah, I see. I was using the above code in production and started getting errors, just left this here to help! I suppose I should pin the version number instead of always grabbing latest.

Cheers and thanks again!

ronyfadel commented 1 year ago

Will https://replicate.com/openai/whisper be updated with this?

chenxwh / cog-whisper

Support Large v2 #10