aws-solutions / content-analysis-on-aws

As of August 30, 2023, this AWS Solution is no longer available. Existing deployments will continue to run. The functionality provided by Content Analysis on AWS will be superseded by functionality in Media2Cloud on AWS and Content Localization on AWS. We encourage you to explore these solutions.
https://aws.amazon.com/solutions/implementations/aws-content-analysis/
Apache License 2.0

transcribe_error : The input file that you specified exceeds the maximum size of 2048.00 Mb. Try again with a smaller file #84

Closed manishkhr closed 1 year ago

manishkhr commented 1 year ago

Processing large files with Content Analysis on AWS fails at the transcription step with this error: 'transcribe_error': 'An error occurred (BadRequestException) when calling the StartTranscriptionJob operation: The input file that you specified exceeds the maximum size of 2048.00 Mb. Try again with a smaller file.'

Step output: { "Name": "TranscribeVideo", "AssetId": "f4377f80-9ebd-40f5-b886-1d6e12d42734", "WorkflowExecutionId": "b4b6ea77-6523-4dac-9b30-8a9a80598a5f", "Input": { "Media": { "Thumbnail": { "S3Bucket": "*", "S3Key": "***" }, "Audio": { "S3Bucket": "*", "S3Key": "**" }, "Video": { "S3Bucket": "****", "S3Key": "*" }, "ProxyEncode": { "S3Bucket": "****", "S3Key": "private/assets/***" } }, "MetaData": { "MediaconvertJobId": "1673454776422-2v7ebs", "AssetId": "f4377f80-9ebd-40f5-b886-1d6e12d42734", "Mediainfo_num_audio_tracks": "1", "WorkflowExecutionId": "b4b6ea77-6523-4dac-9b30-8a9a80598a5f", "MediaconvertInputFile": "*", "JobId": "23c48ab517beac038cd8a2f736298a17890dc0ee62bc4d3472d05c2875c2123a", "PageToken": "****" } }, "Configuration": { "MediaType": "Video", "Enabled": true, "TranscribeLanguage": "en-US" }, "Status": "Started", "MetaData": {}, "Media": {}, "Outputs": { "Error": "MasExecutionError", "Cause": "{\"errorMessage\": \"{'Name': 'TranscribeVideo', 'AssetId': 'f4377f80-9ebd-40f5-b886-1d6e12d42734', 'WorkflowExecutionId': 'b4b6ea77-6523-4dac-9b30-8a9a80598a5f', 'Input': {'Media': {'Thumbnail': {'S3Bucket': '*', 'S3Key': '**'}, 'Audio': {'S3Bucket': '***', 'S3Key': ''}, 'Video': {'S3Bucket': '', 'S3Key': ''}, 'ProxyEncode': {'S3Bucket': '*', 'S3Key': ''}}, 'MetaData': {'MediaconvertJobId': '1673454776422-2v7ebs', 'AssetId': 'f4377f80-9ebd-40f5-b886-1d6e12d42734', 'Mediainfo_num_audio_tracks': '1', 'WorkflowExecutionId': 'b4b6ea77-6523-4dac-9b30-8a9a80598a5f', 'MediaconvertInputFile': 's3:// **', 'JobId': '23c48ab517beac038cd8a2f736298a17890dc0ee62bc4d3472d05c2875c2123a', 'PageToken': '*vkHEav0/tUqbsuG29Rtaf7o8MveOm3HGnOyD4NLE60zxO2MG+8'}}, 'Configuration': {'MediaType': 'Video', 'Enabled': True, 'TranscribeLanguage': 'en-US'}, 'Status': 'Error', 'MetaData': {'transcribe_error': 'An error occurred (BadRequestException) when calling the StartTranscriptionJob operation: The input file that you specified exceeds the maximum size of 2048.00 Mb. Try again with a smaller file.'}, 'Media': {}}\", \"errorType\": \"MasExecutionError\", \"stackTrace\": [\" File \\"/var/task/start_transcribe.py\\", line 91, in lambda_handler\n raise MasExecutionError(operator_object.return_output_object())\n\"]}" } }

manishkhr commented 1 year ago

This appears to be caused by the 2 GB file size quota of the Amazon Transcribe service itself, not by the solution: https://docs.aws.amazon.com/transcribe/latest/dg/limits-guidelines.html
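For anyone hitting the same error, one possible workaround is to check the size of the audio object before the transcription step runs, so the workflow fails early with a clearer message. The snippet below is only a minimal sketch and is not code from this solution. The function names are hypothetical, and the byte limit assumes the "2048.00 Mb" in the error message means 2048 * 1024 * 1024 bytes. The only AWS call it relies on is `head_object` on a boto3 S3 client, which the caller passes in.

```python
# Assumption: "2048.00 Mb" in the error message is 2048 * 1024 * 1024 bytes.
TRANSCRIBE_MAX_BYTES = 2048 * 1024 * 1024


def exceeds_transcribe_limit(size_bytes, limit_bytes=TRANSCRIBE_MAX_BYTES):
    """Return True if a file of size_bytes is larger than the assumed quota."""
    return size_bytes > limit_bytes


def check_media_size(s3_client, bucket, key, limit_bytes=TRANSCRIBE_MAX_BYTES):
    """Look up the object size in S3 and raise if it is over the limit.

    s3_client is expected to behave like boto3.client("s3"); only
    head_object(Bucket=..., Key=...) is used here.
    """
    size_bytes = s3_client.head_object(Bucket=bucket, Key=key)["ContentLength"]
    if exceeds_transcribe_limit(size_bytes, limit_bytes):
        raise ValueError(
            f"s3://{bucket}/{key} is {size_bytes} bytes, which is over the "
            f"assumed Transcribe limit of {limit_bytes} bytes"
        )
    return size_bytes
```

With a real deployment this would be called as `check_media_size(boto3.client("s3"), bucket, key)` on the extracted audio object before `StartTranscriptionJob`. Files over the limit would still need to be split or re-encoded to a smaller size before they can be transcribed.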