Create an Audio Transcript with HAQM Transcribe
Introduction
Implementation
Create an S3 bucket and upload sample
In this step, you will download a sample audio file, create an S3 bucket, then upload the sample file to the S3 bucket. HAQM Transcribe accesses audio and video files for transcription exclusively from S3 buckets.
1. Download the file
To download the sample audio file to transcribe later in the tutorial, choose transcribe-sample.mp3.
2. Open the console
Select AWS Management Console to open the console in a new browser window, so you can keep this step-by-step guide open. When the screen loads, enter your user name and password to get started.
Using the Region drop down, select a Region that has HAQM Transcribe.

3. Open the S3 console
Type S3 in the search bar and select S3 to open the console.

4. Create a bucket
In the S3 dashboard choose Create bucket.
If this is the first time you have created a bucket, you will see a screen that looks like the image pictured here.
If you have already created S3 buckets, your S3 dashboard will list all the buckets you have created.

5. Enter a bucket name
Enter a unique bucket name. Bucket names must be unique across all existing bucket names in HAQM S3. There are a number of other restrictions on S3 bucket names as well.
Then select a Region to create your bucket in.

6. Review configuration details and create bucket
You have the ability to set up permissions for your S3 bucket. Leave the default values and scroll down.
You have many useful options for your S3 bucket including Versioning, Server Access Logging, Tags, Object-level Logging, and Default Encryption. We won't enable these features for this tutorial.
Select Create bucket.

7. Select your bucket
You will see your new bucket in the S3 console. Click on your bucket’s name to navigate to the bucket. Your bucket name will not be the same as pictured in the screenshot to the right.

8. Upload the sample file
You are in your bucket’s home page.
Select Upload.

9. Select the sample file and upload it
Upload the transcribe-sample.mp3 file by selecting Add files and selecting the file or dragging the transcribe-sample.mp3 file to the upload box.
Select Upload.

10. Copy the S3 URI
On successful upload, select the transcribe-sample.mp3 file in your bucket. A file detail page will be displayed for the transcribe-sample.mp3 file.
Copy the S3 URI link to the file and save it for use later in the tutorial.

Create transcription job
In this step, you will create and run a transcription job using the HAQM Transcribe console.
1. Open the Transcribe console
From the top menu bar, select Services then begin typing Transcribe in the search bar and select HAQM Transcribe to open the service console.

2. Open the Transcription jobs page
On the HAQM Transcribe console main page, open the navigation pane and click Transcription jobs.

3. Create Transcription job
On the Transcription jobs page, click Create job.

3. Create Transcription job
On the Transcription jobs page, click Create job.

4. Specify job details
On the Create transcription job page, in the Name field, type sample-transcription-job.
Leave the default Language as English.
Leave the default Model type as General model.
In the Input file location on S3 field, paste the link to the sample file in your S3 bucket. The link to your sample file will be different than the one shown in the screenshot to the right.
You can use the Custom vocabulary feature to help HAQM Translate recognize words and phrases that are specific to your application, such as a non-English name like Etienne. You won't use this feature in this tutorial.

5. Specify output data
Leave the default Output data location type as Service-managed S3 bucket.
HAQM Transcribe supports WebVTT (VTT) and SubRip (SRT) file types for subtitles. In the Subtitle file format field, you can choose either or both file types for output. If you select both types, you get two files that are exported to the same S3 bucket. Neither format is used in this tutorial.
Select Next.

5. Create job
You can configure additional audio, content, and custom vocabulary settings on the Configure job page.
For this tutorial, leave the default choices and select Create job.

Review transcription results
In this step, you will learn how to check on the progress and review the results of your transcription job.
1. Monitor the status of the job
After you click the Create job button, you will be taken to the Transcription jobs screen. It will show the status of sample-transcription-job. The status can be In progress, Complete, or Failed.
When the status is Complete, click on the sample-transcription-job link in the Name column to view the transcription results.

2. Check the transcription output
Next you will see the sample-transcription-job details. Scroll down to the Transcription panel to view the transcription job output.
In the JSON pane you can view the transcription results as it would be returned from the Transcribe API or AWS CLI.

Clean up
In this step, you will delete the sample file from your S3 bucket to avoid unnecessary charges.
1. Open the S3 console
In this upper navigation menu, begin typing S3 in the search bar and select S3 to open the console.

3. Delete the sample file
Select the transcribe-sample.mp3 file contained within your bucket and select Delete. Confirm the deletion.

Conclusion
As you have seen in this tutorial, HAQM Transcribe enables voice to text at scale. Use HAQM Transcribe for a wide range of audio or videos files, such as customer service calls, business meetings, broadcast TV, and on-demand videos.
Next Steps
Was this page helpful?
Let us know so we can improve the quality of our content.