AWS Machine Learning Blog
Category: Amazon Transcribe
Get started with automated metadata extraction using the AWS Media Analysis Solution
You can start extracting meaningful metadata from your media files within minutes by using the Media Analysis Solution on AWS, which provides AWS CloudFormation templates for deployment. With its web-based user interface, you can upload files and see the metadata that is automatically extracted. This solution uses Amazon Rekognition for facial recognition, Amazon Transcribe to create a transcript, and Amazon Comprehend to run sentiment analysis on the transcript. You can also upload your own images to an Amazon Rekognition collection and train the solution to recognize individuals. In this blog post, we’ll show you step by step how to launch the solution and upload an image and a video, so you can see firsthand how the metadata is extracted.
Amazon Transcribe now supports multi-channel transcriptions
Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy for developers to add speech-to-text capability to applications. We’re excited to announce the availability of a new feature called channel identification, which allows users to process multi-channel audio files and retrieve a single transcript annotated with respective channel labels.
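As a rough illustration of how channel identification might be requested, the sketch below builds the parameters for a StartTranscriptionJob call with the `ChannelIdentification` setting turned on. The helper name `build_channel_job_request`, the job name, and the S3 URI are hypothetical placeholders, not values from the original post.

```python
def build_channel_job_request(job_name, media_uri,
                              language_code="en-US", media_format="wav"):
    """Build keyword arguments for a StartTranscriptionJob request.

    The helper itself is hypothetical; the keys follow the Amazon Transcribe
    StartTranscriptionJob API, with channel identification enabled.
    """
    return {
        "TranscriptionJobName": job_name,
        "LanguageCode": language_code,
        "MediaFormat": media_format,
        "Media": {"MediaFileUri": media_uri},
        # Ask the service to transcribe each audio channel separately and
        # label the results by channel in a single transcript.
        "Settings": {"ChannelIdentification": True},
    }


if __name__ == "__main__":
    # Placeholder job name and bucket, for illustration only.
    request = build_channel_job_request(
        "example-two-channel-call",
        "s3://example-bucket/calls/example-call.wav",
    )
    print(request)
    # With boto3 installed and credentials configured, the request could be
    # sent like this (assumption: not executed here):
    #   import boto3
    #   boto3.client("transcribe").start_transcription_job(**request)
```

Keeping the request as a plain dictionary makes it easy to inspect or log before it is sent to the service.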
Announcing the Artificial Intelligence (AI) Hackathon: Build Intelligent Applications using machine learning APIs and serverless
Amazon Web Services (AWS) brings image and video analysis, natural language processing, speech recognition, text-to-speech, and machine translation within the reach of every developer. With machine learning (ML) services by AWS, you can plug prebuilt AI functionality into your apps without having to worry about ML models. Thousands of developers have used Amazon ML […]
Create video subtitles with translation using machine learning
Businesses around the globe need fast and reliable ways to transcribe audio and video files, often in multiple languages. This content can range from news broadcasts and call center phone interactions to job interviews, product demonstrations, or even court proceedings. The traditional process for transcription is both expensive […]
Amazon Transcribe now lets you designate your own Amazon S3 buckets to store transcription outputs
Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy for you to add a speech-to-text capability to your applications. You can use Amazon Transcribe to create text transcripts of audio and video files. Starting today, you can designate your own S3 buckets to store transcription outputs rather than S3 buckets maintained […]
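A minimal sketch of what designating your own output bucket could look like: the StartTranscriptionJob API accepts an `OutputBucketName` parameter, and the helper below adds it to the request. The function name `build_job_request_with_output_bucket` and all bucket and job names are hypothetical.

```python
def build_job_request_with_output_bucket(job_name, media_uri, output_bucket,
                                         language_code="en-US"):
    """Build StartTranscriptionJob arguments that send the transcript to a
    bucket you own. The helper and its arguments are illustrative only."""
    if not output_bucket:
        raise ValueError("output_bucket must be a non-empty bucket name")
    return {
        "TranscriptionJobName": job_name,
        "LanguageCode": language_code,
        "Media": {"MediaFileUri": media_uri},
        # Bucket name only (no "s3://" prefix); the service writes the
        # transcript JSON into this bucket.
        "OutputBucketName": output_bucket,
    }


if __name__ == "__main__":
    # Placeholder names, for illustration only.
    request = build_job_request_with_output_bucket(
        "example-interview",
        "s3://example-input-bucket/interview.mp3",
        "example-transcripts-bucket",
    )
    print(request)
    # Assumption: with boto3 available, this would be sent via
    #   boto3.client("transcribe").start_transcription_job(**request)
```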
Monitor Amazon Transcribe applications with AWS CloudTrail and Amazon CloudWatch Events
Monitoring your AWS resources is critical for security, performance, compliance, and cost control purposes. Therefore, our customers always ask for features to enable monitoring. Today, we are pleased to announce that Amazon Transcribe is integrated with AWS CloudTrail and Amazon CloudWatch Events to give you more visibility and control of your Amazon Transcribe resources. Let’s […]
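To make the CloudWatch Events integration a little more concrete, here is a hedged sketch of an event pattern that would match Amazon Transcribe job state changes. The helper name `build_transcribe_event_pattern` is hypothetical, and the status values shown are an assumption about which states you would want to react to.

```python
import json


def build_transcribe_event_pattern(statuses=("COMPLETED", "FAILED")):
    """Return a CloudWatch Events pattern matching Transcribe job state
    changes for the given statuses (hypothetical helper)."""
    return {
        "source": ["aws.transcribe"],
        "detail-type": ["Transcribe Job State Change"],
        "detail": {"TranscriptionJobStatus": list(statuses)},
    }


if __name__ == "__main__":
    pattern = build_transcribe_event_pattern()
    # The JSON string is what a rule definition would take as its pattern.
    print(json.dumps(pattern, indent=2))
```

A rule built from this pattern could then route matching events to a target such as an AWS Lambda function or an Amazon SNS topic.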
VidMob combines computer vision and language AI services for data-driven creative asset production
VidMob is a social video creation platform that marketers of all sizes can use to develop personalized advertising communications at scale. VidMob uses machine learning (ML) to power its SaaS application. This application uses metadata extraction and sentiment analysis to provide marketers with actionable insights into which creative assets resonate with their intended audience, and […]