AWS Machine Learning Blog
Category: HAQM SageMaker
Efficient Pre-training of Llama 3-like model architectures using torchtitan on HAQM SageMaker
In this post, we collaborate with the team working on PyTorch at Meta to showcase how the torchtitan library accelerates and simplifies the pre-training of Meta Llama 3-like model architectures. We showcase the key features and capabilities of torchtitan such as FSDP2, torch.compile integration, and FP8 support that optimize the training efficiency.
Time series forecasting with HAQM SageMaker AutoML
In this blog post, we explore a comprehensive approach to time series forecasting using the HAQM SageMaker AutoMLV2 Software Development Kit (SDK). SageMaker AutoMLV2 is part of the SageMaker Autopilot suite, which automates the end-to-end machine learning workflow from data preparation to model deployment.
How Aviva built a scalable, secure, and reliable MLOps platform using HAQM SageMaker
In this post, we describe how Aviva built a fully serverless MLOps platform based on the AWS Enterprise MLOps Framework and HAQM SageMaker to integrate DevOps best practices into the ML lifecycle. This solution establishes MLOps practices to standardize model development, streamline ML model deployment, and provide consistent monitoring.
Visier’s data science team boosts their model output 10 times by migrating to HAQM SageMaker
In this post, we learn how Visier was able to boost their model output by 10 times, accelerate innovation cycles, and unlock new opportunities using HAQM SageMaker.
Import a question answering fine-tuned model into HAQM Bedrock as a custom model
In this post, we provide a step-by-step approach of fine-tuning a Mistral model using SageMaker and import it into HAQM Bedrock using the Custom Import Model feature.
Using task-specific models from AI21 Labs on AWS
In this blog post, we will show you how to leverage AI21 Labs’ Task-Specific Models (TSMs) on AWS to enhance your business operations. You will learn the steps to subscribe to AI21 Labs in the AWS Marketplace, set up a domain in HAQM SageMaker, and utilize AI21 TSMs via SageMaker JumpStart.
How Northpower used computer vision with AWS to automate safety inspection risk assessments
In this post, we share how Northpower has worked with their technology partner Sculpt to reduce the effort and carbon required to identify and remediate public safety risks. Specifically, we cover the computer vision and artificial intelligence (AI) techniques used to combine datasets into a list of prioritized tasks for field teams to investigate and mitigate.
Scalable training platform with HAQM SageMaker HyperPod for innovation: a video generation case study
In this post, we share an ML infrastructure architecture that uses SageMaker HyperPod to support research team innovation in video generation. We will discuss the advantages and pain points addressed by SageMaker HyperPod, provide a step-by-step setup guide, and demonstrate how to run a video generation algorithm on the cluster.
Control data access to HAQM S3 from HAQM SageMaker Studio with HAQM S3 Access Grants
In this post, we demonstrate how to simplify data access to HAQM S3 from SageMaker Studio using S3 Access Grants, specifically for different user personas using IAM principals.
Llama 3.2 models from Meta are now available in HAQM SageMaker JumpStart
In this post, we show how you can discover and deploy the Llama 3.2 11B Vision model using SageMaker JumpStart. We also share the supported instance types and context for all the Llama 3.2 models available in SageMaker JumpStart.