AWS Machine Learning Blog

Category: AWS Cloud9

Solution overview

Build flexible and scalable distributed training architectures using Kubeflow on AWS and HAQM SageMaker

In this post, we demonstrate how Kubeflow on AWS (an AWS-specific distribution of Kubeflow) used with AWS Deep Learning Containers and HAQM Elastic File System (HAQM EFS) simplifies collaboration and provides flexibility in training deep learning models at scale on both HAQM Elastic Kubernetes Service (HAQM EKS) and HAQM SageMaker utilizing a hybrid architecture approach. […]