AWS HPC Blog
Category: AWS ParallelCluster
Improve HPC workloads on AWS for environmental sustainability
Need to cut your carbon footprint without sacrificing productivity? Migrating HPC workloads to the cloud allowed Baker Hughes to reduce emissions by 99%! Get tips for optimizing compute, storage, networking so you can do better.
A library of HPC Applications Best Practices on AWS
Want insights on running HPC codes efficiently on AWS? Our HPC specialists compiled their know-how into a new public GitHub repo. Get best practices, templates, scripts and more to optimize your workloads.
Call for participation: HPC tutorial series from the HPCIC
Interested in getting hands-on experience with cutting-edge HPC tools? Check out this blog post on an upcoming virtual training series from @LLNL and @AWSCloud. Learn emerging technologies from the experts this August.
Securing HPC on AWS: implementing STIGs in AWS ParallelCluster
Want to accelerate creating compliant HAQM EC2 images? Learn how HPC users can leverage cloud-native methods for applying STIG security standards.
Large scale training with NVIDIA NeMo Megatron on AWS ParallelCluster using P5 instances
Launching distributed GPT training? See how AWS ParallelCluster sets up a fast shared filesystem, SSH keys, host files, and more between nodes. Our guide has the details for creating a Slurm-managed cluster to train NeMo Megatron at scale.
Best practices for running molecular dynamics simulations on AWS Graviton3E
If you run molecular dynamics simulations, you need to read this. We walk through running benchmarks of popular apps like GROMACS and LAMMPS on new Hpc7g instances and Graviton3E processors. The results – up to 35% better vector performance versus Graviton3! Learn how to optimize your own workflows.
Renewable energy transition: examining the impacts of wind energy through simulation
As we move towards a greener future, understanding wind energy’s climate impacts is key. Check out this blog post by our friends at Whiffle, to learn how large-scale simulations reveal wind power’s effect on our atmosphere.
Protein language model training with NVIDIA BioNeMo framework on AWS ParallelCluster
In this new post, we discuss pre-training ESM-1nv for protein language modeling with NVIDIA BioNeMo on AWS. Learn how you can efficiently deploy and customize generative models like ESM-1nv on GPU clusters with ParallelCluster. Whether you’re studying protein sequences, predicting properties, or discovering new therapeutics, this post has tips to accelerate your protein AI workloads on the cloud.
Dynamic HPC budget control using a core-limit approach with AWS ParallelCluster
Balancing fixed budgets with fluctuating HPC needs is challenging. Discover a customizable solution for automatically setting weekly resource limits based on previous spending.
Enhancing ML workflows with AWS ParallelCluster and HAQM EC2 Capacity Blocks for ML
No more guessing if GPU capacity will be available when you launch ML jobs! EC2 Capacity Blocks for ML let you lock in GPU reservations so you can start tasks on time. Learn how to integrate Caacity Blocks into AWS ParallelCluster to optimize your workflow in our latest technical blog post.