Posted On: Nov 29, 2023

You can now use HAQM Bedrock to process prompts in batch to get responses for model evaluation, experimentation, and offline processing.

Using the batch API makes it more efficient to run inference with foundation models (FMs). It also allows you to aggregate responses and analyze them in batches.

Batch processing is available in preview in US East (N. Virginia), US West (Oregon), Asia Pacific (Singapore), Asia Pacific (Tokyo), and Europe (Frankfurt) AWS Regions.

To learn more about batch inference in HAQM Bedrock, see HAQM Bedrock API reference. Pricing for Batch mode is the same as pricing for On-Demand mode. For details, see the HAQM Bedrock pricing page.

Update: 2/27/2024 - The original post mistakenly listed the launch as generally available whereas it’s actually in preview, and has been updated accordingly.