AWS Big Data Blog
Category: Analytics
Infor’s HAQM OpenSearch Service Modernization: 94% faster searches and 50% lower costs
In this post, we’ll explore Infor’s journey to modernize its search capabilities, the key benefits they achieved, and the technologies that powered this transformation. We’ll also discuss how Infor’s customers are now able to more effectively search through business messages, documents, and other critical data within the ION OneView platform.
Demystify data sharing and collaboration patterns on AWS: Choosing the right tool for the job
Adoption of data lakes and the data mesh framework emerges as a powerful approach. By decentralizing data ownership and distribution, enterprises can break down silos and enable seamless data sharing. In this post, we discuss how to choose the right tool for building an enterprise data platform and enabling data sharing, collaboration and access within your organization and with third-party providers. We address three business use cases using AWS Glue, AWS Data Exchange, AWS Clean Rooms, and HAQM DataZone through three different use cases.
A customer’s journey with HAQM OpenSearch Ingestion pipelines
In this post, we share the journey of a multi-national financial credit reporting company, including the hurdles they faced, and why they went with HAQM OpenSearch Ingestion pipelines to make their log management smoother.
Single sign-on SSO for HAQM OpenSearch Service using SAML and Keycloak
In this post, we walk you through how to configure service provider-initiated authentication for OpenSearch Dashboards by using OpenSearch Service and Keycloak. We also discuss how to set up users, groups, and roles in Keycloak and configure their access to OpenSearch Dashboards.
Get started with HAQM DynamoDB zero-ETL integration with HAQM Redshift
We’re excited to announce the general availability (GA) of HAQM DynamoDB zero-ETL integration with HAQM Redshift, which enables you to run high-performance analytics on your DynamoDB data in HAQM Redshift with little to no impact on production workloads running on DynamoDB. As data is written into a DynamoDB table, it’s seamlessly made available in HAQM Redshift, eliminating the need to build and maintain complex data pipelines.
Elevate your search and analytics skills with the new HAQM OpenSearch Service YouTube channel
We’re thrilled to announce the launch of the official HAQM OpenSearch Service YouTube channel—a comprehensive resource for anyone looking to master HAQM OpenSearch Service. Whether you’re just getting started with searches , vectors, analytics, or you’re looking to optimize large-scale implementations, our channel can be your go-to resource to help you unlock the full potential of OpenSearch Service.
Migrate from HAQM Kinesis Data Analytics for SQL to HAQM Managed Service for Apache Flink and HAQM Managed Service for Apache Flink Studio
HAQM Kinesis Data Analytics for SQL is a data stream processing engine that helps you run your own SQL code against streaming sources to perform time series analytics, feed real-time dashboards, and create real-time metrics. AWS has made the decision to discontinue Kinesis Data Analytics for SQL, effective January 27, 2026. In this post, we explain why we plan to end support for Kinesis Data Analytics for SQL, alternative AWS offerings, and how to migrate your SQL queries and workloads.
Enriching metadata for accurate text-to-SQL generation for HAQM Athena
In this post, we demonstrate the critical role of metadata in text-to-SQL generation through an example implemented for HAQM Athena using HAQM Bedrock. We discuss the challenges in maintaining the metadata as well as ways to overcome those challenges and enrich the metadata.
Enhance HAQM EMR scaling capabilities with Application Master Placement
Starting with the HAQM EMR 7.2 release, HAQM EMR on EC2 introduced a new feature called Application Master (AM) label awareness, which allows users to enable YARN node labels to allocate the AM containers within On-Demand nodes only. In this post, we explore the key features and use cases where this new functionality can provide significant benefits, enabling cluster administrators to achieve optimal resource utilization, improved application reliability, and cost-efficiency in your EMR on EC2 clusters.
Take manual snapshots and restore in a different domain spanning across various Regions and accounts in HAQM OpenSearch Service
This post provides a detailed walkthrough about how to efficiently capture and manage manual snapshots in OpenSearch Service. It covers the essential steps for taking snapshots of your data, implementing safe transfer across different AWS Regions and accounts, and restoring them in a new domain. This guide is designed to help you maintain data integrity and continuity while navigating complex multi-Region and multi-account environments in OpenSearch Service.