AWS Big Data Blog

Category: HAQM Managed Streaming for Apache Kafka (HAQM MSK)

How Gupshup built their multi-tenant messaging analytics platform on HAQM Redshift

Gupshup is a leading conversational messaging platform, powering over 10 billion messages per month. Across verticals, thousands of large and small businesses in emerging markets use Gupshup to build conversational experiences across marketing, sales, and support. Gupshup’s carrier-grade platform provides a single messaging API for 30+ channels, a rich conversational experience-building tool kit for any […]

Break data silos and stream your CDC data with HAQM Redshift streaming and HAQM MSK

Data loses value over time. We hear from our customers that they’d like to analyze the business transactions in real time. Traditionally, customers used batch-based approaches for data movement from operational systems to analytical systems. Batch load can run once or several times a day. A batch-based approach can introduce latency in data movement and […]

HAQM MSK now provides up to 29% more throughput and up to 24% lower costs with AWS Graviton3 support

HAQM Managed Streaming for Apache Kafka (HAQM MSK) is a fully managed service that enables you to build and run applications that use Apache Kafka to process streaming data. Today, we’re excited to bring the benefits of Graviton3 to Kafka workloads, with HAQM MSK now offering M7g instances for new MSK provisioned clusters. AWS Graviton […]

HAQM MSK Serverless now supports Kafka clients written in all programming languages

HAQM MSK Serverless is a cluster type for HAQM Managed Streaming for Apache Kafka (HAQM MSK) that is the most straightforward way to run Apache Kafka clusters without having to manage compute and storage capacity. With MSK Serverless, you can run your applications without having to provision, configure, or optimize clusters, and you pay for […]

HAQM MSK IAM authentication now supports all programming languages

The AWS Identity and Access Management (IAM) authentication feature in HAQM Managed Streaming for Apache Kafka (HAQM MSK) now supports all programming languages. Administrators can simplify and standardize access control to Kafka resources using IAM. This support is based on SASL/OUATHBEARER, an open standard for authorization and authentication. Both HAQM MSK provisioned and serverless cluster […]

Resolve private DNS hostnames for HAQM MSK Connect

HAQM MSK Connect is a feature of HAQM Managed Streaming for Apache Kafka (HAQM MSK) that offers a fully managed Apache Kafka Connect environment on AWS. With MSK Connect, you can deploy fully managed connectors built for Kafka Connect that move data into or pull data from popular data stores like HAQM S3 and HAQM […]

Non-JSON ingestion using HAQM Kinesis Data Streams, HAQM MSK, and HAQM Redshift Streaming Ingestion

Organizations are grappling with the ever-expanding spectrum of data formats in today’s data-driven landscape. From Avro’s binary serialization to the efficient and compact structure of Protobuf, the landscape of data formats has expanded far beyond the traditional realms of CSV and JSON. As organizations strive to derive insights from these diverse data streams, the challenge […]

Architecture Diagram

Build event-driven architectures with HAQM MSK and HAQM EventBridge

Based on immutable facts (events), event-driven architectures (EDAs) allow businesses to gain deeper insights into their customers’ behavior, unlocking more accurate and faster decision-making processes that lead to better customer experiences. In EDAs, modern event brokers, such as HAQM EventBridge and Apache Kafka, play a key role to publish and subscribe to events. EventBridge is […]

Stitch Fix seamless migration: Transitioning from self-managed Kafka to HAQM MSK

Stitch Fix is a personalized clothing styling service for men, women, and kids. In this post, we will describe how and why we decided to migrate from self-managed Kafka to HAQM Managed Streaming for Apache Kafka (HAQM MSK).

Externalize HAQM MSK Connect configurations with Terraform

Managing configurations for HAQM MSK Connect, a feature of HAQM Managed Streaming for Apache Kafka (HAQM MSK), can become challenging, especially as the number of topics and configurations grows. In this post, we address this complexity by using Terraform to optimize the configuration of the Kafka topic to HAQM S3 Sink connector. By adopting this […]