AWS Database Blog

Category: Analytics

Stream data from HAQM DocumentDB to HAQM Kinesis Data Firehose using AWS Lambda

February 9, 2024: HAQM Kinesis Data Firehose has been renamed to HAQM Data Firehose. Read the AWS What’s New post to learn more. In this post, we discuss how to create the data pipelines from HAQM DocumentDB (with MongoDB compatibility) to HAQM Kinesis Data Firehose and publish changes to your destination store. HAQM DocumentDB (with […]

Migrate an Informix database to HAQM Aurora PostgreSQL using CData Connect Cloud from within AWS Glue Studio

HAQM Aurora PostgreSQL-Compatible Edition is a fully managed PostgreSQL-compatible database engine running in AWS and is a drop-in replacement for PostgreSQL. Aurora PostgreSQL is cost-effective to set up, operate, and scale, and can be deployed for new or existing applications. Informix is a relational database management system from IBM and supports OLTP and other workloads. […]

Stream data with HAQM DocumentDB, HAQM MSK Serverless, and HAQM MSK Connect

A common trend in modern application development and data processing is the use of Apache Kafka as a standard delivery mechanism for data pipeline and fan-out approach. HAQM Managed Streaming for Apache Kafka (HAQM MSK) is a fully-managed, highly available, and secure service that makes it simple for developers and DevOps managers to run applications […]

Automate the migration of Microsoft SSIS packages to AWS Glue with AWS SCT

When you migrate Microsoft SQL Server workloads to AWS, you might want to automate migration and minimize changes to existing applications, but still use a cost-effective option without commercial licenses and reduce operational overhead. For example, SQL Server workloads often use SQL Server Integration Services (SSIS) to extract, transform, and load (ETL) data. In this […]

Migrate data from Apache HBase to HAQM DynamoDB

Over the last few years, organizations have started adopting a cloud first strategy, and we are seeing enterprises migrate their mission-critical applications, along with their data platforms, to the cloud. Occasionally, organizations need guidance in selecting the right service and solution in the cloud, along with an approach to assist with the migration. In this […]

Joining historical data between HAQM Athena and HAQM RDS for PostgreSQL

While databases are used to store and retrieve data, there are situations where applications should archive or purge the data to reduce storage costs or improve performance. However, there are often business requirements where an application must query both active data and archived data simultaneously. Developers need a solution that lets them benefit from using […]

Security is time series: How VMware Carbon Black improves and scales security observability with HAQM Timestream

August 30, 2023: HAQM Kinesis Data Analytics has been renamed to HAQM Managed Service for Apache Flink. Read the announcement in the AWS News Blog and learn more. HAQM Timestream is a fast, serverless, and secure time series database and analytics service that can scale to process trillions of time series events per day. Organizations […]

Migrate billions of records from an Oracle data warehouse to HAQM Redshift using AWS DMS

Customers are migrating to HAQM Redshift to modernize their data warehouse solution and help save on their licensing, support, operations, and maintenance costs. To migrate data from an on-premises data warehouse to HAQM Redshift, you can use services such as AWS Database Migration Service (AWS DMS), AWS Schema Conversion Tool (AWS SCT), HAQM Simple Storage […]

Implement vertical partitioning in HAQM DynamoDB using AWS Glue

In this post, we show you how to use AWS Glue to perform vertical partitioning of JSON documents when migrating document data from HAQM Simple Storage Service (HAQM S3) to HAQM DynamoDB. You can use this technique for other data sources, including relational and NoSQL databases. DynamoDB can store and retrieve any amount of data, […]

How CSC Generation powers product discovery with knowledge graphs using HAQM Neptune

This post is co-written with Bobber Cheng and Ronit Rudra from CSC Generation. CSC Generation is a company that focuses on acquiring overlooked stores and catalog-based retailers and transforming them into high-performance, digital-first brands. As we grew through multiple acquisitions, it became apparent that our legacy product information system (PIM), backed by relational databases, was […]