AWS Big Data Blog

Tag: HAQM RDS

Automate the archive and purge data process for HAQM RDS for PostgreSQL using pg_partman, HAQM S3, and AWS Glue

The post Archive and Purge Data for HAQM RDS for PostgreSQL and HAQM Aurora with PostgreSQL Compatibility using pg_partman and HAQM S3 proposes data archival as a critical part of data management and shows how to efficiently use PostgreSQL’s native range partition to partition current (hot) data with pg_partman and archive historical (cold) data in […]

Optimize Federated Query Performance using EXPLAIN and EXPLAIN ANALYZE in HAQM Athena

HAQM Athena is an interactive query service that makes it easy to analyze data in HAQM Simple Storage Service (HAQM S3) using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run. In 2019, Athena added support for federated queries to run SQL […]

How to delete user data in an AWS data lake

General Data Protection Regulation (GDPR) is an important aspect of today’s technology world, and processing data in compliance with GDPR is a necessity for those who implement solutions within the AWS public cloud. One article of GDPR is the “right to erasure” or “right to be forgotten” which may require you to implement a solution […]

Create data science environments on AWS for health analysis using OHDSI

This blog post demonstrates how to combine some of the OHDSI projects (Atlas, Achilles, WebAPI, and the OMOP Common Data Model) with AWS technologies. By doing so, you can quickly and inexpensively implement a health data science and informatics environment.

Deploy a Data Warehouse Quickly with HAQM Redshift, HAQM RDS for PostgreSQL and Tableau Server

One of the benefits of a data warehouse environment using both HAQM Redshift and HAQM RDS for PostgreSQL is that you can leverage the advantages of each service. HAQM Redshift is a high performance, petabyte-scale data warehouse service optimized for the online analytical processing (OLAP) queries typical of analytic reporting and business intelligence applications. On […]

How SmartNews Built a Lambda Architecture on AWS to Analyze Customer Behavior and Recommend Content

This is a guest post by Takumi Sakamoto, a software engineer at SmartNews. SmartNews in their own words: “SmartNews is a machine learning-based news discovery app that delivers the very best stories on the Web for more than 18 million users worldwide.” Data processing is one of the key technologies for SmartNews. Every team’s workload […]