AWS Big Data Blog
Category: Amazon S3 Glacier
Amazon EMR streamlines big data processing with simplified Amazon S3 Glacier access
In this post, we demonstrate how to set up and use Amazon EMR on EC2 with S3 Glacier for cost-effective data processing.
Working with timestamp with time zone in your Amazon S3-based data lake
With a data lake built on Amazon Simple Storage Service (Amazon S3), you can use the purpose-built analytics services for a range of use cases, from analyzing petabyte-scale datasets to querying the metadata of a single object. AWS analytics services support open file formats such as Parquet, ORC, JSON, Avro, CSV, and more, so it’s […]
Keeping your data lake clean and compliant with Amazon Athena
With the introduction of CTAS support for Amazon Athena (see Use CTAS statements with Amazon Athena to reduce cost and improve performance), you can not only query but also create tables using Athena with the associated data objects stored in Amazon Simple Storage Service (Amazon S3). These tables are often temporary in nature and used […]