AWS Big Data Blog
Category: HAQM Redshift
Connecting R with HAQM Redshift
Markus Schmidberger is a Senior Big Data Consultant for AWS Professional Services HAQM Redshift is a fast, petabyte-scale cloud data warehouse for PB of data. AWS customers are moving huge amounts of structured data into HAQM Redshift to offload analytics workloads or to operate their DWH fully in the cloud. Business intelligence and analytic teams […]
Building a Binary Classification Model with HAQM Machine Learning and HAQM Redshift
Guy Ernest is a Solutions Architect with AWS This post builds on Guy’s earlier posts Building a Numeric Regression Model with HAQM Machine Learning and Building a Multi-Class ML Model with HAQM Machine Learning. Many decisions in life are binary, answered either Yes or No. Many business problems also have binary answers. For example: “Is […]
Test drive two big data scenarios from the ‘Building a Big Data Platform on AWS’ bootcamp
Matt Yanchyshyn is a Sr. Manager for AWS Solutions Architecture AWS offers a number of events during the year such as our annual AWS re:Invent conference, the AWS Summit series, the AWS Pop-up Loft, and a variety of roadshows. All of these provide opportunities for AWS customers to attend talks focused on big data and […]
Optimizing for Star Schemas and Interleaved Sorting on HAQM Redshift
Chris Keyser is a Solutions Architect for AWS Many organizations implement star and snowflake schema data warehouse designs and many BI tools are optimized to work with dimensions, facts, and measure groups. Customers have moved data warehouses of all types to HAQM Redshift with great success. The HAQM Redshift team has released support for interleaved […]
A Zero-Administration HAQM Redshift Database Loader
Ian Meyers is a Solutions Architecture Senior Manager with AWS With this new AWS Lambda function, it’s never been easier to get file data into HAQM Redshift. You simply push files into a variety of locations on HAQM S3 and have them automatically loaded into your HAQM Redshift clusters. Using AWS Lambda with HAQM Redshift […]
Building Multi-AZ or Multi-Region HAQM Redshift Clusters
This blog post was last reviewed July, 2022. This post explores customer options for building multi-region or multi-availability zone (AZ) clusters. By default, HAQM Redshift has excellent tools to back up your cluster via snapshot to HAQM Simple Storage Service (HAQM S3). These snapshots can be restored in any AZ in that region or transferred […]
Using Attunity CloudBeam at UMUC to Replicate Data to HAQM RDS and HAQM Redshift
Matt Yanchyshyn is a Principal Solutions Architect at AWS. Brad Helicher, Director of Cloud Business at Attunity, also contributed to this post. Attunity is an APN Big Data Competency Partner. Introduction University of Maryland University College’s mission is to provide a quality education at an affordable cost to busy professionals, mainly adults who are juggling […]
Using HAQM Redshift to Analyze Your Elastic Load Balancer Traffic Logs
Biff Gaut is a Solutions Architect with AWS Introduction With the introduction of Elastic Load Balancing (ELB) access logs, administrators have a tremendous amount of data describing all traffic through their ELB. While HAQM Elastic MapReduce (HAQM EMR) and some partner tools are excellent solutions for ongoing, extensive analysis of this traffic, they can require […]
Best Practices for Micro-Batch Loading on HAQM Redshift
NOTE: HAQM Kinesis Data Firehose is a fully managed service for delivering real-time streaming data to HAQM Redshift. For more information, please visit the HAQM Kinesis Data Firehose documentation page, “Choosing HAQM Redshift for Your Destination.” February 9, 2024: HAQM Kinesis Data Firehose has been renamed to HAQM Data Firehose. Read the AWS What’s New […]