AWS Big Data Blog
Tag: HAQM Redshift
Orchestrate HAQM Redshift-Based ETL workflows with AWS Step Functions and AWS Glue
In this post, I show how to use AWS Step Functions and AWS Glue Python Shell to orchestrate tasks for those HAQM Redshift-based ETL workflows in a completely serverless fashion. AWS Glue Python Shell is a Python runtime environment for running small to medium-sized ETL tasks, such as submitting SQL queries and waiting for a response. Step Functions lets you coordinate multiple AWS services into workflows so you can easily run and monitor a series of ETL tasks. Both AWS Glue Python Shell and Step Functions are serverless, allowing you to automatically run and scale them in response to events you define, rather than requiring you to provision, scale, and manage servers.
Protect and Audit PII data in HAQM Redshift with DataSunrise Security
This post focuses on active security for HAQM Redshift, in particular DataSunrise’s capabilities for masking and access control of personally identifiable information (PII), which you can back with DataSunrise’s passive security offerings such as auditing access of sensitive information. This post discusses DataSunrise security for HAQM Redshift, how it works, and how to get started.
Automate HAQM Redshift cluster creation using AWS CloudFormation
In this post, I explain how to automate the deployment of an HAQM Redshift cluster in an AWS account. AWS best practices for security and high availability drive the cluster’s configuration, and you can create it quickly by using AWS CloudFormation. I walk you through a set of sample CloudFormation templates, which you can customize as per your needs.
How to migrate a large data warehouse from IBM Netezza to HAQM Redshift with no downtime
In this article, we explain how this customer performed a large-scale data warehouse migration from IBM Netezza to HAQM Redshift without downtime, by following a thoroughly planned migration process, and leveraging AWS Schema Conversion Tool (SCT) and HAQM Redshift best practices.
Bringing your stored procedures to HAQM Redshift
HAQM always works backwards from the customer’s needs. Customers have made strong requests that they want stored procedures in HAQM Redshift, to make it easier to migrate their existing workloads from legacy, on-premises data warehouses.
With that primary goal in mind, AWS chose to implement PL/pqSQL stored procedure to maximize compatibility with existing procedures and simplify migrations. In this post, we discuss how and where to use stored procedures to improve operational efficiency and security. We also explain how to use stored procedures with AWS Schema Conversion Tool.
How 3M Health Information Systems built a healthcare data reporting tool with HAQM Redshift
After reviewing many solutions, 3M HIS chose HAQM Redshift as the appropriate data warehouse solution. We concluded HAQM Redshift met our needs; a fast, fully managed, petabyte-scale data warehouse solution that uses columnar storage to minimize I/O, provides high data compression rates, and offers fast performance. We quickly spun up a cluster in our development environment, built out the dimensional model, loaded data, and made it available to perform benchmarking and testing of the user data. An extract, transform, load (ETL) tool was used to process and load the data from various sources into HAQM Redshift.
Query your HAQM Redshift cluster with the new Query Editor
Data warehousing is a critical component for analyzing and extracting actionable insights from your data. HAQM Redshift is a fast, scalable data warehouse that makes it cost-effective to analyze all of your data across your data warehouse and data lake. The HAQM Redshift console recently launched the Query Editor. The Query Editor is an in-browser […]
Federate HAQM Redshift access with Okta as an identity provider
December 2022: This post was reviewed and updated for accuracy. Managing database users and access can be a daunting and error-prone task. In the past, database administrators had to determine which groups a user belongs to and which objects a user/group is authorized to use. These lists were maintained within the database and could easily […]
Grant fine-grained access to the HAQM Redshift Management Console
As a fully managed service, HAQM Redshift is designed to be easy to set up and use. In this blog post, we demonstrate how to grant access to users in an operations group to perform only specific actions in the HAQM Redshift Management Console. If you implement a custom IAM policy, you can set it […]
Build a modern analytics stack optimized for sharing and collaborating with Mode and HAQM Redshift
Leading technology companies, such as Netflix and Airbnb, are building on AWS to solve problems on the edge of the data ecosystem. While these companies show us what data and analytics make possible, the complexity and scale of their problems aren’t typical. Most of our challenges aren’t figuring out how to process billions of records […]