AWS Big Data Blog

Tag: Amazon Redshift

Amazon Redshift UDF repository on AWSLabs

Christopher Crosbie is a Healthcare and Life Science Solutions Architect with Amazon Web Services. Zach Christopherson, an Amazon Redshift Database Engineer, contributed to this post. Did you ever have a need for complex string parsing in Amazon Redshift and wish you could simply add f_parse_url_query_string(url) to your SQL query? Have you ever tried to weigh which would be less […]

Query Routing and Rewrite: Introducing pgbouncer-rr for Amazon Redshift and PostgreSQL

This post was last reviewed and updated in August 2022 with a section on Deploying pgbouncer in Elastic Kubernetes Service (EKS). NOTE: You can now use federated queries in Amazon Redshift to query and analyze data across operational databases, data warehouses, and data lakes. For more information, please review the Amazon Redshift documentation article, “Querying Data […]

Migrating Metadata when Encrypting an Amazon Redshift Cluster

NOTE: Amazon Redshift now supports enabling and disabling encryption with 1-click. For more information, please review this “What’s New” post. John Loughlin is a Solutions Architect with Amazon Web Services. A customer came to us asking for help expanding and modifying their Amazon Redshift cluster. In the course of responding to their request, we […]

Introduction to Python UDFs in Amazon Redshift

Christopher Crosbie is a Healthcare and Life Science Solutions Architect with Amazon Web Services. When your doctor takes out a prescription pad at your yearly checkup, do you ever stop to wonder what goes into her thought process as she decides on which drug to scribble down? We assume that journals of scientific evidence coupled […]

Integrating Amazon Kinesis, Amazon S3 and Amazon Redshift with Cascading on Amazon EMR

This is a guest post by Ryan Desmond, Solutions Architect at Concurrent. Concurrent is an AWS Advanced Technology Partner. With Amazon Kinesis, developers can quickly store, collate, and access large, distributed data streams such as access logs, click streams, and IoT data in real time. The question then becomes, how can we access and leverage this […]

Connecting R with Amazon Redshift

Markus Schmidberger is a Senior Big Data Consultant for AWS Professional Services. Amazon Redshift is a fast, petabyte-scale cloud data warehouse. AWS customers are moving huge amounts of structured data into Amazon Redshift to offload analytics workloads or to operate their data warehouse (DWH) fully in the cloud. Business intelligence and analytic teams […]

Building a Binary Classification Model with Amazon Machine Learning and Amazon Redshift

Guy Ernest is a Solutions Architect with AWS. This post builds on Guy’s earlier posts, “Building a Numeric Regression Model with Amazon Machine Learning” and “Building a Multi-Class ML Model with Amazon Machine Learning.” Many decisions in life are binary, answered either Yes or No. Many business problems also have binary answers. For example: “Is […]

Test drive two big data scenarios from the ‘Building a Big Data Platform on AWS’ bootcamp

Matt Yanchyshyn is a Sr. Manager for AWS Solutions Architecture. AWS offers a number of events during the year, such as our annual AWS re:Invent conference, the AWS Summit series, the AWS Pop-up Loft, and a variety of roadshows. All of these provide opportunities for AWS customers to attend talks focused on big data and […]

Optimizing for Star Schemas and Interleaved Sorting on Amazon Redshift

Chris Keyser is a Solutions Architect for AWS. Many organizations implement star and snowflake schema data warehouse designs, and many BI tools are optimized to work with dimensions, facts, and measure groups. Customers have moved data warehouses of all types to Amazon Redshift with great success. The Amazon Redshift team has released support for interleaved […]