AWS Big Data Blog

Amazon Redshift UDF repository on AWSLabs

Christopher Crosbie is a Healthcare and Life Science Solutions Architect with Amazon Web Services

Zach Christopherson, an Amazon Redshift Database Engineer, contributed to this post

Have you ever needed complex string parsing in Amazon Redshift and wished you could simply add f_parse_url_query_string(url) to your SQL query? Have you ever weighed which would be less effort: writing a complicated regex in SQL to parse a query string, or pulling the data into Python just to use packages like urlparse? When was the last time you were developing a report and wished there were an easy way to get the next business day from a query result, or even the next business day according to your own company’s calendar?

These scenarios represent just a few of the Python UDFs that AWS has released as part of the initial AWS Labs Amazon Redshift UDF repository: parsing, date functions, and more! You are no longer constrained to the world of SQL within your Amazon Redshift data warehouse. Python UDFs allow you to extend Amazon Redshift SQL with Python’s rich library of packages.
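
As a rough illustration of the kind of logic such UDFs wrap, here is a minimal Python sketch of the two scenarios above. It is not the repository's code: the function names parse_url_query_string and next_business_day, the JSON return format, and the holidays parameter are all assumptions made for this example. It uses Python 3's urllib.parse so that it runs standalone; urlparse is the Python 2 name for the same functionality.

```python
import json
from datetime import date, timedelta
from urllib.parse import urlparse, parse_qsl


def parse_url_query_string(url):
    """Return the query-string parameters of `url` as a JSON object string.

    Hypothetical helper for illustration; the return format is an assumption.
    """
    if url is None:
        return None
    query = urlparse(url).query
    # parse_qsl yields (key, value) pairs; if a key repeats, the last value wins.
    return json.dumps(dict(parse_qsl(query)), sort_keys=True)


def next_business_day(d, holidays=frozenset()):
    """Return the first day after `d` that is neither a weekend nor a holiday.

    `holidays` is a hypothetical set of datetime.date values standing in
    for a company-specific calendar.
    """
    candidate = d + timedelta(days=1)
    # weekday() returns 5 for Saturday and 6 for Sunday.
    while candidate.weekday() >= 5 or candidate in holidays:
        candidate += timedelta(days=1)
    return candidate


if __name__ == "__main__":
    print(parse_url_query_string("http://example.com/path?a=1&b=two"))
    print(next_business_day(date(2015, 9, 4)))
    print(next_business_day(date(2015, 9, 4), holidays={date(2015, 9, 7)}))
```

In Amazon Redshift, a function body along these lines would be registered with CREATE FUNCTION ... LANGUAGE plpythonu so that it can be called directly from SQL.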

We encourage all Python UDF and Amazon Redshift developers to take a peek at what’s available today. We also encourage you to submit your own pull requests to show off what you can do with Amazon Redshift and Python UDF capabilities.

If you have questions or suggestions, please leave a comment below.


——————————————–

Related

Introduction to Python UDFs in Amazon Redshift