Posted On: May 15, 2020

HAQM Redshift now leverages Bloom filters to enable early and effective data filtering for up to 2x faster query performance on external tables in HAQM S3. A Bloom filter is a probabilistic, memory-efficient data structure that accelerates join queries at scale by filtering rows that do not match the join relation, significantly reducing the amount of data transferred over the network. HAQM Redshift automatically determines what queries are suitable for leveraging Bloom filters at query runtime. You can power a lake house architecture with HAQM Redshift Spectrum to directly query and join data across your data warehouse and data lake, enabling you to gain unique insights not possible otherwise.  

HAQM Redshift Bloom filters are enabled by default with release version 1.0.15503 or later in the AWS regions where HAQM Redshift Spectrum is available. For more information visit the documentation. Refer to the AWS Region Table for HAQM Redshift availability.