AWS Big Data Blog

Category: HAQM DataZone

How Volkswagen streamlined access to data across multiple data lakes using HAQM DataZone – Part 1

This blog post introduces HAQM DataZone and explores how VW used it to build their data mesh to enable streamlined data access across multiple data lakes. It focuses on the key aspect of the solution, which was enabling data providers to automatically publish data assets to HAQM DataZone, which served as the central data mesh for enhanced data discoverability. Additionally, the post provides code to guide you through the implementation.

HAQM DataZone introduces OpenLineage-compatible data lineage visualization in preview

We are excited to announce the preview of API-driven, OpenLineage-compatible data lineage in HAQM DataZone to help you capture, store, and visualize lineage of data movement and transformations of data assets on HAQM DataZone. With the HAQM DataZone OpenLineage-compatible API, domain administrators and data producers can capture and store lineage events beyond what is available […]

Enhance data security with fine-grained access controls in HAQM DataZone

Fine-grained access control is a crucial aspect of data security for modern data lakes and data warehouses. As organizations handle vast amounts of data across multiple data sources, the need to manage sensitive information has become increasingly important. Making sure the right people have access to the right data, without exposing sensitive information to unauthorized […]

HAQM DataZone enhances data discovery with advanced search filtering

HAQM DataZone, a fully managed data management service, helps organizations catalog, discover, analyze, share, and govern data between data producers and consumers. We are excited to announce the introduction of advanced search filtering capabilities in the HAQM DataZone business data catalog. With the improved rendering of glossary terms, you can now navigate large sets of […]

HAQM DataZone announces custom blueprints for AWS services

Last week, we announced the general availability of custom AWS service blueprints, a new feature in HAQM DataZone allowing you to customize your HAQM DataZone project environments to use existing AWS Identity and Access Management (IAM) roles and AWS services to embed the service into your existing processes. In this post, we share how this […]

Governing data in relational databases using HAQM DataZone

Data governance is a key enabler for teams adopting a data-driven culture and operational model to drive innovation with data. HAQM DataZone is a fully managed data management service that makes it faster and easier for customers to catalog, discover, share, and govern data stored across HAQM Web Services (AWS), on premises, and on third-party […]

HAQM DataZone announces integration with AWS Lake Formation hybrid access mode for the AWS Glue Data Catalog

Last week, we announced the general availability of the integration between HAQM DataZone and AWS Lake Formation hybrid access mode. In this post, we share how this new feature helps you simplify the way you use HAQM DataZone to enable secure and governed sharing of your data in the AWS Glue Data Catalog. We also […]

HAQM DataZone now integrates with AWS Glue Data Quality and external data quality solutions

Today, we are pleased to announce that HAQM DataZone is now able to present data quality information for data assets. This information empowers end-users to make informed decisions as to whether or not to use specific assets. In this post, we discuss the latest features of HAQM DataZone for data quality, the integration between HAQM DataZone and AWS Glue Data Quality and how you can import data quality scores produced by external systems into HAQM DataZone via API.

AI recommendations for descriptions in HAQM DataZone for enhanced business data cataloging and discovery is now generally available

In March 2024, we announced the general availability of the generative artificial intelligence (AI) generated data descriptions in HAQM DataZone. In this post, we share what we heard from our customers that led us to add the AI-generated data descriptions and discuss specific customer use cases addressed by this capability. We also detail how the […]

Unstructured Data Management - AWS Native Architecture

Unstructured data management and governance using AWS AI/ML and analytics services

In this post, we discuss how AWS can help you successfully address the challenges of extracting insights from unstructured data. We discuss various design patterns and architectures for extracting and cataloging valuable insights from unstructured data using AWS. Additionally, we show how to use AWS AI/ML services for analyzing unstructured data.