AWS Big Data Blog

Category: HAQM SageMaker

Streamline data discovery with precise technical identifier search in HAQM SageMaker Unified Studio

We’re excited to introduce a new enhancement to the search experience in HAQM SageMaker Catalog, part of the next generation of HAQM SageMaker—exact match search using technical identifiers. In this post, we demonstrate how to streamline data discovery with precise technical identifier search in HAQM SageMaker Unified Studio.

Connect, share, and query where your data sits using HAQM SageMaker Unified Studio

In this blog post, we will demonstrate how business units can use HAQM SageMaker Unified Studio to discover, subscribe to, and analyze these distributed data assets. Through this unified query capability, you can create comprehensive insights into customer transaction patterns and purchase behavior for active products without the traditional barriers of data silos or the need to copy data between systems.

foundational planes

Foundational blocks of HAQM SageMaker Unified Studio: An admin’s guide to implement unified access to all your data, analytics, and AI

In this post, we discuss the foundational building blocks of SageMaker Unified Studio and how, by abstracting complex technical implementations behind user-friendly interfaces, organizations can maintain standardized governance while enabling efficient resource management across business units. This approach provides consistency in infrastructure deployment while providing the flexibility needed for diverse business requirements.

Use DeepSeek with HAQM OpenSearch Service vector database and HAQM SageMaker

OpenSearch Service provides rich capabilities for RAG use cases, as well as vector embedding-powered semantic search. You can use the flexible connector framework and search flow pipelines in OpenSearch to connect to models hosted by DeepSeek, Cohere, and OpenAI, as well as models hosted on HAQM Bedrock and SageMaker. In this post, we build a connection to DeepSeek’s text generation model, supporting a RAG workflow to generate text responses to user queries.

How EUROGATE established a data mesh architecture using HAQM DataZone

In this post, we show you how EUROGATE uses AWS services, including HAQM DataZone, to make data discoverable by data consumers across different business units so that they can innovate faster. Two use cases illustrate how this can be applied for business intelligence (BI) and data science applications, using AWS services such as HAQM Redshift and HAQM SageMaker.

Introducing a new unified data connection experience with HAQM SageMaker Lakehouse unified data connectivity

With HAQM SageMaker Lakehouse unified data connectivity, you can confidently connect, explore, and unlock the full value of your data across AWS services and achieve your business objectives with agility. This post demonstrates how SageMaker Lakehouse unified data connectivity helps your data integration workload by streamlining the establishment and management of connections for various data sources.

An integrated experience for all your data and AI with HAQM SageMaker Unified Studio

HAQM SageMaker Unified Studio is an integrated development environment (IDE) for data, analytics, and AI. Discover your data and put it to work using familiar AWS tools to complete end-to-end development workflows, including data analysis, data processing, model training, generative AI app building, and more, in a single governed environment. This post demonstrates how SageMaker Unified Studio unifies your analytic workloads.

Simplify data access for your enterprise using HAQM SageMaker Lakehouse

HAQM SageMaker Lakehouse offers a unified solution for enterprise data access, combining data from warehouses and lakes. This post demonstrates how SageMaker Lakehouse integrates scattered data sources, enabling secure enterprise-wide access, and allowing teams to use their preferred tools for predicting and analyzing customer churn. The solution involves multiple data sources, including HAQM S3, HAQM Redshift, and AWS Glue Data Catalog, with AWS Lake Formation managing permissions.

Author visual ETL flows on HAQM SageMaker Unified Studio

HAQM SageMaker Unified Studio (preview) provides an integrated data and AI development environment within HAQM SageMaker. This post shows how you can build a low-code and no-code (LCNC) visual ETL flow that enables seamless data ingestion and transformation across multiple data sources.