AWS Big Data Blog
Tag: Apache HBase
Enhancing data durability in Amazon EMR HBase on Amazon S3 with the Amazon EMR WAL feature
In this post, we dive deep into the new Amazon EMR WAL feature to help you understand how it works, how it enhances durability, and why it’s needed. We explore several scenarios that are well-suited for this feature.
Implement Amazon EMR HBase Graceful Scaling
Apache HBase is a massively scalable, distributed big data store in the Apache Hadoop ecosystem. We can use Amazon EMR with HBase on top of Amazon Simple Storage Service (Amazon S3) for random, strictly consistent real-time access for tables with Apache Kylin. This post demonstrates how to gracefully decommission target region servers programmatically.
Setting up Read Replica Clusters with HBase on Amazon S3
Many customers have taken advantage of the numerous benefits of running Apache HBase on Amazon S3 for data storage, including lower costs, data durability, and easier scalability. Customers such as FINRA have lowered their costs by 60% by moving to an HBase on S3 architecture along with the numerous operational benefits that come with decoupling […]
Tips for Migrating to Apache HBase on Amazon S3 from HDFS
Starting with Amazon EMR 5.2.0, you have the option to run Apache HBase on Amazon S3. Running HBase on S3 gives you several added benefits, including lower costs, data durability, and easier scalability. HBase provides several options that you can use to migrate and back up HBase tables. The steps to migrate to HBase on […]
Low-Latency Access on Trillions of Records: FINRA’s Architecture Using Apache HBase on Amazon EMR with Amazon S3
John Hitchingham is Director of Performance Engineering at FINRA. The Financial Industry Regulatory Authority (FINRA) is a private sector regulator responsible for analyzing 99% of the equities and 65% of the option activity in the US. In order to look for fraud, market manipulation, insider trading, and abuse, FINRA’s technology group has developed a robust […]
Supercharge SQL on Your Data in Apache HBase with Apache Phoenix
With today’s launch of Amazon EMR release 4.7, you can now create clusters with Apache Phoenix 4.7.0 for low-latency SQL and OLTP workloads. Phoenix uses Apache HBase as its backing store (HBase 1.2.1 is included on Amazon EMR release 4.7.0), using HBase scan operations and coprocessors for fast performance. Additionally, you can map Phoenix tables […]