DZone
Big Data Zone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
  • Refcardz
  • Trend Reports
  • Webinars
  • Zones
  • |
    • Agile
    • AI
    • Big Data
    • Cloud
    • Database
    • DevOps
    • Integration
    • IoT
    • Java
    • Microservices
    • Open Source
    • Performance
    • Security
    • Web Dev
DZone > Big Data Zone > New Study: Evaluating Apache Hbase Performance on Modern Storage Media

New Study: Evaluating Apache Hbase Performance on Modern Storage Media

For the first time, this new study by Intel software engineers analyzes the performance impact of using Apache HBase on various modern storage technologies.

Justin Kestelyn user avatar by
Justin Kestelyn
·
Jun. 05, 16 · Big Data Zone · Opinion
Like (1)
Save
Tweet
4.01K Views

Join the DZone community and get the full member experience.

Join For Free

As more “fast” storage technologies (such as SSD and NVMe SSD) emerge, organizations with big data use cases want to make better use of them to achieve better throughput and latency. But to this point, there have been no detailed analyses published about the true significance of that performance boost, nor about how to best mix fast and “slow” storage to achieve the best balance between performance and cost.

Recently, software engineers in Intel’s Software Solution Group group did a detailed study of Apache HBase write performance on different storage media. (Results were originally published via the ASF Blog.) In the study, we used the hierarchy storage management support in HDFS, using YCSB as the benchmark, to store different categories of HBase data on three different storage types: HDD, SSD, and RAMDISK. (HDD is the most popular storage in current use, SATA SSD is faster storage that is growing in popularity, and RAMDISK was used to emulate extremely high-performance PCIe SSDs.)

In general, this study tries to answer following questions:

  • What is the maximum performance a user can achieve by using fast storage?
  • Where are the bottlenecks?
  • What is the best balance between performance and cost, and how can it be achieved?
  • How can the performance of a cluster with different storage combinations be predicted?

We believe that this study provides the first comprehensive and objective analysis of HBase performance on fast storage technology.

Download the full report here.

Jingcheng Du and Wei Zhou are Software Engineers at Intel, and HBase contributors.

Apache HBase Media (communication)

Published at DZone with permission of Justin Kestelyn. See the original article here.

Opinions expressed by DZone contributors are their own.

Popular on DZone

  • Basic Convolutional Neural Network Architectures
  • Servlets Listeners Introduction and Examples
  • Testing Schema Registry: Spring Boot and Apache Kafka With JSON Schema
  • Create a Self-Service Customer Support Chatbot Without Code

Comments

Big Data Partner Resources

X

ABOUT US

  • About DZone
  • Send feedback
  • Careers
  • Sitemap

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • MVB Program
  • Become a Contributor
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 600 Park Offices Drive
  • Suite 300
  • Durham, NC 27709
  • support@dzone.com
  • +1 (919) 678-0300

Let's be friends:

DZone.com is powered by 

AnswerHub logo