Movatterモバイル変換

[0]ホーム

Jump to content

Apache HBase

Edit links

From Wikipedia, the free encyclopedia

(Redirected fromHBase)

Open-source distributed database

Apache HBase

Original author(s)

Powerset

Developer(s)

Apache Software Foundation

Initial release

28 March 2008; 16 years ago (2008-03-28)

Stable release

2.4.x	2.4.14 / 29 August 2022; 2 years ago (2022-08-29)^[1]
2.5.x	2.5.3 / 5 February 2023; 2 years ago (2023-02-05)^[1]

Preview release

3.0.0-alpha-3 / 27 June 2022; 2 years ago (2022-06-27)^[1]

Repository

GitHub Repository,Gitbox Repository

Written in

Website

HBase is anopen-source non-relational distributed database modeled afterGoogle's Bigtable and written inJava. It is developed as part ofApache Software Foundation'sApache Hadoop project and runs on top ofHDFS (Hadoop Distributed File System) orAlluxio, providing Bigtable-like capabilities for Hadoop. That is, it provides afault-tolerant way of storing large quantities ofsparse data (small amounts of information caught within a large collection of empty or unimportant data, such as finding the 50 largest items in a group of 2 billion records, or finding the non-zero items representing less than 0.1% of a huge collection).

HBase features compression, in-memory operation, andBloom filters on a per-column basis as outlined in the original Bigtable paper.^[2] Tables in HBase can serve as the input and output forMapReduce jobs run in Hadoop, and may be accessed through the Java API but also throughREST,Avro orThrift gateway APIs. HBase is awide-column store and has been widely adopted because of its lineage with Hadoop and HDFS. HBase runs on top of HDFS and is well-suited for fast read and write operations on large datasets with high throughput and low input/output latency.

HBase is not a direct replacement for a classicSQL database, howeverApache Phoenix project provides a SQL layer for HBase as well asJDBC driver that can be integrated with variousanalytics andbusiness intelligence applications. TheApache Trafodion project provides a SQL query engine withODBC andJDBC drivers anddistributed ACID transaction protection across multiple statements, tables and rows that use HBase as a storage engine.

HBase is now serving several data-driven websites^[3] butFacebook's Messaging Platform migrated from HBase toMyRocks in 2018.^[4]^[5] Unlike relational and traditional databases, HBase does not support SQL scripting; instead the equivalent is written in Java, employing similarity with a MapReduce application.

In the parlance of Eric Brewer'sCAP Theorem, HBase is a CP type system.^[6]

History

[edit]

Apache HBase began as a project by the companyPowerset out of a need to process massive amounts of data for the purposes ofnatural-language search. Since 2010 it is a top-level Apache project.

Facebook elected to implement its new messaging platform using HBase in November 2010, but migrated away from HBase in 2018.^[4]

The 2.4.x series is the current stable release line, it supersedes earlier release lines.

Use cases & production deployments

[edit]

Enterprises that use HBase

[edit]

The following is a list of notable enterprises that have used or are using HBase:

23andMe
Adobe
Airbnb uses HBase as part of its AirStream realtime stream computation framework^[7]
Alibaba Group
Amadeus IT Group, as its main long-term storage DB.
Bloomberg, for time series data storage
Facebook used HBase for its messaging platform between 2010 and 2018
Flipkart uses HBase for its search index^[8] and user insights.^[9]
Flurry
HubSpot
Imgur uses HBase to power its notifications system^[10]^[11]
Kakao^[12]
Netflix^[13]
Pinterest^[14]
Quicken Loans
Rocket Fuel
Salesforce.com^[15]
Sears
Sophos, for some of their back-end systems.
Spotify uses HBase as base for Hadoop and machine learning jobs.^[16]
Twitter
Tuenti uses HBase for its messaging platform.^[17]^[18]
Xiaomi
Yahoo!

References

[edit]

^^a ^b ^c"Apache HBase – Apache HBase Downloads". Retrieved27 September 2022.
^Chang, et al. (2006). Bigtable: A Distributed Storage System for Structured Data
^"Apache HBase – Powered By Apache HBase".hbase.apache.org. Retrieved8 April 2018.
^^a ^b"Migrating Messenger storage to optimize performance".www.facebook.com. 26 June 2018. Retrieved5 July 2018.
^Facebook: Why our 'next-gen' comms ditched MySQL Retrieved: 17 December 2010
^"Consistency Tradeoffs in Modern Distributed Database System Design"(PDF). February 2012. Retrieved23 October 2024.
^HBaseCon (2 August 2016)."Apache HBase at Airbnb".slideshare.net. Retrieved8 April 2018.
^"Near Real Time Search Indexing". 4 January 2018.
^"Is data locality always out of the box in Hadoop?". 10 March 2018.
^"Why Imgur Dropped MySQL in Favor of HBase - DZone Database".dzone.com. Retrieved8 April 2018.
^"Tech Tuesday: Imgur Notifications: From MySQL to HBase - The Imgur Blog".blog.imgur.com. Retrieved8 April 2018.
^Doyung Yoon."S2Graph : A Large-Scale Graph Database with HBase".
^Cheolsoo Park and Ashwin Shankar."Netflix: Integrating Spark at Petabyte Scale".
^Engineering, Pinterest (30 March 2018)."Improving HBase backup efficiency at Pinterest".Medium. Retrieved14 April 2020.{{cite web}}:|first= has generic name (help)
^"Hbase at Salesforce.com".
^Josh Baer."How Apache Drives Spotify's Music Recommendations".
^"Tuenti Group Chat: Simple, yet complex". Archived fromthe original on 24 November 2012. Retrieved29 September 2015.
^"Tuenti Asyncthrift".GitHub. 6 November 2013.

Bibliography

[edit]

Dimiduk, Nick; Khurana, Amandeep (28 November 2012).HBase in Action (1st ed.).Manning Publications. p. 350.ISBN 978-1617290527.
George, Lars (20 September 2011).HBase: The Definitive Guide (1st ed.).O'Reilly Media. p. 556.ISBN 978-1449396107.
Jiang, Yifeng (16 August 2012).HBase Administration Cookbook (1st ed.).Packt Publishing. p. 332.ISBN 978-1849517140.

External links

[edit]

Official Apache HBase homepage

v t e The Apache Software Foundation
Top-level projects	Accumulo ActiveMQ Airavata Airflow Allura Ambari Ant Aries Arrow Apache HTTP Server APR Avro Axis Axis2 Beam Bloodhound Brooklyn Calcite Camel CarbonData Cassandra Cayenne CloudStack Cocoon Cordova CouchDB cTAKES CXF Derby Directory Drill Druid Empire-db Felix Flex Flink Flume FreeMarker Geronimo Groovy Guacamole Gump Hadoop HBase Helix Hive Iceberg Ignite Impala Jackrabbit James Jena JMeter Kafka Kudu Kylin Lucene Mahout Maven MINA mod_perl MyFaces Mynewt NiFi NetBeans Nutch NuttX OFBiz Oozie OpenEJB OpenJPA OpenNLP OрenOffice ORC PDFBox Parquet Phoenix POI Pig Pinot Pivot Qpid Roller RocketMQ Samza Shiro SINGA Sling Solr Spark Storm SpamAssassin Struts 1 Subversion Superset SystemDS Tapestry Thrift Tika TinkerPop Tomcat Trafodion Traffic Server UIMA Velocity Wicket Xalan Xerces XMLBeans Yetus ZooKeeper
Commons	BCEL BSF Daemon Jelly Logging
Incubator	Taverna
Other projects	Batik FOP Ivy Log4j
Attic	Apex AxKit Beehive iBATIS Click Continuum Deltacloud Etch Giraph Hama Harmony Jakarta Marmotta MXNet ODE River Shale Slide Sqoop Stanbol Tuscany Wave XML
Licenses	Apache License
Category