4/24/12
BigTable/HBase • A distributed, 3-‐D table data structure – time as the third dimension (versioning)
• Rows sorted based on a primary key • Supports – updates – random reads – real-‐time querying
4/24/12
Eleni Stroulia, CS, UoA (Analy7cs, Big Data, and the Cloud)
13
HBase Tables
• Sorted by RowKey • Table has one or more “column families”. • A column family is – A group of column qualiZiers (deZined at run time) – Stored as one Zile in HDFS
• Sparse tables are supported • Timestamp: 3rd dimension • A cell is identiZied by Table:Rowkey:CF:CQ:timestamp 4/24/12
Eleni Stroulia, CS, UoA (Analy7cs, Big Data, and the Cloud)
15
7