Eleni Srtoulia -Migrating to the Hadoop Ecosystem.ppt

Page 7

4/24/12

BigTable/HBase •  A distributed, 3-­‐D table data structure –  time as the third dimension (versioning)

•  Rows sorted based on a primary key •  Supports –  updates –  random reads –  real-­‐time querying

4/24/12

Eleni Stroulia, CS, UoA (Analy7cs, Big Data, and the Cloud)

13

HBase Tables

•  Sorted by RowKey •  Table has one or more “column families”. •  A column family is –  A group of column qualiZiers (deZined at run time) –  Stored as one Zile in HDFS

•  Sparse tables are supported •  Timestamp: 3rd dimension •  A cell is identiZied by Table:Rowkey:CF:CQ:timestamp 4/24/12

Eleni Stroulia, CS, UoA (Analy7cs, Big Data, and the Cloud)

15

7


Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.