ReadWriteHack

Video: HBase Dos and Don'ts

Cloudera VP Customer Solutions, Omer Trajman, presented a talk on HBase Dos and Don'ts to the Los Angeles Hadoop Users Group earlier this month.

The full video is available after the jump.

LA-HUG HBASE DO's and DON'Ts from Shopzilla on Vimeo.

At glance, the dos and don'ts are:

Do:


  • Use a key prefix that distributes well

  • Keep the number of regions reasonable - about 100 per node.

  • Disable auto-compaction

  • Use compression

  • Explicitly put hbase-site.xml in your CLASSPATH

  • Monitor the health of your cluster

  • Store multiple copies for different access patterns

Don't:


  • Wholesale replacement of every RDBMS

  • Run huge MR Jobs directly off of HBase

  • Use timestamps as teh first part of your key

  • Allocate all CPUs to your TaskTrackers

  • Mixed workloads with SLAs

  • Use a single client bulk load or bulk load with put

  • Let the Region Server Swap


ReadWriteWeb encourages comments, but please remember: Keep it nice, keep it clean, and avoid promotional comments. We do pre-moderate some comments with links. For more information, please read our full comment policy.
blog comments powered by Disqus
Recommended Story
RWW SPONSORS



RWW PARTNERS