Known Use Cases
OpenTSDB
- The Scalable Time Series Database runs on the top of HBase used for storing massive amount of time series data 
 
- StumbleUpon - Uses HBase to make the best recommendations, since it has to manage a lot of user signals. Every thumb-up, stumble, and share (among other feedback from users) is stored. 
 
- Mozilla - For managing crash reports etc. 
 
- Adobe - Using HBase in several areas from social services to structured data and processing for internal use. 
 
- Explorys - uses an HBase cluster containing over a billion anonymized clinical records, to enable subscribers to search and analyse patient populations, treatment protocols, and clinical outcomes. 
- Facebook - uses HBase to power their Messages infrastructure. 
- Openlogic - stores all the world ’s Open Source packages, versions, and analytical purposes. files, and lines of code in HBase for both near-real-time access 
- Openplaces - is a search engine for travel that uses HBase to store terabytes of web pages and travel-related entity records. 
- Powerset (a Microsoft company) - uses HBase to store raw documents. - Shopping Engine at Tokenizer is a web crawler; it uses HBase to store URLs and Outlinks (more than a billion). 
- Tracker - uses HBase to store and serve online influencer data in real—time. They use MapReduce to frequently re-score their entire data set as they keep updating inf luencer metrics on a daily basis. 
- Twitter - runs HBase across its entire Hadoop cluster. HBase provides a distributed, read/write backup of all mysql tables in Twitter's production backend, allowing engineers to run MapReduce jobs over the data while maintaining the ability to apply periodic row updates 
- WorldLingo - The WorldLingo Multilingual Archive use HBase to store millions of documents that they scan using Map/Reduce jobs to machine translate them into all or selected target languages from their set of available machine translation languages. 
 
Last updated
Was this helpful?