Title | Programming Hive |
---|---|
Sub Title | Data Warehouse and Query Language for Hadoop |
Author | Dean Wampler Edward Capriolo Jason Rutherglen |
Category | Computer & Programming |
Language | English |
Region | |
Tags | Big Data Data Mining |
ISBN | 978-1-4493-1933-5 |
Year | 2012 |
Format | |
Pages | 350 |
File Size | 3.8 MB |
Total Download | 483 |
Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop’s data warehouse infrastructure. You’ll quickly learn how to use Hive’s SQL dialect – HiveQL – to summarize, query, and analyze large datasets stored in Hadoop’s distributed filesystem. This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that describe how companies have used Hive to solve unique problems involving petabytes of data.