eBooks-it.org Logo
eBooks-IT.org Inner Image

Hadoop

The Definitive Guide

4th Edition
Hadoop Image

Book Details:

Publisher:O'Reilly Media
Series: OReilly , The Definitive Guide
Author:Tom White
Edition:4
ISBN-10:1491901632
ISBN-13:9781491901632
Pages:728
Published:Apr 10 2015
Posted:Apr 04 2015
Language:English
Book format:PDF
Book size:7.04 MB

Book Description:

Ready to unlock the power of your data? With the fourth edition of this comprehensive guide, you';ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.You';ll find illuminating case studies that demonstrate how Hadoop is used to solve specific problems. This edition includes new case studies, updates on Hadoop 2, a refreshed HBase chapter, and new chapters on Crunch and Flume. Author Tom White also suggests learning paths for the book.Store large datasets with the Hadoop Distributed File System (HDFS)Run distributed computations with MapReduceUse Hadoop';s data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistenceDiscover common pitfalls and advanced features for writing real-world MapReduce programsDesign, build, and administer a dedicated Hadoop cluster-or run Hadoop in the cloudLoad data from relational databases into HDFS, using SqoopPerform large-scale data processing with the Pig query languageAnalyze datasets with Hive, Hadoop';s data warehousing systemTake advantage of HBase for structured and semi-structured data, and ZooKeeper for building distributed systems

Download Link:

Related Books:

Apache Hadoop YARN

Moving beyond MapReduce and Batch Processing with Apache Hadoop 2
Apache Hadoop YARN Image
'This book is a critically needed resource for the newly released Apache Hadoop 2.0, highlighting YARN as the significant breakthrough that broadens Hadoop beyond the MapReduce paradigm.' -From the Foreword by Raymie Stata, CEO of Altiscale The Insider's Guide to Building Distributed, Big Data Applications with Apache Hadoop YARN Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to fully leverage these revol...

Hadoop

The Definitive Guide
Hadoop Image
3rd Edition
Ready to unlock the power of your data? With this comprehensive guide, you'll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters. You'll find illuminating case studies that demonstrate how Hadoop is used to solve specific problems. This third edition covers recent changes to Hadoop, including material on the new MapReduce API, as well as MapReduce 2 and its more flexible execution model (YARN).Store large datasets with the Hadoop Distributed File System (HDFS) Run distribu...

Hadoop

The Definitive Guide
Hadoop Image
Hadoop: The Definitive Guide helps you harness the power of your data. Ideal for processing large datasets, the Apache Hadoop framework is an open source implementation of the MapReduce algorithm on which Google built its empire. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems: programmers will find details for analyzing large datasets, and administrators will learn how to set up and run Hadoop clusters. Complete with case studies that illustrate how Hadoop solves specific problems, this book helps you:Use the Hadoop Distributed File System (HDFS) for storing large datasets, and run distributed computations over those datasets using MapReduce Become familiar with Hadoop's data and I/O buildi...

Hadoop MapReduce Cookbook

Hadoop MapReduce Cookbook Image
Recipes for analyzing large and complex datasets with Hadoop MapReduce Overview Learn to process large and complex data sets, starting simply, then diving in deep Solve complex big data problems such as classifications, finding relationships, online marketing and recommendations. More than 50 Hadoop MapReduce recipes, presented in a simple and straightforward manner, with step-by-step instructions and real world examples. In Detail We are facing an avalanche of data. The unstructured data we gather can contain many insights that might hold the key to business success or failure. Harnessing the ability to analyze and process this data with Hadoop MapReduce is one of the most highly sought after skills in today's job market. "Hadoop MapReduce Co...



2007 - 2021 © eBooks-IT.org