Book Details:
Publisher: | O'Reilly Media |
Series: |
OReilly
|
Author: | Eric Sammer |
Edition: | 1 |
ISBN-10: | 1449327052 |
ISBN-13: | 9781449327057 |
Pages: | 298 |
Published: | Oct 16 2012 |
Posted: | Nov 19 2014 |
Language: | English |
Book format: | PDF |
Book size: | 7.38 MB |
Book Description:
If you've been asked to maintain large and complex Hadoop clusters, this book is a must. Demand for operations-specific material has skyrocketed now that Hadoop is becoming the de facto standard for truly large-scale data processing in the data center. Eric Sammer, Principal Solution Architect at Cloudera, shows you the particulars of running Hadoop in production, from planning, installing, and configuring the system to providing ongoing maintenance.Rather than run through all possible scenarios, this pragmatic operations guide calls out what works, as demonstrated in critical deployments.Get a high-level overview of HDFS and MapReduce: why they exist and how they work Plan a Hadoop deployment, from hardware and OS selection to network requirements Learn setup and configuration details with a list of critical properties Manage resources by sharing a cluster across multiple groups Get a runbook of the most common cluster maintenance tasks Monitor Hadoop clusters--and learn troubleshooting with the help of real-world war stories Use basic tools and techniques to handle backup and catastrophic failure
Over 60 recipes showing you how to design, configure, manage, monitor, and tune a Hadoop cluster Overview Hands-on recipes to configure a Hadoop cluster from bare metal hardware nodes Practical and in depth explanation of cluster management commands Easy-to-understand recipes for securing and monitoring a Hadoop cluster, and design considerations Recipes showing you how to tune the performance of a Hadoop cluster Learn how to build a Hadoop cluster in the cloud In Detail We are facing an avalanche of data. The unstructured data we gather can contain many insights that could hold the key to business success or failure. Harnessing the ability to analyze and process this data with Hadoop is one of the most highly sought after skills in today's job ma...
Moving beyond MapReduce and Batch Processing with Apache Hadoop 2
'This book is a critically needed resource for the newly released Apache Hadoop 2.0, highlighting YARN as the significant breakthrough that broadens Hadoop beyond the MapReduce paradigm.' -From the Foreword by Raymie Stata, CEO of Altiscale The Insider's Guide to Building Distributed, Big Data Applications with Apache Hadoop YARN Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to fully leverage these revol...
7th Edition
It is now a third of a century since the 1967 publication of the first edition of the pathbreaking Introduction to Operations Research, when the field was still relatively new. A great deal has changed since then in regard to both developments in the field and evolving pedagogical demands of students. The seventh edition, in both regards, brings the book fully into the twenty-first century.This new package contains version 2.0 of the CD-ROM, in which all of the software has been updated....
2007 - 2021 © eBooks-IT.org