eBooks-it.org Logo
eBooks-IT.org Inner Image

Hadoop Cluster Deployment

Hadoop Cluster Deployment Image

Book Details:

Publisher:Packt Publishing
Series: Packt
Author:Danil Zburivsky
Edition:1
ISBN-10:1783281715
ISBN-13:9781783281718
Pages:126
Published:Nov 25 2013
Posted:Nov 19 2014
Language:English
Book format:PDF
Book size:4.29 MB

Book Description:

Construct a modern Hadoop data platform effortlessly and gain insights into how to manage clusters efficiently Overview Choose the hardware and Hadoop distribution that best suits your needs Get more value out of your Hadoop cluster with Hive, Impala, and Sqoop Learn useful tips for performance optimization and security In Detail Big Data is the hottest trend in the IT industry at the moment. Companies are realizing the value of collecting, retaining, and analyzing as much data as possible. They are therefore rushing to implement the next generation of data platform, and Hadoop is the centerpiece of these platforms. This practical guide is filled with examples which will show you how to successfully build a data platform using Hadoop. Step-by-step instructions will explain how to install, configure, and tie all major Hadoop components together. This book will allow you to avoid common pitfalls, follow best practices, and go beyond the basics when building a Hadoop cluster. This book will walk you through the process of building a Hadoop cluster from the ground up. By using practical examples and command samples, you will be able to get a cluster up and running in no time, and you will also gain a deep understanding of how various Hadoop components work and interact with each other. You will learn how to pick the right hardware for different types of Hadoop clusters and about the differences between various Hadoop distributions. By the end of this book, you will be able to install and configure several of the most popular Hadoop ecosystem projects including Hive, Impala, and Sqoop, and you will also be given a sneak peek into the pros and cons of using Hadoop in the cloud. What you will learn from this book Choose the optimal hardware configuration for your Hadoop cluster Decipher the differences between various Hadoop versions and distributions Make your cluster crash-proof with Namenode High Availability Learn tips and tricks for Jobtracker, Tasktracker, and Datanodes Discover the most important Hadoop ecosystem projects Get more value out of your cluster by using SQL with Hive and real-time query processing with Impala Set up a proper permissions model for your cluster Secure Hadoop with Kerberos Deploy a Hadoop cluster in a cloud environment Approach This book is a step-by-step tutorial filled with practical examples which will show you how to build and manage a Hadoop cluster along with its intricacies. Who this book is written for This book is ideal for database administrators, data engineers, and system administrators, and it will act as an invaluable reference if you are planning to use the Hadoop platform in your organization. It is expected that you have basic Linux skills since all the examples in this book use this operating system. It is also useful if you have access to test hardware or virtual machines to be able to follow the examples in the book.

Download Link:

Related Books:

Hadoop Operations and Cluster Management Cookbook

Hadoop Operations and Cluster Management Cookbook Image
Over 60 recipes showing you how to design, configure, manage, monitor, and tune a Hadoop cluster Overview Hands-on recipes to configure a Hadoop cluster from bare metal hardware nodes Practical and in depth explanation of cluster management commands Easy-to-understand recipes for securing and monitoring a Hadoop cluster, and design considerations Recipes showing you how to tune the performance of a Hadoop cluster Learn how to build a Hadoop cluster in the cloud In Detail We are facing an avalanche of data. The unstructured data we gather can contain many insights that could hold the key to business success or failure. Harnessing the ability to analyze and process this data with Hadoop is one of the most highly sought after skills in today's job ma...

Linux Enterprise Cluster

Build a Highly Available Cluster with Commodity Hardware and Free Software
Linux Enterprise Cluster Image
The Linux Enterprise Cluster explains how to take a number of inexpensive computers with limited resources, place them on a normal computer network, and install free software so that the computers act together like one powerful server. This makes it possible to build a very inexpensive and reliable business system for a small business or a large corporation. The book includes information on how to build a high-availability server pair using the Heartbeat package, how to use the Linux Virtual Server load balancing software, how to configure a reliable printing system in a Linux cluster environment, and how to build a job scheduling system in Linux with no single point of failure.The book also includes information on high availability techniques that c...

Apache Hadoop YARN

Moving beyond MapReduce and Batch Processing with Apache Hadoop 2
Apache Hadoop YARN Image
'This book is a critically needed resource for the newly released Apache Hadoop 2.0, highlighting YARN as the significant breakthrough that broadens Hadoop beyond the MapReduce paradigm.' -From the Foreword by Raymie Stata, CEO of Altiscale The Insider's Guide to Building Distributed, Big Data Applications with Apache Hadoop YARN Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to fully leverage these revol...



2007 - 2021 © eBooks-IT.org