eBooks-it.org Logo
eBooks-IT.org Inner Image

Learning Hadoop 2

Learning Hadoop 2 Image

Book Details:

Publisher:Packt Publishing
Series: Packt , Learning
Author:Garry Turkington
Edition:1
ISBN-10:1783285516
ISBN-13:9781783285518
Pages:316
Published:Feb 13 2015
Posted:Nov 19 2014
Language:English
Book format:PDF
Book size:2.45 MB

Book Description:

Design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2 About This BookConstruct state-of-the-art applications using higher-level interfaces and tools beyond the traditional MapReduce approachUse the unique features of Hadoop 2 to model and analyze Twitter's global stream of user generated dataDevelop a prototype on a local cluster and deploy to the cloud (Amazon Web Services)Who This Book Is ForIf you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. You are expected to be familiar with the Unix/Linux command-line interface and have some experience with the Java programming language. Familiarity with Hadoop would be a plus. In Detail This book introduces you to the world of building data-processing applications with the wide variety of tools supported by Hadoop 2. Starting with the core components of the frameworkHDFS and YARNthis book will guide you through how to build applications using a variety of approaches.You will learn how YARN completely changes the relationship between MapReduce and Hadoop and allows the latter to support more varied processing approaches and a broader array of applications. These include real-time processing with Apache Samza and iterative computation with Apache Spark. Next up, we discuss Apache Pig and the dataflow data model it provides. You will discover how to use Pig to analyze a Twitter dataset.With this book, you will be able to make your life easier by using tools such as Apache Hive, Apache Oozie, Hadoop Streaming, Apache Crunch, and Kite SDK. The last part of this book discusses the likely future direction of major Hadoop components and how to get involved with the Hadoop community.

Download Link:

Related Books:

Apache Hadoop YARN

Moving beyond MapReduce and Batch Processing with Apache Hadoop 2
Apache Hadoop YARN Image
'This book is a critically needed resource for the newly released Apache Hadoop 2.0, highlighting YARN as the significant breakthrough that broadens Hadoop beyond the MapReduce paradigm.' -From the Foreword by Raymie Stata, CEO of Altiscale The Insider's Guide to Building Distributed, Big Data Applications with Apache Hadoop YARN Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to fully leverage these revol...

Big Data Forensics Learning Hadoop Investigations

Big Data Forensics Learning Hadoop Investigations Image
Perform forensic investigations on Hadoop clusters with cutting-edge tools and techniques About This Book * Identify, collect, and analyze Hadoop evidence forensically * Learn about Hadoop's internals and Big Data file storage concepts * A step-by-step guide to help you perform forensic analysis using freely available tools Who This Book Is For This book is meant for statisticians and forensic analysts with basic knowledge of digital forensics. They do not need to know Big Data Forensics. If you are an IT professional, law enforcement professional, legal professional, or a student interested in Big Data and forensics, this book is the perfect hands-on guide for learning how to conduct Hadoop forensic investigations. Each topic and step in the for...

Learning UML 2.0

Learning UML 2.0 Image
"Since its original introduction in 1997, the Unified Modeling Language has revolutionized software development. Every integrated software development environment in the world--open-source, standards-based, and proprietary--now supports UML and, more importantly, the model-driven approach to software development. This makes learning the newest UML standard, UML 2.0, critical for all software developers--and there isn't a better choice than this clear, step-by-step guide to learning the language."--Richard Mark Soley, Chairman and CEO, OMGIf you're like most software developers, you're building systems that are increasingly complex. Whether you're creating a desktop application or an enterprise system, complexity is the big hairy monster you...



2007 - 2021 © eBooks-IT.org