Distributed Log Collection for Hadoop
Book Details:
Publisher: | Packt Publishing |
Series: |
Packt
|
Author: | Steve Hoffman |
Edition: | 2 |
ISBN-10: | 1784392170 |
ISBN-13: | 9781784392178 |
Pages: | 175 |
Published: | Feb 25 2015 |
Posted: | Nov 19 2014 |
Language: | English |
Book format: | PDF |
Book size: | 1.82 MB |
Book Description:
Design and implement a series of Flume agents to send streamed data into Hadoop About This BookConstruct a series of Flume agents using the Apache Flume service to efficiently collect, aggregate, and move large amounts of event dataConfigure failover paths and load balancing to remove single points of failureUse this step-by-step guide to stream logs from application servers to Hadoop's HDFSWho This Book Is ForIf you are a Hadoop programmer who wants to learn about Flume to be able to move datasets into Hadoop in a timely and replicable manner, then this book is ideal for you. No prior knowledge about Apache Flume is necessary, but a basic knowledge of Hadoop and the Hadoop File System (HDFS) is assumed. In Detail Apache Flume is a distributed, reliable, and available service used to efficiently collect, aggregate, and move large amounts of log data. It is used to stream logs from application servers to HDFS for ad hoc analysis.This book starts with an architectural overview of Flume and its logical components. It explores channels, sinks, and sink processors, followed by sources and channels. By the end of this book, you will be fully equipped to construct a series of Flume agents to dynamically transport your stream data and logs from your systems into Hadoop.A step-by-step book that guides you through the architecture and components of Flume covering different approaches, which are then pulled together as a real-world, end-to-end use case, gradually going from the simplest to the most advanced features.
Distributed Log Collection for Hadoop
If your role includes moving datasets into Hadoop, this book will help you do it more efficiently using Apache Flume. From installation to customization, it's a complete step-by-step guide on making the service work for you. Overview Integrate Flume with your data sources Transcode your data en-route in Flume Route and separate your data using regular expression matching Configure failover paths and load-balancing to remove single points of failure Utilize Gzip Compression for files written to HDFS In Detail Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. Its main goal is to deliver data from applications to Apache Hadoop's HDFS. It has a simple and flexib...
Solutions and Examples for Apache Administrators
2nd Edition
There's plenty of documentation on installing and configuring the Apache web server, but where do you find help for the day-to-day stuff, like adding common modules or fine-tuning your activity logging? That's easy. The new edition of the Apache Cookbook offers you updated solutions to the problems you're likely to encounter with the new versions of Apache. Written by members of the Apache Software Foundation, and thoroughly revised for Apache versions 2.0 and 2.2, recipes in this book range from simple tasks, such installing the server on Red Hat Linux or Windows, to more complex tasks, such as setting up name-based virtual hosts or securing and manag...
Application Development with Apache
"Do you learn best by example and experimentation? This book is ideal. Have your favorite editor and compiler readyyou'll encounter example code you'll want to try right away. You've picked the right bookthis is sure to become the de facto standard guide to writing Apache modules." Rich Bowen, coauthor, Apache Administrators Handbook, Apache Cookbook, and The Definitive Guide to Apache mod_rewrite "A first-rate guide to getting the most out of Apache as a modular application platformsure to become a must-read for any Apache programmer, from beginner to experienced professional. It builds up carefully and meticulously from the absolute basics, while including chapters on everything from the popular Apache DBD Framework to best practi...
2007 - 2021 © eBooks-IT.org