Book Details:
Pages: 536
Published: Oct 10 2012
Posted: Nov 19 2014
Language: English
Book format: PDF
Book size: 14.74 MB
Book Description:
Summary
Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you'll face, like querying big data using Pig or writing a log file loader. You'll explore each problem step by step, learning both how to build and deploy that specific solution along with the thinking that went into its design. As you work through the tasks, you'll find yourself growing more comfortable with Hadoop and at home in the world of big data.

About the Technology
Hadoop is an open source MapReduce platform designed to query and analyze data distributed across large clusters. Especially effective for big data systems, Hadoop powers mission-critical software at Apple, eBay, LinkedIn, Yahoo, and Facebook. It offers developers handy ways to store, manage, and analyze data.

About the Book
Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You'll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book's examples create a well-structured and understandable codebase you can tweak to meet your own needs. This book assumes the reader knows the basics of Hadoop. Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.
What's Inside
Conceptual overview of Hadoop and MapReduce
85 practical, tested techniques
Real problems, real solutions
How to integrate MapReduce and R

Table of Contents
PART 1 BACKGROUND AND FUNDAMENTALS
Hadoop in a heartbeat
PART 2 DATA LOGISTICS
Moving data in and out of Hadoop
Data serialization: working with text and beyond
PART 3 BIG DATA PATTERNS
Applying MapReduce patterns to big data
Streamlining HDFS for big data
Diagnosing and tuning performance problems
PART 4 DATA SCIENCE
Utilizing data structures and algorithms
Integrating R and Hadoop for statistics and more
Predictive analytics with Mahout
PART 5 TAMING THE ELEPHANT
Hacking with Hive
Programming pipelines with Pig
Crunch and other technologies
Testing and debugging
Hadoop in Action teaches readers how to use Hadoop and write MapReduce programs. The intended readers are programmers, architects, and project managers who have to process large amounts of data offline. Hadoop in Action will lead the reader from obtaining a copy of Hadoop to setting it up in a cluster and writing data analytic programs. The book begins by making the basic idea of Hadoop and MapReduce easier to grasp by applying the default Hadoop installation to a few easy-to-follow tasks, such as analyzing changes in word frequency across a body of documents. The book continues through the basic concepts of MapReduce applications developed using Hadoop, including a close look at framework components, use of Hadoop for a variety of data analysis task...
Real Estate Development Modeling in the Real World
This book is a practical guide to using Argus Developer, the world's most widely used real estate development feasibility modeling software. Using practical examples and many case studies, it takes readers beyond basic training and provides the in-depth knowledge required to analyze potential real estate deals and help ensure a profitable development. Argus Developer in Practice fills an important gap in the market. Argus Developer, and its predecessor Circle Developer, has long had a dominant position as the primary real estate development appraisal tool. It is used all over the world on a variety of projects ranging from simple residential projects to huge and complex master planned, mixed-use, commercial, residential, and leisure projects. It also...
2nd Edition
This award-winning book, substantially updated to reflect the latest developments in the field, introduces the concepts and best practices of software architecture--how a software system is structured and how that system's elements are meant to interact. Distinct from the details of implementation, algorithm, and data representation, an architecture holds the key to achieving system quality, is a reusable asset that can be applied to subsequent systems, and is crucial to a software organization's business strategy. Drawing on their own extensive experience, the authors cover the essential technical topics for designing, specifying, and validating a sy...