Practical Data Science with R
Book Details:
Pages: | 416 |
Published: | Apr 13 2014 |
Posted: | Nov 19 2014 |
Language: | English |
Book format: | PDF |
Book size: | 20.26 MB |
Book Description:
SummaryPractical Data Science with R lives up to its name. It explains basic principles without the theoretical mumbo-jumbo and jumps right to the real use cases you'll face as you collect, curate, and analyze the data crucial to the success of your business. You'll apply the R programming language and statistical analysis techniques to carefully explained examples based in marketing, business intelligence, and decision support.Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.About the BookBusiness analysts and developers are increasingly collecting, curating, analyzing, and reporting on crucial business data. The R language and its associated tools provide a straightforward way to tackle day-to-day data science tasks without a lot of academic theory or advanced mathematics.Practical Data Science with R shows you how to apply the R programming language and useful statistical techniques to everyday business situations. Using examples from marketing, business intelligence, and decision support, it shows you how to design experiments (such as A/B tests), build predictive models, and present results to audiences of all levels.This book is accessible to readers without a background in data science. Some familiarity with basic statistics, R, or another scripting language is assumed.What's InsideData science for the business professionalStatistical analysis using the R languageProject lifecycle, from planning to deliveryNumerous instantly familiar use casesKeys to effective data presentationsAbout the AuthorsNina Zumel and John Mount are cofounders of a San Francisco-based data science consulting firm. Both hold PhDs from Carnegie Mellon and blog on statistics, probability, and computer science at win-vector.com.Table of ContentsPART 1 INTRODUCTION TO DATA SCIENCEThe data science processLoading data into RExploring dataManaging dataPART 2 MODELING METHODSChoosing and evaluating modelsMemorization methodsLinear and logistic regressionUnsupervised methodsExploring advanced methodsPART 3 DELIVERING RESULTSDocumentation and deploymentProducing effective presentations
89 hands-on recipes to help you complete real-world data science projects in R and Python About This BookLearn about the data science pipeline and use it to acquire, clean, analyze, and visualize dataUnderstand critical concepts in data science in the context of multiple projectsExpand your numerical programming skills through step-by-step code examples and learn more about the robust features of R and PythonWho This Book Is ForIf you are an aspiring data scientist who wants to learn data science and numerical programming concepts through hands-on, real-world project examples, this is the book for you. Whether you are brand new to data science or you are a seasoned expert, you will benefit from learning about the structure of data science projects, t...
Perform group-wise data manipulation and deal with large datasets using R efficiently and effectively Overview Perform factor manipulation and string processing Learn group-wise data manipulation using plyr Handle large datasets, interact with database software, and manipulate data using sqldf In Detail One of the most important aspects of computing with data is the ability to manipulate it to enable subsequent analysis and visualization. R offers a wide range of tools for this purpose. Data from any source, be it flat files or databases, can be loaded into R and this will allow you to manipulate data format into structures that support reproducible and convenient data analysis. This practical, example-oriented guide aims to discuss the split-apply...
Set up an integrated infrastructure of R and Hadoop to turn your data analytics into Big Data analytics Overview Write Hadoop MapReduce within R Learn data analytics with R and the Hadoop platform Handle HDFS data within R Understand Hadoop streaming with R Encode and enrich datasets into R In Detail Big data analytics is the process of examining large amounts of data of a variety of types to uncover hidden patterns, unknown correlations, and other useful information. Such information can provide competitive advantages over rival organizations and result in business benefits, such as more effective marketing and increased revenue. New methods of working with big data, such as Hadoop and MapReduce, offer alternatives to traditional data warehousing....
2007 - 2021 © eBooks-IT.org