eBooks-it.org Logo
eBooks-IT.org Inner Image

Using OpenRefine

Using OpenRefine Image

Book Details:

Publisher:Packt Publishing
Series: Packt , Using
Author:Ruben Verborgh
Edition:1
ISBN-10:1783289082
ISBN-13:9781783289080
Pages:114
Published:Sep 10 2013
Posted:Nov 19 2014
Language:English
Book format:PDF
Book size:2.11 MB

Book Description:

The essential OpenRefine guide that takes you from data analysis and error fixing to linking your dataset to the Web Overview Create links between your dataset and others in an instant Effectively transform data with regular expressions and the General Refine Expression Language Spot issues in your dataset and take effective action with just a few clicks In Detail Data is supposed to be the new gold, but how can you unlock the value in your data? Managing large datasets used to be a task for specialists, but you don't have to worry about inconsistencies or errors anymore. OpenRefine lets you clean, link, and publish your dataset in a breeze. Using OpenRefine takes you on a practical tour of all the handy features of this well-known data transformation tool. It is a hands-on recipe book that teaches you data techniques by example. Starting from the basics, it gradually transforms you into an OpenRefine expert. This book will teach you all the necessary skills to handle any large dataset and to turn it into high-quality data for the Web. After you learn how to analyze data and spot issues, we'll see how we can solve them to obtain a clean dataset. Messy and inconsistent data is recovered through advanced techniques such as automated clustering. We'll then show extract links from keyword and full-text fields using reconciliation and named-entity extraction. Using OpenRefine is more than a manual: it's a guide stuffed with tips and tricks to get the best out of your data. What you will learn from this book Import data in various formats Explore datasets in a matter of seconds Apply basic and advanced cell transformations Deal with cells that contain multiple values Create instantaneous links between datasets Filter and partition your data easily with regular expressions Use named-entity extraction on full-text fields to automatically identify topics Perform advanced data operations with the General Refine Expression Language Approach The book is styled on a Cookbook, containing recipes - combined with free datasets - which will turn readers into proficient OpenRefine users in the fastest possible way. Who this book is written for This book is targeted at anyone who works on or handles a large amount of data. No prior knowledge of OpenRefine is required, as we start from the very beginning and gradually reveal more advanced features. You don't even need your own dataset, as we provide example data to try out the book's recipes.

Download Link:

Related Books:

Data Structures and Problem Solving Using Java

Data Structures and Problem Solving Using Java Image
3rd Edition
Data Structures and Problem Solving Using Java3/e provides a practical introduction to data structures from a viewpoint of abstract thinking and problem solving, and incorporates the enhancements of Java 5.0. It includes coverage of generic programming, and content on the design of generic collection classes. This book is appropriate for readers who are familiar with basic Java programming concepts or are new to the language and want to learn how it treats data structures concepts....

Developing Flex 4 Components

Using ActionScript & MXML to Extend Flex and AIR Applications
Developing Flex 4 Components Image
The first book to completely demystify leading-edge component development with the Adobe Flex 3 platform - How to build components for Flex and AIR applications using ActionScript 3.0 and Adobe's powerful MXML user interface markup language - Covers expert techniques most books ignore, including component metadata, error handling, documentation, and creating Flex components in Flash using the Flex Component Kit - By Mike Jones, world-renowned Flex development consultant and speaker Summary Adobe Flex 3 offers a powerful new framework that web developers can use to quickly produce richer, more immersive, higher-value solutions. To help developers build the most powerful next-generation web applications, Adobe structured the Flex framework around compo...

Dojo

Using the Dojo JavaScript Library to Build Ajax Applications
Dojo Image
Dojo offers Web developers and designers a powerful JavaScript toolkit for rapidly developing robust Ajax applications. Now, for the first time, there's a complete, example-rich developer's guide to Dojo and its growing library of prepackaged widgets. Reviewed and endorsed by the Dojo Foundation, the creators of Dojo, this book brings together all the hands-on guidance and tested code samples you need to succeed. Expert Web developer James E. Harmon begins by demonstrating how to 'Ajax-ify' existing applications and pages with Dojo, adding Ajax features such as client- and server-side validation as quickly and nondisruptively as possible. Next, he presents in-depth coverage of Dojo's user interface, form, layout, and specialized Widgets, showing ho...



2007 - 2021 © eBooks-IT.org