Data Science Wrangling
1 Learning Objectives
- How to import data into R from different file formats
- How to scrape data from the web
- How to tidy data using the tidyverse to better facilitate analysis
- How to process strings with regular expressions (regex)
- How to wrangle data using dplyr
- How to work with dates and times as file formats
- How to mine text
1.1 Course Overview
1.1.1 Section 1: Data Import
You will learn how to import data from different sources.
1.1.2 Section 2: Tidy Data
You will learn the first pieces of converting data into a tidy format.
1.1.3 Section 3: String Processing
You will learn how to process strings using regular expressions (regex).
1.1.4 Section 4: Dates, Times, and Text Mining
You will learn how to work with dates and times as file formats and how to mine text.
1.2 Introduction to Wrangling
The textbook for this section is available here.
Key points
- The first step in data analysis is importing, tidying and cleaning the data. This is the process of data wrangling.
- In this course, we cover several common steps of the data wrangling process: tidying data, string processing, html parsing, working with dates and times, and text mining.