1 Learning Objectives

  • How to import data into R from different file formats
  • How to scrape data from the web
  • How to tidy data using the tidyverse to better facilitate analysis
  • How to process strings with regular expressions (regex)
  • How to wrangle data using dplyr
  • How to work with dates and times as file formats
  • How to mine text

1.1 Course Overview

1.1.1 Section 1: Data Import

You will learn how to import data from different sources.

1.1.2 Section 2: Tidy Data

You will learn the first pieces of converting data into a tidy format.

1.1.3 Section 3: String Processing

You will learn how to process strings using regular expressions (regex).

1.1.4 Section 4: Dates, Times, and Text Mining

You will learn how to work with dates and times as file formats and how to mine text.

1.2 Introduction to Wrangling

The textbook for this section is available here.

Key points

  • The first step in data analysis is importing, tidying and cleaning the data. This is the process of data wrangling.
  • In this course, we cover several common steps of the data wrangling process: tidying data, string processing, html parsing, working with dates and times, and text mining.