2 Section 1 - Introduction to Machine Learning Overview
In the Introduction to Machine Learning section, you will be introduced to machine learning.
After completing this section, you will be able to:
- Explain the difference between the outcome and the features.
- Explain when to use classification and when to use prediction.
- Explain the importance of prevalence.
- Explain the difference between sensitivity and specificity.
This section has one part: introduction to machine learning.
2.1 Notation
There is a link to the relevant section of the textbook: Notation
Key points
- \(X_1, ..., X_p\) denote the features, \(Y\) denotes the outcomes, and \(\hat{Y}\) denotes the predictions.
- Machine learning prediction tasks can be divided into categorical and continuous outcomes. We refer to these as classification and prediction, respectively.
2.2 An Example
There is a link to the relevant section of the textbook: An Example
Key points
- \(Y_i\) = an outcome for observation or index i.
- We use boldface for \(\mathbf{X_i}\) to distinguish the vector of predictors from the individual predictors \(X_{i,1}, ..., X_{i,784}\).
- When referring to an arbitrary set of features and outcomes, we drop the index i and use \(Y\) and bold \(\mathbf{X}\).
- Uppercase is used to refer to variables because we think of predictors as random variables.
- Lowercase is used to denote observed values. For example, \(\mathbf{X} = \mathbf{x}\).
2.3 Comprehension Check - Introduction to Machine Learning
- True or False: A key feature of machine learning is that the algorithms are built with data.
- A. True
- B. False
- True or False: In machine learning, we build algorithms that take feature values (X) and train a model using known outcomes (Y) that is then used to predict outcomes when presented with features without known outcomes.
- A. True
- B. False