Exercise Sheet 4

Data Mining Methods: Unit 4
Correlation and Simple Linear Regression

Interpretation of the correlation coefficient
Possible range: [-1, 1]
-1: perfect negative linear relationship
0: no linear relationship
1: perfect positive linear relationship
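
As a minimal sketch of the interpretation above, the Pearson correlation coefficient can be computed directly from its definition. The function name pearson_r and the sample data are illustrative assumptions and do not come from the exercise sheet.

```python
import math


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length samples."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two samples of equal length with at least 2 points")

    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    # Sums of cross-products and squared deviations around the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)

    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")

    return sxy / math.sqrt(sxx * syy)


# Illustrative data (made up for this sketch).
x = [-2, -1, 0, 1, 2]
print(pearson_r(x, [2 * v + 1 for v in x]))   # points on an increasing line
print(pearson_r(x, [-3 * v + 4 for v in x]))  # points on a decreasing line
print(pearson_r(x, [v ** 2 for v in x]))      # exact quadratic relationship
```

The third call illustrates why 0 is read as "no linear relationship" and not "no relationship": Y is fully determined by X there, yet the linear association is zero.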

Regression: Objective

To predict one variable from other variables.
To explain the variability of one variable using the other variables.

Regression predicts scores on one variable from the scores on a second variable.

Response variable (Y): the variable being predicted
Predictor variable (X): the variable on which the predictions are based

Simple regression:
Only one predictor variable; otherwise multiple regression

Linear regression:

The prediction of the response variable (Y) is a linear function of the predictor variable (X).
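
A linear prediction rule of this kind can be written as Y ≈ a + bX. The sheet does not name a fitting method, so the sketch below assumes ordinary least squares, a common choice for estimating the intercept a and slope b. The function names and the sample data are hypothetical.

```python
def fit_simple_linear_regression(xs, ys):
    """Estimate (intercept, slope) of y = intercept + slope * x by least squares."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two samples of equal length with at least 2 points")

    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

    if sxx == 0:
        raise ValueError("slope is undefined when the predictor is constant")

    slope = sxy / sxx
    # The least-squares line always passes through (mean_x, mean_y).
    intercept = mean_y - slope * mean_x
    return intercept, slope


def predict(intercept, slope, x):
    """Predicted response for a single predictor value x."""
    return intercept + slope * x


# Illustrative data (made up for this sketch): X = predictor, Y = response.
X = [1, 2, 3, 4, 5]
Y = [2.1, 3.9, 6.2, 7.8, 10.1]

a, b = fit_simple_linear_regression(X, Y)
print("intercept:", a, "slope:", b)
print("prediction at X = 6:", predict(a, b, 6))
```

With a single predictor X this is simple linear regression; with several predictors the same idea generalizes to multiple regression, which this sketch does not cover.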
