Data Mining Methods: Unit 4
Correlation and Simple Linear Regression
Interpretation of the correlation coefficient
Possible range: [-1, 1]
-1: perfect negative linear relationship
0: no linear relationship,
1: perfect positive linear relationship.
To predict one variable from other variables.
To explain the variability of one variable using the other variables.
Predicts scores on one variable from the scores on a second variable.
Response variable: predicting variable (Y )
Predictor variable: predictions based on this variable (X)
Only one predictor variable; otherwise multiple regression
Predictions of the response variable (Y ) is a linear function of the predictor variable (X)