Data Mining Methods: Unit 4
Correlation and Simple Linear Regression
Interpretation of the correlation coefficient
Possible range: [-1, 1]
-1: perfect negative linear relationship
0: no linear relationship
1: perfect positive linear relationship
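The three reference values above can be checked numerically. The following is a minimal sketch, assuming the coefficient meant here is the sample Pearson correlation; the function name `pearson_r` and the toy data are illustrative and not taken from the unit.

```python
import math

def pearson_r(xs, ys):
    """Sample Pearson correlation of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Deviations of each observation from its variable's mean
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    sxy = sum(a * b for a, b in zip(dx, dy))
    sxx = sum(a * a for a in dx)
    syy = sum(b * b for b in dy)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)

# Toy data (made up for illustration)
x = [1, 2, 3, 4, 5]
r_pos = pearson_r(x, [2, 4, 6, 8, 10])                   # points on a rising line
r_neg = pearson_r(x, [10, 8, 6, 4, 2])                   # points on a falling line
r_zero = pearson_r([-2, -1, 0, 1, 2], [4, 1, 0, 1, 4])   # y = x^2, symmetric about 0

print(r_pos, r_neg, r_zero)
```

The last case gives a coefficient of 0 even though Y is fully determined by X, which is why 0 is read as "no linear relationship" rather than "no relationship".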
Regression: Objective
To predict one variable from other variables.
To explain the variability of one variable using the other variables.
In the two-variable case: to predict scores on one variable from the scores on a second variable.
Response variable (Y): the variable being predicted
Predictor variable (X): the variable the predictions are based on
Simple regression:
Only one predictor variable; with two or more predictors it is multiple regression
Linear regression:
The predicted value of the response variable (Y) is a linear function of the predictor variable (X)
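As an illustration of "a linear function of X", the sketch below fits an intercept and slope and then predicts Y for a new X. It assumes ordinary least squares as the fitting method, which the unit does not state at this point; the names `fit_simple_linear` and `predict` and the data are hypothetical.

```python
def fit_simple_linear(xs, ys):
    """Least-squares intercept and slope for one predictor (an assumed method)."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("predictor is constant; slope is undefined")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

def predict(intercept, slope, x):
    """Predicted response: a linear function of the predictor."""
    return intercept + slope * x

# Made-up data: X = hours studied (predictor), Y = exam score (response)
hours = [1, 2, 3, 4, 5]
score = [52, 55, 61, 64, 68]

intercept, slope = fit_simple_linear(hours, score)
predicted = predict(intercept, slope, 6)
print(intercept, slope, predicted)
```

For this toy data the slope comes out at about 4.1 and the intercept at about 47.7, so each extra hour is associated with roughly 4.1 more points in the predicted score.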