=## Linear Equations

Two-variable linear regressions take on the form:

where is the independent variable and is the dependent variable.

Scatterplots

Scatterplots show the direction of a bivariate relationship.

The Regression Equation

The line of best fit is also called a least-squares line in the form . is an estimate.

  • The least-squares line always passes through the point of the and sample means

Error/Residuals

Residuals are calculated using the formula .

The sum of squared errors (SSE) is calculated as . The minima of the SSE returns the line of best fit.

A residual plot should be random/show no pattern when a linear relation is present. A dataset with a strong linear relation has residuals which produce no pattern.

Correlation Coefficient

The correlation coefficient measures the strength of the relationship between and .

  • Values of close to -1 or 1 indicate a strong linear relationship
  • can be a horizontal linear correlation or no linear correlation
  • or indicate a perfect correlation

Coefficient of Determination

The coefficient of determination is typically stated as a percentage.

  • as a percentage represents the percent variation in that can be explained by
  • represents the percent variation not explained by