Question 1

What does the correlation coefficient r strictly signify?

Accepted Answer

It is a value bounding between -1 and 1. The closer its absolute value crawls toward 1, the tighter the points bunch into what looks like a definitive straight line (strong correlation); values tumbling towards 0 imply total chaotic scatter (absence of linear correlation).

Question 2

Why does a single 'outlier' derail the entire regression line?

Accepted Answer

Because the vertical residual is squared! An outlier stationed remarkably far from the line casts a staggeringly massive squared error area. To suppress this singular gigantic penalty, the optimization algorithm is forced into a compromise, aggressively snapping the entire regression line toward the outlier.

Dataset Config

Least Squares Tracker

Distribution & Least Squares

Core Concepts

Line of Best Fit

Residuals

Method of Least Squares

Why call it 'Least Squares'?

FAQ