Sum of Squares
A statistical tool that is used to identify the dispersion of data
HomeResourcesKnowledgeOtherSum of Squares
What is Sum of Squares?
Sum of squares (SS) is a statistical tool that is used to identify the dispersion of data as well as how well the data can fit the model in regression analysis. The sum of squares got its name because it is calculated by finding the sum of the squared differences.

This image is only for illustrative purposes.

The sum of squares is one of the most important outputs in regression analysis. The general rule is that a smaller sum of squares indicates a better model, as there is less variation in the data.
In finance, understanding the sum of squares is important because linear regression models are widely used in both theoretical and practical finance.

Types of Sum of Squares
In regression analysis, the three main types of sum of squares are the total sum of squares, regression sum of squares, and residual sum of squares.

1. Total sum of squares
The total sum of squares is a variation of the values of a dependent variable from the sample mean of the dependent variable. Essentially, the total sum of squares quantifies the total variation in a sample. It can be determined using the following formula:

Where:
y– the value in a sample
ȳ – the mean value of a sample

2. Regression sum of squares (also known as the sum of squares due to regression or explained sum of squares)
The regression sum of squares describes how well a regression model represents the modeled data. A higher regression sum of squares indicates that the model does not fit the data well.
The formula for calculating the regression sum of squares is:

Where:
ŷ– the value estimated by the regression line
ȳ – the mean value of a sample

3. Residual sum of squares (also known as the sum of squared errors of prediction)
The residual sum of squares essentially measures the variation of modeling errors. In other words, it depicts how the variation in the dependent variable in a regression model cannot be explained by the model. Generally, a lower residual sum of squares indicates that the regression model can better explain the data while a higher residual sum of squares indicates that the model poorly explains the data.
The residual sum of squares can be found using the formula below:

Where:
y– the observed value
ŷ– the value estimated by the regression line

The relationship between the three types of sum of squares can be summarized by the following equation:

CFI offers the Financial Modeling & Valuation Analyst (FMVA)™ certification program for those looking to take their careers to the next level. To keep learning and advancing your career, the following CFI resources will be helpful:
Guide to Financial Modeling
Harmonic Mean
Hypothesis Testing
3 Statement Model
Financial Analyst Certification
Become a certified Financial Modeling and Valuation Analyst (FMVA)® by completing CFI’s online financial modeling classes and training program!
Company
Certifications
Courses
Support
Resources
© 2015 to 2021 CFI Education Inc.
Notice
We and selected partners use cookies or similar technologies as specified in the cookie policy.
With respect to advertising, we and selected third parties, may use precise geolocation data and actively scan device characteristics for identification in order to store and/or access information on a device and process personal data (e.g. browsing data, IP addresses, usage data or unique identifiers) for the following purposes: personalised ads and content, ad and content measurement, and audience insights; develop and improve products.
You can freely give, deny, or withdraw your consent at any time by accessing the advertising preferences panel.
You can consent to the use of such technologies by closing this notice.