Variance of a population | Descriptive statistics | Probability and Statistics | Khan Academy
Introduction to Arithmetic Mean
In this section, the speaker introduces the concept of arithmetic mean and its relevance in measuring the average years of experience at Khan Academy.
Calculating the Population Mean
- The speaker surveys the entire population of Khan Academy to determine the average years of experience.
- With a population size of five people, they collect data on years of experience: 1 year, 3 years, 5 years, 7 years, and 14 years.
- To calculate the population mean (μ), they sum up all the data points and divide by the number of data points (in this case, 5).
- Calculation: (1 + 3 + 5 + 7 + 14) / 5 = 6
- Therefore, the population mean for years of experience at Khan Academy is found to be 6.
Introducing Variance
In this section, the speaker discusses variance as a measure of how much data points vary around the mean.
Calculating Population Variance
- The speaker aims to find a parameter that represents how much data points vary around the mean.
- They introduce population variance (σ^2) as a measure for this purpose.
- To calculate population variance:
- Take each data point and find its squared distance from the mean.
- Sum up these squared distances and divide by the number of data points.
- Calculation:
- Squared distance between 1 and mean = (-5)^2 = 25
- Squared distance between 3 and mean = (-3)^2 =9
- Squared distance between 5 and mean = (-1)^2 =1
- Squared distance between 7 and mean = (1)^2 =1
- Squared distance between 14 and mean = (8)^2 =64
- Sum of squared distances: 25 + 9 + 1 + 1 + 64 =100
- Population variance: 100 / 5 =20
- The average squared distance, or mean squared distance, from the population mean is found to be 20.
Understanding Squared Distance
In this section, the speaker clarifies that the calculated squared distances represent the squared difference between each data point and the population mean.
Clarifying Squared Distance Calculation
- The speaker emphasizes that the calculated squared distances are not actual distances but rather represent the squared difference between each data point and the population mean.
- The purpose of squaring is to ensure positive values for calculation purposes.
- Therefore, when interpreting these values, it's important to understand that they represent the squared differences from the population mean.
New Section
In this section, the speaker discusses how to calculate population variance and explains the steps involved in the process.
Calculating Population Variance
- To calculate population variance, we start by taking the sum of each data point's difference from the population mean.
- We square each difference and sum them up. This gives us the numerator of the variance formula.
- To obtain the actual variance value, we divide the numerator by the number of data points in the population.
New Section
The speaker continues explaining how to calculate population variance and emphasizes that it may seem daunting but is actually straightforward.
Steps for Calculating Population Variance
- Calculate the population mean before proceeding with variance calculation.
- For each data point, subtract the population mean from it.
- Square each difference obtained in step 2.
- Sum up all squared differences to get the numerator of the variance formula.
- Divide the numerator by the total number of data points to obtain the final variance value.
New Section
The speaker reiterates that calculating population variance involves squaring differences between data points and population mean, then dividing by total data points.
Detailed Calculation Process
- Subtracting each data point from the population mean gives us a set of differences.
- Squaring these differences helps eliminate negative values and emphasizes their magnitude.
- Summing up all squared differences provides us with a numerator for calculating variance.
- Dividing this numerator by total data points yields an accurate measure of population variance.
New Section
The speaker concludes that calculating population variance requires determining both individual differences and overall average.
Final Steps for Calculating Population Variance
- Calculate individual differences between each data point and population mean.
- Square these differences to emphasize their magnitude.
- Sum up all squared differences to obtain the numerator of the variance formula.
- Divide the numerator by the total number of data points to calculate population variance.
The transcript is in English, and the notes are written in English as well.