Statistics always deals with variability. There will be no statistics without variability. In our daily life, variability is ubiquitous. Today, we are going to study a kind of variability widely found in measurements. Let's see the case of manufacturing bottled water. Suppose that the labels on the bottles say that the volumes are 220 cc. Does it mean that all the bottles contain exactly 220 cc of water? In fact, if we measure the volumes accurately with a sophisticated measurement device, not all of them has the volume of 220 cc, but the volumes are around 220 cc. Some of them have the volume of 219 cc, some have the volume of 222 cc, 218, 219.5, etc. Statistically speaking, the volumes of water in the bottles have a mean, and since the volumes vary, they have a standard deviation.
Now, can we estimate the percentage of bottled water whose volume is less than 217 cc? This is what this post addresses. If we know that the volume is normally distributed with a certain mean and standard deviation, we will be able to answer it. What is a normal distribution?
Normal distribution (or Gaussian Distribution) is a continuous probability distribution with the following probability density function (pdf):
$n(x) = \frac{1}{\sigma \sqrt{2 \pi}} e^{- \frac{1}{2} (\frac{x - \mu}{\sigma})^2}$
So, if the volumes are normally distributed with the mean of 220 cc and standard deviation of 0.5 cc, then the pdf is: $n(x) = \frac{1}{0.5 \sqrt{2 \pi}} e^{- \frac{1}{2} (\frac{x - 220}{0.5})^2}$
(to be continued)
(to be continued)
Comments
Post a Comment