Parameter, Statistic, Random Variable, Estimator and Estimate
The most common problem while learning Statistics is that students’ lack of understanding of the basic terminologies, notations, definitions and concepts. Think of Statistics as building blocks, and you need a solid foundation to move forward. Here, I explain five common terms in Statistics: i) Parameter, ii) Statistic, iii) Random Variable, iv) Estimator, v) Estimate and their notations.
I will start with the definition of Population and Sample.
A population is a complete collection of individuals/ objects that we are interested in. A sample is a subset of a population.
Parameter
A parameter is a descriptive measure(numerical value) of the population. Parameters are usually denoted by Greek letters.
Examples of parameters:
- population mean
- population variance
Statistic
A statistic is a descriptive measure of a sample. For example, sample mean, sample standard deviation, etc. We will talk about the notations under estimator and estimate.
Random Variable
Before introducing random variable, let me very shortly recall what is a random experiment and what is a sample space.
A random experiment is a physical situation whose outcome cannot be predicted with certainty until it is observed. A random experiment can be repeated as many times as we want under the same conditions (leading to different outcomes). Each one of them a trial. Thus, a trial is a particular performance of a random experiment.
A Sample space is a set of all possible outcomes of a random experiment. In this blog post I use
Example 1:
Random Experiment: Tossing of a coin.
Sample Space:
library(prob)
tosscoin(1)
toss1
1 H
2 T
Example 2:
Random Experiment: Toss a coin three times.
Sample Space:
tosscoin(3)
toss1 toss2 toss3
1 H H H
2 T H H
3 H T H
4 T T H
5 H H T
6 T H T
7 H T T
8 T T T
Definition: Random Variable
Let
You need Measure Theory knowledge to understand the above definition.
Loosely speaking a Random variable is a function from the sample space to the real numbers. There are two types of random variables: i) Discrete random variable and ii) Continuous random variable. We use Roman capital letters to denote random variables (
The notation
An example will help you to understand how this works. To start with, let’s consider a simple experiment with two possible outcomes: PCR test result of a randomly selected individual.
The random variable
Solution 1.1
Sample space
Random variable
Solution 1.2
Simply drop
Let’s consider another experiment. The experiment consists in selecting a random undergraduate student in the University during a period of one week, and measuring their height.
The random variable
or
Notations for Random Variables and Observed Data (in Sampling)
Suppose you have a random variable
Let’s try to understand the concept with an example.
where
We performed the PCR test on randomly selected 5 individuals. We called this sample 1. The results are
Prior to obtaining
We again performed the PCR test on another randomly selected 5 individuals. We called this sample 2. Now results are
You can see observable values for
Estimator and Estimate
Distinction between the terms Estimator and Estimate is important.
Let
An estimator is a statistic,
Estimator: An estimator is always a random variable.
Estimate: An estimate is a constant.
Note that we use
Estimator is a function of observable random variables that is used to estimate an unknown parameter
Below example is useful to understand the concepts.
Let’s say you wanted to know the mean height of undergraduates in a certain university with a population of 1000 undergraduates. You take a random sample of 10 students and measure their height. The observed values are (in cm) 150, 155, 160, 161, 162, 152, 140, 141, 150, 155. Suppose you want to use sample mean to estimate the population mean.
The parameter we want to estimate is,
There are infinitely many possible estimators for
To do this, we first define random variables
Now we can write the estimator as
The estimate is
mean(c(150, 155, 160, 161, 162, 152, 140, 141, 150, 155))
[1] 152.6
If you are not given the observed values of
The observed value of the estimator varies from sample to sample.
Usage
- Writing distributions
pbinom(5, 10, 0.6)
[1] 0.3668967
- Calculation of expectations
This work is
licensed under a Creative Commons Attribution 4.0 International
License.