A regular variable x can take on different values. A random variable can take on different values, based on specific probability of each value. For example, P(X=3) represents the probability that X is measured to be 3. The notation P(X=x) represents the general probability of measuring some general value.
This means we can expect x to appear P(X=x)×100% of the time.
“Random” simply means different values can appear with different probability or frequency. The word does not retain the colloquial meaning of “arbitrary” or “chaotic”. In fact, our discussion of random variables usually require it to have a fixed distribution of values.
The sum of probabilities of a random variable add to one.
x∑P(X=x)=1
A random variable is discrete when there are a finite number of possible values, and continuous when there the possible values consist of an interval or multiple intervals of real numbers.
Example: Random variable X is defined by
P(X=x)={ax−10if x=3,4,6,8otherwise
Find a.
a(31+41+61+81)a(87)a=1=1=78■
For the following measures, be able to find them algebraically and on a graphing calculator using one-variable statistics.
Probability and cumulative probability built-in functions can be graphed when one of n, p, x is unknown, and can also be tabulated.
Expected value, E(X)
The expected value is useful to find the sum or mean of all values measured.
mean=number of measurementssum
and E(X) is the expected mean.
The expected value for for a discrete random variable is
E(X)=μ=x∑x⋅P(X=x)
Example: Random variable X is defined by
P(X=x)={78x−10if x=3,4,6,8otherwise
Find P(X=x).
The expected value is
E(X)=78(3(3)−1+4(4)−1+6(8)−1+3(8)−1)=732■
Mode
Mode is the value with the highest probability.
Example: Random variable X is defined by
P(X=x)={78x−10if x=3,4,6,8otherwise
Identify the mode.
The probability distribution is
value x
probability P(X=x)
3
0.381
4
0.256
6
0.190
8
0.143
The mode is value with the highest probability. It is 3■
A value xm is a candidate for a mode if in some consecutive values of x, it is the first (least) value that satisfies
P(X=xm)P(X=xm+1)<1
or the last (greatest) value that satisfies
P(X=xm−1)P(X=xm)>1
or that
P(X=xm−1)P(X=xm)=1
The candidates for mode should be compared to select the one with the highest probability.
Median
Median is the value that crosses the 0.5 probability
Example: Random variable X is defined by
P(X=x)={78x−10if x=3,4,6,8otherwise
Identify the median.
The median is the first value whose cumulative probability crosses 0.500.
value x
cumulative probability P(X≤x)
3
0.381
4
0.667
6
0.857
8
1.000
The median is 4■
HL: Variance and standard deviation
Variance is σ2 and standard deviation is σ
Var(X)=σ2=x∑(x−μ)2⋅P(X=x)
Variance is typically easier to calculate, especially in calculations beyond the syllabus. Standard deviations are easier to interpret, as the “average” distance to the mean. In reality the average is weighted by (x−μ)2, so values that are further from the mean are given more weight.