Binomial distributions

This is about probability of having exactly, at most, or at least kk successes in nn trials.

Updated 2026-05-13 on GDC (calculator) usage and removed substituting values into binomial distribution formula

Contents

When to Use

If it exhibits the following symptoms, binomial distribution may be for you

  • for probability of having a particular number of “successes” out of nn independent tries, or
  • for the expected number of “successes” out of nn independent tries

For picking items from a large pool (in the ~thousands), selection without replacement can still be approximate by binomial distribution.

Definition

Random variable XX is binomially distributed is notated as

XB(n,p)X \sim \text{B}(n, p)

with nn trials or attempts, and probability pp of success for each independent trial. It has the probability distribution

P(X=x)=(nx)px(1p)nx\text{P}(X = x) = \binom nx p^x(1-p)^{n-x}

where (nx)\displaystyle \binom nx is the binomial coefficient or nCx^nC_x.

This formula is not on the formula booklet and not required to know But it is very similar to the one for binomial expansion.

The (nx)\displaystyle \binom nx counts the different sequences of xx successes and nxn - x failures.

Note: In nn tries, xx could take on n+1n + 1 values from 00 to nn.

Binomial distribution is about defining the success for a single trial, and treating different successes separately.

Calculator

Keep in mind very important that < vs ≤ are not interchangeable for binomial. For Nspire and Casio, write cumulative probability as P(a ≤ X ≤ b). For TI 84, this is P(X ≤ b) - P(X ≤ a - 1).

Arguments in grey\color{grey}\text{grey} are optional.

navigationP(X = x)P(a ≤ X ≤ b), or P(X ≤ x)least x with P(X ≤ x) ≥ cdfleast n with P(X ≤ x) ≤ cdf
TI-84 2nd varsA.binompdf(n, p, x)B.binomcdf(n, p, x)C.invBinom(cdf, n, p)
Nspire menu 6.stats, then
5.distr
A.binomPdf(n, p, x)B.binomCdf(n, p, a\color{grey}a, b)C.invBinom(cdf, n, p)D.invBinomN(cdf, p, x)
CG50 / 9860GIIII.DIST / 2.STAT or
1.Run/Mat OPTN stat
Bpd(x, n, p)Bcd(x, n, p)InvB(cdf, n, p), Run-Mat and Stat only
CG50 onlyBcd(a,\color{grey}a, b, n, p)

aa is by default 0, which reduces to the TI 84 / 9860GIII case.

Casio: Use I.Distribution app by default. Use 2. Stat if needing InvN. Only use 1.Run-Mat OPTN if one of n or p is unknown. The 9860GIII 2.Statistics and Bcd can only find P(X ≤ b) and not P(a ≤ X ≤ b), but the I.Distribution app works fine.

Example: A coin is manufactured to land on head 52% of the time and the rest on its tail.

a) 800 people are each flipping such a coin 10 times in a row. Find the probability that exactly one person gets all tails or all heads.

b) Find the number of people needed to have a 95% chance of at least two people each flipping at least eight tails or at least eight heads in ten flips.


a) Let XX be the number of heads in ten flips, such that XB(10,0.52){X \sim \text{B}(10, 0.52)}.

P(X=0)+P(X=10)0.0020948\text{P}(X = 0) + \text{P}(X = 10) \approx 0.0020948

where we can store this in variable (Alpha letter) P, and when we find binompdf or bpd we can use our variable.

Then let YY be number of times someone gets all tails or heads in ten flips, out of 800800 people. YB(800,0.0020948){Y \sim \text{B}(800, 0.0020948)}.

P(Y=1)0.314\begin{align*} \text{P}(Y = 1) \approx 0.314 \qed \end{align*}

b) At least 8 tails or at least 8 heads means

P(X2)P(X8)\text{P}(X \leq 2) \text{P}(X \geq 8)

using the XX from previous part.

TI 84 can only do cumulative binomial distribution probability from 0 to xx, so this is

P(X2)+1P(X7)0.11218749\begin{align*}&\,\text{P}(X \leq 2) + 1 - \text{P}(X \leq 7)\\ \approx&\,0.11218749 \end{align*}

where we can store this in another variable, eg Q.

Now we can define ZB(n,0.11218749){Z \sim \text{B}(n, 0.11218749)} for number of people getting at least 8 tails or at least 8 heads and solve

P(Z2)0.95\text{P} (Z \geq 2) \geq 0.95

or equivalently

P(Z1)<0.05\text{P}(Z \leq 1) \lt 0.05

As nn is a discrete value, taking on only positive integers, this cannot be solved via solver, but rather through a table of values.

nn \quadP(Z1)\text{P}(Z \leq 1)
39390.05720.0572
40400.05190.0519
41410.04700.0470
42420.04260.0426

As n=41n = 41 is the first value to drop below 0.05000.0500, the answer is n=41n = 41 \qed

Alternatively, you can also use round( or int( to first plot the function, replacing n with round(n).

Alternatively using TI Nspire, we can use invBinomN.

Expected value

The expected value is

E(X)=np\text{E}(X) = np

This is why

expected count=# trials×probability\text{expected count} = \text{\# trials}\times\text{probability}

Variance and standard deviation

Var(X)=np(1p)\text{Var}(X) = np(1-p)
σ=np(1p)\sigma = \sqrt{np(1-p)}

Some ways that they could be used on exam is if you are given expected value and variance and you need to solve for nn and pp.

Mode and median

The mode and median are very close to npnp. Precise formulas are beyond scope of the course. If asked on exam, use methods discussed in discrete random variables to investigate.

The median is not necessarily equal to the mode(s).

Tips

  • Does it say at most (N\leq N), at least (N\geq N), less than(<N< N), more than (>N> N), or equal to (=N=N)?
  • See also probability formulas.
  • See also binomial expansion.
  • While for large nn, the binomial distribution can be approximated by a normal distributions, the normal approximation cannot reliably obtain correct answers to 3 sf.

Practice

Practice evaluating binomial probabilities using Bolt ⚡ Binomial.