Understanding the Probability Formula: P(A ∪ B) = P(A) + P(B) − P(A ∩ B) — A Complete Guide to Combining Events

In probability theory, one of the most fundamental concepts is calculating the likelihood that at least one of multiple events will occur. This is expressed by the key formula:

P(A ∪ B) = P(A) + P(B) − P(A ∩ B)

Understanding the Context

This equation helps us find the probability that either event A or event B (or both) happens, avoiding double-counting the overlap between the two events. While it applies broadly to any two events, it becomes especially useful in complex probability problems involving conditional outcomes, overlapping data, or real-world decision-making.


Breaking Down the Formula

The expression:

Key Insights

P(A ∪ B) = P(A) + P(B) − P(A ∩ B)

means that:

  • P(A) is the probability of event A occurring,
  • P(B) is the probability of event B occurring,
  • P(A ∩ B) is the probability that both events A and B occur simultaneously, also called their intersection.

If A and B were mutually exclusive (i.e., they cannot happen at the same time), then P(A ∩ B) = 0, and the formula simplifies to P(A ∪ B) = P(A) + P(B). However, in most real-world scenarios — and certainly when modeling dependencies — some overlap exists. That’s where subtracting P(A ∩ B) becomes essential.


Final Thoughts

Applying the Formula with Numbers

Let’s apply the formula using concrete probabilities:

Suppose:

  • P(A) = 0.4
  • P(B) = 0.3
  • P(A ∩ B) = 0.4 × 0.3 = 0.12 (assuming A and B are independent — their joint probability multiplies)

Plug into the formula:

P(A ∪ B) = 0.4 + 0.3 − 0.12 = 0.7 − 0.12 = 0.58

Thus, the probability that either event A or event B occurs is 0.58 or 58%.


Why This Formula Matters

Understanding P(A ∪ B) is crucial across multiple fields:

  • Statistics: When analyzing survey data where respondents may select multiple options.
  • Machine Learning: Calculating the probability of incorrect predictions across multiple classifiers.
  • Risk Analysis: Estimating joint failure modes in engineering or finance.
  • Gambling and Decision Theory: Making informed choices based on overlapping odds.