BA01. [Mathematical Breakdown] The Short Shot

Exa Euler
8 Min Read

The story you just read is not merely a novel. It is like a castle built upon rigorous mathematical calculations. Here, I reveal the mathematical blueprint of how the ‘Traditional Bayes‘ Theorem’ learned in textbooks transforms into the ‘Odds and Bayes Factor‘ used in the field.

1. The Textbook Method (The Probability View)

The equation below is the archetype of Bayes’ Theorem we learn in school. The key is that it uses the denominator to Normalize the sum of total probabilities so that it always equals 1 (100%).

Here, the Likelihood P(D|H)—that is, the probability that the data is observed when Hypothesis H is true—is determined by the nature of the data. In the case of the ‘Short Shot’ scenario seen earlier, under the two hypotheses of Temperature (HT) and Pressure (HP), only results of ‘Defect’ or ‘Good’ exist. If we substitute these with 1 and 0, the individual data follows a typical Bernoulli Distribution.

$$P(H|D) = \frac{P(D|H) P(H)}{P(D)}$$

  • P(H/D) : Posterior Probability (Belief, confidence, probability, or probability distribution after seeing the data)
  • P(D/H) : Likelihood (Probability of the data appearing when the hypothesis is correct)
  • P(H) : Prior Probability (Existing belief)
  • P(D) : Probability of Evidence (Probability of data occurrence integrating all hypotheses)

Practical Calculation Problems in the Field

Here, calculating the denominator P(D) is a headache. This is because you have to sum up the probabilities of all hypotheses..
P(D)=P(D|HT)P(HT)+P(D|HP)P(HP)P(D) = P(D|H_T)P(H_T) + P(D|H_P)P(H_P) … (As hypotheses increase, the calculation not only explodes, but in practice, $P(D)$ is often unknown)

2. The Method in the Story: The Odds View

The Odds calculation method is another powerful approach to performing hypothesis-testing Bayesian inference. The core of this method is the active utilization of ‘Subjective Probability’.

Managers in the business field each have different intuitions and empirical beliefs. The Odds framework does not exclude such personal subjectivity, but rather converts it into a tool that can be managed systematically and consistently. Managers quantify their initial beliefs into Odds, and continuously update that probability by reflecting new evidence (data) whenever it comes in.

Through this iterative process, managers can verify how well their intuition aligns with actual data and improve the quality of their decision-making. Above all, from a technical perspective, the fact that there is no need to calculate the ‘troublesome denominator P(D)’, which requires considering all possible data scenarios, is a significant practical advantage.

[Value of the Odds Framework for Executives]

  1. Systematization of Subjectivity: Provides a framework to manage the differing intuitions and beliefs (subjective probabilities) of each manager as consistent numerical values.
  2. Dynamic Decision Making: Verification of decision consistency through the iterative loop of “Set Belief → Reflect Data → Update Belief“.
  3. Calculation Efficiency: Supports rapid decision-making by eliminating the complex denominator P(D) operation, which requires calculating the total probability, through the use of the Odds method.

$$\text{Posterior Odds} = \text{Prior Odds} \times \text{Bayes Factor}$$

  • Odds: How many times more likely is Hypothesis A than Hypothesis B? (Ratio)
  • Bayes Factor: How many times more does the data support Hypothesis A than Hypothesis B? (Ratio of strength)

3. Perfect One-to-One Mapping (The Mapping)

Now, let’s cast some magic. How does the ‘complex probability formula’ transform into the ‘simple odds formula’? We have two hypotheses, Temperature (HT) and Pressure (HP). Let’s write out each of the traditional Bayes equations below.

Equation A (The equation for the Temperature Hypothesis, where Temperature is the culprit):

$$P(H_T|D) = \frac{P(D|H_T) P(H_T)}{P(D)}$$

Equation B (The equation for the Pressure Hypothesis, where Pressure is the culprit):

$$P(H_P|D) = \frac{P(D|H_P) P(H_P)}{P(D)}$$

Now, let’s divide [Equation A] by [Equation B]. Then, something amazing happens.

$$\frac{P(H_T|D)}{P(H_P|D)} = \frac{\frac{P(D|H_T) P(H_T)}{P(D)}}{\frac{P(D|H_P) P(H_P)}{P(D)}}$$

The troublesome P(D), which is present in both the denominator and the numerator, cancels out and disappears! (Poof!)

If we organize what remains, it becomes exactly the formula we used in the story.

$$\underbrace{\frac{P(H_T|D)}{P(H_P|D)}}_{\text{Posterior Odds}} = \underbrace{\frac{P(D|H_T)}{P(D|H_P)}}_{\text{Bayes Factor}} \times \underbrace{\frac{P(H_T)}{P(H_P)}}_{\text{Prior Odds}}$$

Key Points

  • Ratio of Traditional Posterior Probabilities → Posterior Odds
  • Ratio of Traditional Likelihoods → Bayes Factor
  • Ratio of Traditional Prior Probabilities → Prior Odds

Ultimately, the Odds method is not new mathematics, but a field version of Bayes’ theorem that streamlines calculation by removing the complex normalization constant P(D).

4. Substitution with Real Data (Verification)

Let’s plug the ‘Morning Data (5 defects out of 50)’ from the story into both methods to check if they yield the same result.

A. A. The Story Method (Odds & Bayes Factor)

  • Prior Odds: 0.6 / 0.4 = 1.5
  • Bayes Factor: 𝟒.𝟕\approx \mathbf{4.7} (Calculated value)
  • Posterior Odds: 1.5 x 4.7 = 7.05
  • Final Probability: 7.05/(1+7.05)𝟖𝟕.𝟔%7.05 / (1+7.05) \approx \mathbf{87.6\%}

B. The Traditional Method (Probability)

  • Likelihood Calculation(Bernoulli distribution, $P$=defect rate, $k$=number of defects, $n$=sample size): Pk(1P)nkP^k (1-P)^{n-k}
    • Temperature Likelihood P(D|HT):(0.08)5(0.92)45very small number AP(D|H_T): (0.08)^5(0.92)^{45} \approx \text{very small number A}
    • Pressure Likelihood P(D|HP):(0.04)5(0.96)45very small number BP(D|H_P): (0.04)^5(0.96)^{45} \approx \text{very small number B}
    • (Here, A is about 4.7 times B. Try calculating it yourself)
  • Calculation of Evidence Probability P(D): (A×0.6)+(B×0.4)(A \times 0.6) + (B \times 0.4)
  • Posterior Probability P(HT|D)P(H_T|D): A×0.6(A×0.6)+(B×0.4)\frac{A \times 0.6}{(A \times 0.6) + (B \times 0.4)}
  • If you punch this complex formula into a calculator? Surprisingly, it comes out to exactly 87.6%.

Conclusion (Writer’s Conclusion)

Why do we use the Odds method?
The traditional method requires calculating the total sum P(D) every time, so the calculation slows down in streaming situations (Loop) where data keeps coming in.

On the other hand, the Odds method only requires continuous simple multiplication. This is precisely why field engineers and AI love Bayes, and why the Odds method was used in the story.

Share This Article
Leave a Comment

Leave a Reply