Name: Video 47 - Understanding the P-Value - ESTIEM LSS Course
Uploaded: 2021-08-01T10:40:20.000Z
Duration: 18 min 36 s
Description: For more info about the ESTIEM Lean Six Sigma Green Belt course visit https://internal.estiem.org/leansixsigma

Video 47 - Understanding the P-Value - ESTIEM LSS Course

Understanding P-Values in Hypothesis Testing

Introduction to P-Values

The discussion begins with the importance of understanding P-values as a key component of hypothesis testing, alongside sampling and confidence intervals.

Common Misconceptions about P-Values

The speaker aims to clarify common myths surrounding P-values, suggesting that many have learned about them incorrectly in basic statistics classes.

Graphical Representation of P-Values

A graphical description will be provided to illustrate P-values, including alpha risk (Type I error) and beta risk (Type II error), using normal distributions for clarity.

Nature of Distributions

It is noted that real-world data rarely follows a normal distribution; instead, many distributions encountered in industrial engineering exhibit long tails, such as Erlang or Poisson distributions.

Understanding the Normal Distribution and Risk Levels

A normal distribution graphic is introduced where a vertical line represents the decision criteria based on an alpha risk level (e.g., 5%).

Interpretation of P-Values

The area under the curve to the right of this line represents the P-value, indicating the probability of observing an event (often a failure event).

Significance of Small P-Values

A small P-value suggests that an observed event is rare and unlikely to occur by chance. For example, a P-value of 0.05 indicates a 5% chance that such an occurrence could happen randomly.

Null Hypothesis Contextualization

The population mean serves as the center for the null hypothesis (H₀). The distance between sample means helps determine if they likely come from different populations.

Risks Associated with Hypotheses Testing

Alpha risk refers to finding a difference when none exists, while beta risk involves failing to detect a difference when one does exist.

Visualizing Alternative Hypotheses

Two hypotheses are illustrated: H₀ (null hypothesis - red distribution) and H₁ (alternative hypothesis - blue distribution), highlighting how shifts can affect detection.

Probabilistic Nature of Decision Making

The concept emphasizes that smaller P-values provide stronger evidence against H₀. However, interpreting these values requires caution due to their probabilistic nature.

Rethinking Statistical Rigor

While not entirely accurate statistically, it’s suggested that one might consider small P-values as indicative probabilities regarding H₀'s truthfulness.

American Statistical Association Guidelines

In March 2016, guidelines were released emphasizing:

A low P-value indicates data incompatibility with statistical models.

Low values do not measure how true hypotheses are or whether data arose from random choice.

Understanding the Limitations of P-Values in Business Decisions

The Role of P-Values in Decision Making

Business decisions should not solely rely on whether a P-value crosses a threshold (e.g., 0.05). It is crucial to assess if we are measuring the right variables and asking appropriate questions.

Proper inference about future outcomes necessitates comprehensive reporting of all components, including beta risk, to provide transparency for decision-makers regarding total risks involved.

A P-value does not quantify the size or significance of an effect; it is essential to consider physical dimensions and knowledge gained from the analysis process.

Comprehensive Systems Approach

Relying on a single P-value does not adequately measure evidence supporting a model or hypothesis. A more holistic systems approach is required for understanding results.

Hypothesis testing must be grounded in real-world contexts, ensuring that sample sizes are appropriate and that alpha and beta risks are balanced between consumer and producer losses.

Criteria for Testing

The chosen P-value must align with sensible decision criteria, emphasizing the importance of selecting tests based on the specific differences being investigated.