e-value

January 20, 2026 — data-science

Snippet

e-values vs p-values

The key is to constructing e-values well.

A p-variable for testing : a random variable satisfying Realized values = p-values.
An e-variable for testing : a non-negative random variable such that Realized values = e-values.

Advantages of E-values

Safe under optional stopping (cf. time-to-event data).
Naturally suited for sequential testing, online inference, and gradual evidence accumulation.
Easily combined across studies or time (e.g., via multiplication).
Useful in high-dimensional and composite/irregular models, where computing exact -values is hard.
No need for fixed sample size or pre-specified analysis plans.

Weaknesses

Generally less powerful than Neyman–Pearson (fixed-) tests.
Require more data to reach traditional thresholds like 0.05-equivalent.

Optional Continuation Problem & p-hacking

Repeated trials (e.g., groups A, B, C testing a treatment sequentially) usually lead to:
- Improper reuse or pooling of -values.
- Inflated Type I error (p-hacking).
E-values solve this: test martingales allow continual evidence monitoring without inflating error.

Construction Example

Place a prior over the alternative .
Define an e-variable via likelihood ratio or Bayes factor-like object (e.g., test supermartingale).
Often of the form: where is under the alternative, under the null.

Connections

Bayes Factors: a type of e-variable when null and alternative are fully specified.
Test Martingales: a sequence where each is an e-variable, and is a non-negative martingale under .
FDR Control: e-Benjamini–Hochberg (e-BH) provides valid FDR procedures using e-values.

Notable Contributors

Peter Grünwald: "E is the new P" — formalizes safe testing under optional stopping.
Aditya Ramdas: key work on sequential testing, e-BH, martingales.
Ruodu Wang: connects e-values to game-theoretic probability and robust inference.
Brian Scheffer: applied presentations and tutorials.

Applications

Sequential A/B testing.
Automated hypothesis validation (e.g., POPPER framework).
High-dimensional inference (e.g., genomics).
Robust meta-analysis.

You can connnect Knockoffs and e-values. You can define e-values based on knockoff statistics. Pull them out of the hat. You can derandomise knockoff procedure by taking the averages.