e-value
Snippet
e-values vs p-values
The key is to constructing e-values well.
- A p-variable for testing : a random variable satisfying Realized values = p-values.
- An e-variable for testing : a non-negative random variable such that Realized values = e-values.
Advantages of E-values
- Safe under optional stopping (cf. time-to-event data).
- Naturally suited for sequential testing, online inference, and gradual evidence accumulation.
- Easily combined across studies or time (e.g., via multiplication).
- Useful in high-dimensional and composite/irregular models, where computing exact -values is hard.
- No need for fixed sample size or pre-specified analysis plans.
Weaknesses
- Generally less powerful than Neyman–Pearson (fixed-) tests.
- Require more data to reach traditional thresholds like 0.05-equivalent.
Optional Continuation Problem & p-hacking
- Repeated trials (e.g., groups A, B, C testing a treatment sequentially) usually lead to:
- Improper reuse or pooling of -values.
- Inflated Type I error (p-hacking).
- E-values solve this: test martingales allow continual evidence monitoring without inflating error.
Construction Example
- Place a prior over the alternative .
- Define an e-variable via likelihood ratio or Bayes factor-like object (e.g., test supermartingale).
- Often of the form: where is under the alternative, under the null.
Connections
- Bayes Factors: a type of e-variable when null and alternative are fully specified.
- Test Martingales: a sequence where each is an e-variable, and is a non-negative martingale under .
- FDR Control: e-Benjamini–Hochberg (e-BH) provides valid FDR procedures using e-values.
Notable Contributors
- Peter Grünwald: "E is the new P" — formalizes safe testing under optional stopping.
- Aditya Ramdas: key work on sequential testing, e-BH, martingales.
- Ruodu Wang: connects e-values to game-theoretic probability and robust inference.
- Brian Scheffer: applied presentations and tutorials.
Applications
- Sequential A/B testing.
- Automated hypothesis validation (e.g., POPPER framework).
- High-dimensional inference (e.g., genomics).
- Robust meta-analysis.
You can connnect Knockoffs and e-values. You can define e-values based on knockoff statistics. Pull them out of the hat. You can derandomise knockoff procedure by taking the averages.