// FOUNDER A/B TEST REALITY CHECK

Do you have enough traffic to run this experiment?

Most founder A/B tests never reach significance. Plug in your numbers — find out whether this test will finish in days, weeks, or never.

Free A/B Test Sample Size Calculator

Enter your baseline conversion rate, the smallest lift worth caring about, and your monthly traffic. The calculator returns the sample size per variant, the time to significance, and a plain-English verdict on whether this test will actually finish.

Your numbers

Baseline conversion rate

The percentage of visitors who convert on this page today.

Smallest win worth caring about

The smallest improvement you'd actually act on, expressed as a relative lift. +20% means going from 5% to 6%, not from 5% to 25%.

Monthly visitors

Total monthly traffic to the page you're testing. The calculator assumes it is split evenly between the two versions.

Feasibility map

Where your test lands on the cost curve. The dot marks the cell closest to your inputs; click any cell to inspect a different scenario.

Selected cell

Baseline 5% → detect +20% relative lift

Samples / variant: 8,146

At 10k/mo: 1.6 months

< 14d

< 60d

60-180d

> 180d
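The readout and legend above can be reproduced with a few lines of arithmetic. This is a minimal sketch, not the calculator's actual code: the function names are hypothetical, the 30-day month is an assumption (the page does not say how it converts months to days), and so is the handling of values that fall exactly on a bucket boundary.

```python
DAYS_PER_MONTH = 30  # assumption: the page does not state its month-to-day conversion


def days_to_finish(samples_per_variant: int, monthly_visitors: int) -> float:
    """Days needed to fill both arms when traffic is split evenly."""
    total_needed = 2 * samples_per_variant  # control + one variant
    visitors_per_day = monthly_visitors / DAYS_PER_MONTH
    return total_needed / visitors_per_day


def feasibility_bucket(days: float) -> str:
    """Map a duration onto the four legend labels (boundary handling is assumed)."""
    if days < 14:
        return "< 14d"
    if days < 60:
        return "< 60d"
    if days <= 180:
        return "60-180d"
    return "> 180d"


days = days_to_finish(8146, 10_000)
print(f"{days / DAYS_PER_MONTH:.1f} months, bucket {feasibility_bucket(days)}")
```

For the selected cell (8,146 samples per variant at 10k visitors a month), this gives about 1.6 months, which lands in the "< 60d" bucket.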

How the math works

Four numbers decide every A/B test.

Baseline conversion rate (p₁). What the page does today. Lower baselines need many more samples to detect the same relative lift: halving the baseline roughly doubles the required sample.
Minimum detectable effect (MDE). The smallest relative improvement you want to be able to call. Required samples grow with roughly 1/MDE², so halving the MDE roughly quadruples the sample size.
Significance level (α = 0.05). Tolerance for false positives — the industry-standard 95% confidence is locked in here.
Statistical power (1−β = 0.80). The probability you actually catch a real effect. 80% is conventional and locked in here.
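The two locked-in settings enter the math as z-scores of the standard normal distribution. The sketch below shows one way to derive them with Python's standard library, assuming a two-sided test (which the Z_α/2 term in the formula implies). The helper name `z_scores` is hypothetical.

```python
from statistics import NormalDist

ALPHA = 0.05  # significance level fixed by the calculator
POWER = 0.80  # statistical power (1 - beta) fixed by the calculator


def z_scores(alpha: float = ALPHA, power: float = POWER) -> tuple[float, float]:
    """Return (z_alpha_over_2, z_beta) for a two-sided test."""
    std_normal = NormalDist()  # mean 0, standard deviation 1
    z_alpha = std_normal.inv_cdf(1 - alpha / 2)
    z_beta = std_normal.inv_cdf(power)
    return z_alpha, z_beta


z_alpha, z_beta = z_scores()
multiplier = (z_alpha + z_beta) ** 2
print(f"z_alpha/2 = {z_alpha:.2f}, z_beta = {z_beta:.2f}, multiplier = {multiplier:.2f}")
```

With unrounded z-scores the multiplier comes out near 7.85. Rounding the z-scores to 1.96 and 0.84 first gives exactly 7.84.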

n = (Z_α/2 + Z_β)² × (p₁(1−p₁) + p₂(1−p₂)) / (p₂ − p₁)²

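Here p₂ = p₁ × (1 + MDE) is the conversion rate you hope the variant reaches. At 95% confidence and 80% power, Z_α/2 ≈ 1.96 and Z_β ≈ 0.84, so (Z_α/2 + Z_β)² ≈ 7.84. Days to significance assumes you split traffic evenly between control and one variant.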
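As a rough check on the formula, here is a minimal Python sketch. The function name and input validation are illustrative assumptions, and the z-scores are the rounded values 1.96 and 0.84 so that the multiplier equals 7.84.

```python
import math

Z_ALPHA_HALF = 1.96  # two-sided alpha = 0.05 (rounded)
Z_BETA = 0.84        # power = 0.80 (rounded)


def sample_size_per_variant(baseline: float, relative_lift: float) -> int:
    """Samples needed in each arm to detect a relative lift over the baseline."""
    if not 0 < baseline < 1:
        raise ValueError("baseline must be a rate between 0 and 1")
    if relative_lift <= 0:
        raise ValueError("relative_lift must be positive")
    p1 = baseline
    p2 = baseline * (1 + relative_lift)  # e.g. 5% with +20% becomes 6%
    if p2 >= 1:
        raise ValueError("lifted rate must stay below 1")
    variance_sum = p1 * (1 - p1) + p2 * (1 - p2)
    n = (Z_ALPHA_HALF + Z_BETA) ** 2 * variance_sum / (p2 - p1) ** 2
    return math.ceil(n)  # round up: a fraction of a visitor still needs a visitor


print(sample_size_per_variant(0.05, 0.20))  # 8146, matching the selected cell
```

Calling `sample_size_per_variant(0.025, 0.20)` returns 16770, which shows how halving the baseline roughly doubles the requirement.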

What this calculator does not model: Bayesian priors, sequential tests, more than two variants, or unequal traffic splits. For all of those, the math shifts — start with the field guide before you commit to a non-standard design.

Run this experiment in Xi.

A calculator tells you whether the math works. Xi runs the experiment: lock the hypothesis and kill threshold up front, track the metric automatically, and let agents call the verdict so you don’t drift into endless “just one more week.”

Run your first experiment

Read the CRO field guide