Skip to main content
P-Value

P-Value Calculator

Calculate p-value from a z-score or t-statistic for one-tailed and two-tailed hypothesis tests.

Enter a Z-score to calculate the p-value

How It Works

The p-value calculator converts a Z-score (a standard normal test statistic) into its one-tailed and two-tailed p-values and tells you whether the result is statistically significant at the common 95% and 99% confidence levels. The p-value is the probability of observing a test statistic at least as extreme as the one you got, assuming the null hypothesis is true. A small p-value means your data would be surprising under the null hypothesis, which is treated as evidence against it. This tool is for students taking statistics courses, researchers and analysts running hypothesis tests, and anyone who has computed a Z-score and needs to translate it into a probability they can compare against a significance threshold.

What the p-value actually measures

The null hypothesis (Hโ‚€) is the default position of "no effect" or "no difference" โ€” for example, "the new method changes nothing". You collect data, compute a test statistic, and ask: if Hโ‚€ really were true, how often would chance alone produce a result this extreme or more so? That probability is the p-value. It is not the probability that the null hypothesis is true, and it is not the probability that your finding happened by chance โ€” it is a statement about the data given the assumption of Hโ‚€. The calculation here uses the standard normal distribution: the calculator evaluates the cumulative distribution function ฮฆ at your Z-score, takes the smaller tail as the one-tailed p-value, and doubles it for the two-tailed p-value.

Significance and ฮฑ = 0.05

Before testing you choose a significance level, ฮฑ โ€” most commonly ฮฑ = 0.05, corresponding to 95% confidence. The rule is simple: if the p-value is less than ฮฑ, you reject the null hypothesis and call the result statistically significant; if it is greater than or equal to ฮฑ, you fail to reject Hโ‚€. This calculator flags significance at 95% (p < 0.05) and at 99% (p < 0.01) automatically. Stricter fields use ฮฑ = 0.01 or even ฮฑ = 0.001 (0.1%) to guard against false positives; the threshold should be fixed in advance based on how costly a wrong conclusion would be.

One-tailed vs. two-tailed

One-tailed (one-sided): tests whether the parameter is specifically greater than (or specifically less than) a value, using just one tail of the distribution. Use it only when you have a strong, pre-registered reason to expect the effect in one direction.

Two-tailed (two-sided): tests whether the parameter merely differs from a value in either direction, so it counts both tails โ€” which is why its p-value is twice the one-tailed value. It is more conservative and is the default choice in most research because it does not presume which way the effect will go.

Z-tests and t-tests

A Z-score is the right statistic when the population standard deviation is known or the sample is large. For small samples with an estimated standard deviation, the t-distribution is technically correct: it has heavier tails, so for the same statistic a t-test gives a slightly larger (more cautious) p-value. As the sample size grows beyond roughly n = 30 the t-distribution converges on the normal curve, so for large samples the Z-based p-values this calculator returns are very close to the t-test result.

Worked example

Enter a Z-score of 1.96. The standard normal curve places about 2.5% of its area beyond +1.96 in the upper tail, so the one-tailed p-value is roughly 0.025, and the two-tailed p-value is about 2 ร— 0.025 = 0.05. That is the famous boundary: a Z of 1.96 sits right at the edge of significance at the 5% level. A larger Z, say 2.58, pushes the two-tailed p-value down to about 0.0099 โ€” significant at both the 95% and 99% levels โ€” while a Z of 0 gives a two-tailed p-value of 1, meaning no evidence against Hโ‚€ whatsoever.

Tips and common mistakes

Tips: the sign of the Z-score does not affect the two-tailed p-value, since both tails are symmetric; |z| is what matters. Decide one- vs two-tailed and your ฮฑ before you look at the data, not after.

Common mistakes: a low p-value is not the same as a large or important effect โ€” with a big enough sample even a trivial difference can be "significant", so always read effect size alongside it. Equally, p โ‰ฅ 0.05 does not prove the null hypothesis is true; it only means you lack the evidence to reject it. And switching to a one-tailed test after seeing the data simply to squeeze under 0.05 is a misuse that inflates false positives.

Frequently Asked Questions

A p-value of 0.03 means there is a 3% probability of obtaining results as extreme as yours (or more extreme) if the null hypothesis were true. Since 0.03 < 0.05, the result is statistically significant at the 5% level โ€” you would reject the null hypothesis.

Part of Statistics & Probability Calculators โ€” compare every related calculator in one place.