What is the Pearson correlation coefficient formula?

Pearson r = Σ((xi − x̄)(yi − ȳ)) ÷ √(Σ(xi − x̄)² × Σ(yi − ȳ)²). r ranges from −1 to +1. r = +1 means perfect positive linear relationship; r = −1 means perfect negative; r = 0 means no linear relationship. The formula measures how closely the data cluster around a straight line. It's not robust to outliers and only catches linear relationships — use Spearman for monotonic non-linear data.

How do I interpret correlation strength?

Cohen's rough guidelines: |r| ≥ 0.7 strong; 0.5 ≤ |r| < 0.7 moderate; 0.3 ≤ |r| < 0.5 weak; |r| < 0.3 negligible. But context matters — in physics experiments, r = 0.9 is mediocre; in social science, r = 0.4 can be meaningful. Always look at the scatter plot alongside the number — anscombe's quartet famously shows four very different datasets with identical r ≈ 0.82.

Does correlation imply causation?

No — correlation does NOT imply causation. Two variables can correlate strongly for many reasons: (1) one causes the other, (2) both are caused by a third variable, (3) reverse causation, (4) coincidence. Ice cream sales and drowning deaths correlate strongly — both peak in summer, but ice cream doesn't cause drowning. To establish causation you need controlled experiments, longitudinal studies, or natural experiments with clear directional logic.

How do I calculate the p-value for correlation?

For Pearson r with n observations, compute t = r × √(n−2) ÷ √(1 − r²) which follows a t-distribution with df = n − 2. Then look up the p-value from a t-distribution table or use software. For r = 0.5 with n = 30: t = 0.5 × √28 ÷ √(0.75) = 2.65 ÷ 0.866 = 3.06. df = 28. p ≈ 0.005 — statistically significant at α = 0.05 by a comfortable margin.

Step-by-step solution

The Correlation Coefficient Calculator above generates a step-by-step solution panel for every calculation it runs. After you enter your data or summary statistics, the panel lists each formula step, intermediate value, and final result for the Pearson r. This makes it easy to check homework or verify the answer against software like R, Excel, SPSS, or a graphing calculator.

The Correlation Coefficient Calculator also produces a chart data table that lists the values plotted on the visualisation, with the underlying numbers used to draw each point or bar. You can copy the table to clipboard or download it as CSV for inclusion in a report. The chart itself is rendered as SVG and can be exported as PNG with the Download PNG button below the calculator.

Summary: This correlation coefficient calculator computes Pearson, Spearman and p-value with live steps, formulas and a chart. It accepts labeled numeric inputs, works offline through file:// and includes source-backed explanations for students, analysts and researchers.

Correlation Coefficient Calculator

Q: What's the difference between Pearson and Spearman correlation?

Pearson measures linear correlation between two continuous variables and assumes normal distributions. Spearman (ρ) measures monotonic correlation (data moves in same direction even if not linearly), works with ordinal data, and is robust to outliers. Use Pearson for height vs weight in adults; use Spearman for satisfaction rating vs years of service (ordinal scale, non-linear relationships likely).

Q: Does correlation imply causation?

No — correlation does NOT imply causation. Two variables can correlate strongly for many reasons: (1) one causes the other, (2) both are caused by a third variable, (3) reverse causation, (4) coincidence. Ice cream sales and drowning deaths correlate strongly — both peak in summer, but ice cream doesn't cause drowning. To establish causation you need controlled experiments, longitudinal studies, or natural experiments with clear directional logic.

Q: How do I calculate the p-value for correlation?

For Pearson r with n observations, compute t = r × √(n−2) ÷ √(1 − r²) which follows a t-distribution with df = n − 2. Then look up the p-value from a t-distribution table or use software. For r = 0.5 with n = 30: t = 0.5 × √28 ÷ √(0.75) = 2.65 ÷ 0.866 = 3.06. df = 28. p ≈ 0.005 — statistically significant at α = 0.05 by a comfortable margin.

Editorially reviewed Reviewed by Agarapu Ramesh, science educator (chemistry). LinkedIn

Last reviewed: May 2026 | Standard statistical formulas

Correlation coefficient calculator for Pearson r, Spearman rho, t test, p-value, strength interpretation and scatter plot. The calculator works offline, updates instantly and includes a worked example, plain-text formula, MathML, references and structured data.

Statistics tool

X values

Y values

Default values are loaded. Click any field and edit it; results and chart update automatically.

Default example loadedx=1,2,3,4,5 and y=2,4,5,4,5. Change any value above to test your own data.

Result: -

Correlation Coefficient Calculator Quick Reference

Input or setting	Result or interpretation	Use this when
absolute r below .3	weak association	association strength
absolute r .3 to .5	moderate association	association strength
absolute r above .7	very strong association	association strength

How to Use This Correlation Coefficient Calculator

Choose one calculator from the dropdown, such as Standard Deviation Calculator or Linear Regression Calculator.
Paste raw data into textarea fields or enter summary statistics in number fields.
Review the headline result, supporting metrics, step-by-step solution and SVG visualization.
Use the example button to compare against a known worked example from the reference table.
Copy the result or export the visible output as CSV or PNG for notes and reports.
Read the interpretation, pitfalls, glossary and references before making research decisions, especially when assumptions or tails affect the answer.

Correlation Coefficient Calculator Formula

Primary formular = sum((x_i-mean_x)(y_i-mean_y)) / sqrt(sum((x_i-mean_x)^2) * sum((y_i-mean_y)^2))

Plain-English meaning

Pearson r measures linear association from -1 to 1. Spearman rho applies Pearson correlation to ranks.

Example

x=[1,2,3,4,5], y=[2,4,5,4,5]

r about 0.775; t about 2.121; p about 0.124

This page uses the shared statistics core for distribution functions, quantiles and exact integer counting where needed. The formula is shown in plain text so screen readers and search engines can parse it reliably.

result = linear association

Correlation Coefficient Calculator Worked Example

Use Load example in the calculator to reproduce this reference result.

{
  "tool": "Correlation Coefficient Calculator",
  "input": "x=[1,2,3,4,5], y=[2,4,5,4,5]",
  "output": "r about 0.775; t about 2.121; p about 0.124",
  "formula": "r = sum((x_i-mean_x)(y_i-mean_y)) / sqrt(sum((x_i-mean_x)^2) * sum((y_i-mean_y)^2))"
}

Calculator	Example input	Expected output
Correlation Coefficient Calculator	x=[1,2,3,4,5], y=[2,4,5,4,5]	r about 0.775; t about 2.121; p about 0.124

Interpretation Guide

What does p = 0.03 mean? If the null hypothesis and model assumptions were true, a result at least this extreme would occur about 3% of the time. The American Statistical Association cautions that a p-value alone does not measure effect size, practical importance or the probability that Hâ‚€ is true.³

For most classroom and professional reports, pair the calculator result with the question you are answering. A mean or median summarizes location, but spread explains consistency. A confidence interval estimates plausible values, while a hypothesis test evaluates compatibility with a null model. Regression and correlation describe association, so they should be reported with a chart and residual or outlier review. When a result is statistically significant, still ask whether the effect is large enough to matter in the real setting.

Statistic	Small	Medium	Large	Use
Cohen's d	0.2	0.5	0.8	t-test effect size
CramÃ©r's V	0.1	0.3	0.5	chi-square association
\|r\|	0.10	0.30	0.50	correlation strength
RÂ²	0.01	0.09	0.25	variance explained

Pro Tips and Common Pitfalls

Use sample standard deviation for sampled data and population standard deviation only when the dataset is complete.
Choose the correct tail for p-values before looking at the result.
Correlation does not imply causation; inspect design, confounders and timing.
Check t-test assumptions: independence, roughly normal differences or means, comparable measurement scales and clear sampling design.
Round final results for reporting, but avoid rounding intermediate values during calculation or when comparing software output.
This calculator is for educational purposes; for formal research, verify with peer-reviewed software.

Correlation Coefficient Calculator FAQ

How do I calculate Pearson correlation coefficient?

The conceptual formula is r = Î£(xi âˆ’ xÌ„)(yi âˆ’ È³) / âˆš[Î£(xi âˆ’ xÌ„)Â² Ã— Î£(yi âˆ’ È³)Â²]. In words: you measure how x and y vary together (the numerator), then standardise by how much each varies on its own (the denominator). The result lands between âˆ’1 and +1. You need paired data points â€” each x must have a matching y from the same observation. Manually it's tedious for big datasets; calculators and Excel's CORREL function handle it instantly. Just make sure your x and y lists have equal length.

What is the difference between Pearson and Spearman correlation?

Pearson measures the strength of a linear relationship between two continuous variables, and it assumes roughly normal distributions. Spearman ranks the data first and then computes Pearson on those ranks, which means it captures any monotonic relationship â€” even curved ones â€” as long as y consistently increases or decreases with x. Use Pearson for clean linear data; use Spearman for ordinal data, when outliers are present, or when the relationship looks more like a curve than a straight line. They often give similar values but disagree when the link isn't strictly linear.

How do I interpret a negative correlation coefficient?

A negative r means the two variables move in opposite directions â€” as x rises, y tends to fall, and vice versa. The closer r is to âˆ’1, the stronger and more consistent that inverse relationship. For example, hours spent watching TV and exam scores might show r â‰ˆ âˆ’0.6, suggesting more screen time pairs with lower marks. Remember: correlation describes association, not causation. A strong negative r doesn't prove TV causes bad grades â€” there could be a third factor (study time, sleep) driving both.

How do I find p-value for a correlation coefficient?

Convert r to a t statistic using t = râˆš(n âˆ’ 2) / âˆš(1 âˆ’ rÂ²), with degrees of freedom df = n âˆ’ 2. Then look up the two-tailed p-value from the t distribution. Example: r = 0.5 with n = 30 gives t = 0.5 Ã— âˆš28 / âˆš0.75 â‰ˆ 3.05, df = 28. That t-value corresponds to p â‰ˆ 0.005, which is statistically significant. Most software (R's cor.test, Python's scipy.stats.pearsonr) returns this p-value automatically. Larger samples make even small r values significant, so context matters.

What does r squared mean in correlation?

RÂ² is just r squared, but its meaning is far more useful. It tells you the proportion of variance in y that's explained by its linear relationship with x. So if r = 0.8, then RÂ² = 0.64, meaning about 64% of the variation in y is accounted for by x. The remaining 36% is unexplained â€” due to other factors or noise. Always between 0 and 1, RÂ² makes interpretation more concrete than r alone. Just don't confuse high RÂ² with a good model: a flawed regression can still have high RÂ².

How do I calculate correlation from two data sets?

Make sure your two datasets are paired â€” each x value matches a specific y value from the same observation, person, or time point. Both lists must be the same length. Drop any row where one value is missing. Then plug them into the Pearson formula, or use a calculator that accepts two columns. In Excel: =CORREL(A1:A20, B1:B20). The order of the lists doesn't change r â€” correlation is symmetric. If your data are ranks or you suspect outliers, run Spearman as well and compare the two values.

When should I use Spearman rank correlation?

Reach for Spearman in three situations: when your data are ordinal (rankings, satisfaction scores), when the relationship between x and y is monotonic but not linear, or when outliers are skewing your Pearson result. Because Spearman works on ranks, extreme values lose their disproportionate pull. It also handles non-normal distributions gracefully. Example: relating exam rank with study-hour rank for a class of 25. If you're not sure which to use, run both â€” close values mean the relationship is fairly linear, big differences hint at non-linearity or outliers.

How do I know if a correlation is significant?

Look at the p-value associated with your r. If p is below your chosen alpha (typically 0.05), the correlation is statistically significant â€” meaning the observed link is unlikely to be due to random sampling alone. Sample size plays a huge role: with n = 1000, even r = 0.07 can be significant; with n = 10, you need r â‰ˆ 0.63 or higher. Also check the confidence interval for r â€” if it crosses zero, the correlation isn't significant. And remember: significant doesn't necessarily mean strong or practically important.

Correlation Coefficient Calculator Glossary

Pearson r: A measure of linear association from -1 to 1.
Spearman rho: Correlation of ranked values, useful for monotonic relationships.
Scatter plot: A graph of paired x and y values.
Positive correlation: Higher x values tend to occur with higher y values.
Negative correlation: Higher x values tend to occur with lower y values.
Outlier: An unusual point that can strongly affect correlation.

References and Sources

NIST/SEMATECH e-Handbook of Statistical Methods, descriptive statistics, uncertainty and modeling formulas.
OpenStax Introductory Statistics, definitions for inference, probability and summary statistics.
ASA Statement on p-values, Wasserstein and Lazar, 2016.
R stats package documentation, t.test, cor, quantile and distribution conventions.

Correlation Coefficient Calculator

Correlation Coefficient Calculator Quick Reference

How to Use This Correlation Coefficient Calculator

Correlation Coefficient Calculator Formula

Correlation Coefficient Calculator Worked Example

Interpretation Guide

Pro Tips and Common Pitfalls

Correlation Coefficient Calculator FAQ

Correlation Coefficient Calculator Glossary

Related Statistics Calculators

References and Sources