The PEARSON function in Excel is a statistical tool designed to calculate the Pearson product-moment correlation coefficient. This coefficient measures the strength and direction of the linear relationship between two sets of data. It’s commonly used in statistics to identify how closely related two variables are, which can inform decisions in various fields such as finance, research, and analytics.
Syntax
PEARSON(array1, array2)
- array1: The first range of data points or array.
- array2: The second range of data points or array being compared against the first.
Example #1
=PEARSON(A1:A10, B1:B10)
This function calculates the Pearson correlation coefficient for the datasets in ranges A1 to A10 and B1 to B10. For example, if A1 to A10 represents scores in a test and B1 to B10 represents corresponding hours studied, the result might be 0.85, indicating a strong positive correlation.
Example #2
=PEARSON(D1:D5, E1:E5)
This calculation compares two small datasets located in D1 to D5 and E1 to E5. If the data represent sales figures and marketing expenditure, a result of -0.60 suggests a negative correlation, indicating that higher marketing expenses may relate to lower sales figures in this sample.
Example #3
=PEARSON(C1:C15, F1:F15)
This example assesses two larger datasets in columns C and F. For instance, if C values are customer satisfaction ratings and F values are product return rates, a result of 0.20 could suggest a weak positive correlation between satisfaction and returns.
Error handling
- N/A: This error occurs if the data arrays contain different numbers of data points, making correlation calculation impossible.
- DIV/0!: This error indicates that there is no variation in one of the arrays, leading to division by zero.
- VALUE!: This message appears if the input data values are non-numeric, which is invalid in statistical calculations.