Advertisements

Correlation in Excel Data Analysis

Correlation in Excel Data Analysis is quite helpful for students who study statistics and want to analyze data to find correlation matrix between more than two variables. In this section, we will learn Multiple Correlation Excel. Multiple correlation is used to predict a given variable based on a linear function of a set of other variables. When we have two variables (X, Y), one (X) predicts the other (Y), we call the association asย simple correlation. When we use more than two variables (X, Y, Z, ..N), in correlation analysis, we call it multiple correlation. It means that if X and Z predict Y, we are talking about multiple correlation.

In this section:

Advertisements

  1. Analysis Tool Pack in Excel.
  2. How to Calculate Multiple Correlation in Excel Data Analysis?
  3. How to Interpret Correlation Coefficients in Multiple Correlation?
  4. Practice Problems on Correlation in Excel Data Analysis.
  5. More Related Readings on Correlation and Regression Analysis in Excel.

Correlation in Excel Data Analysis

1. Analysis Tool Pack in Excel:

When we need to develop complex statistical or engineering analyses, the use Excel Analysis ToolPack saves steps and time. As it is in most of the Excel functions, users provide the data and parameters for each analysis, and the ToolPack performs analysis and displays results in an output table. Some tools generates more outputs such as charts and graphs.

Check whether you already have the ToolPack installed in your Excel:

To check whether you have already installed the Analysis ToolPack, follow the steps below:

Click on the Data Tab (as in 1 in the image below)โ€“> then look at the top right corner and check whether you can see the Data Analysis ToolPack just below the Solver. In our example image below, we do not have the Data Analysis ToolPack.

in short:ย Data Tabย โ€“> Look at theย top-right cornerย of the Ribbon.

Excel Analysis Tool Pack

How to install Analysis ToolPack:

To install the Data Analysis ToolPack, follow the steps below:

Step 1:ย Click onย File Tabโ€“> then click onย Options. Now you see theย Excel Optionsย dialogue box as follows.

Install Excel Analysis Tool Pack

Step 2:ย Clickย Add-insย (as 1 in the image below)โ€“> Click onย Goย (2 in image)โ€“> clickย OKย (3 in image).

Advertisements
Install Excel Analysis Tool Pack

After clicking OK, you can see the followingย Add-insย dialogue box, and see thatย Analysis ToolPackย is unchecked.

Excel Analysis Tool Pack Add-ins

Step 3:ย Check all the uncheckedย Add-insย available as shown in the image below:

Excel Analysis Tool Pack Add-ins

Step 4:ย Finally, you can now check your Analysis ToolPack. Click onย Data Tabย โ€“> Look at theย Top-rightย corner of the Ribbon. You can now see theย Data Analysisย just below Solver (as in image below)

Excel Analysis Tool Pack Add-ins

2. How to Calculate Multiple Correlation in Excel:

To calculate multiple correlation coefficients, we will use the company data file, which can be downloaded from the Download button below. Follow the steps below:

Step 1: Start the Tool Pack:

Clickย Dataย (1) โ€“>ย Data Analysisย (2) โ€“>ย Correlationย (3) โ€“>ย OKย (4)

Excel Correlation matrix

Step 2: After clicking OK, you will see the Correlation dialogue box as follows.

Correlation in Excel Data Analysis

Step 3: Complete the following actions:

  1. Input Range: In our example data set, we select D1:O2307.
  2. Grouped By: Column
  3. Check Labels in the First Row
  4. Select New Worksheet Ply as Output options.
  5. Click OK.
Correlation in Excel Data Analysis

After clicking OK, you will see that a correlation table is created in a separate worksheet as follows:

Excel Correlation multiple Correlation matrix

Step 4: Format the Correlation Table:

After having the correlation table, you should format theย decimal pointsย for better visibility, and you may also like to change the fontย typeย andย color. For example, to set the decimal point to 3 digit, (1)ย Selectย the range, (2) Click onย Numberย , (3) Selectย 3ย decimal places, (4) clickย OK.

Advertisements
Correlation in Excel Data Analysis

The following table is much better than the previous one.

Correlation in Excel Data Analysis

3. How to Interpret Correlation Coefficients in Multiple Correlation?

Suppose, we have two variablesโ€“Dependent variable (Y) and Independent variable (X). The multiple correlation coefficient is used in multiple regression analysis to assess the quality of the prediction of the dependent variable (Y). It corresponds to the squared correlation between the predicted and the actual values of the dependent variable. It can also be interpreted as the proportion of the variance of the variable Y explained by the variable X.

The correlation coefficient indicates the direction of the relationship and the quality of the relationship. The closer the correlation coefficient to 1 or -1, the stronger the relationship. For example, if the correlation coefficient is 0.90, it means a strong positive relation whereas the correlation coefficient of 0.10 indicates a weak positive relation between two variables.

Example 1:ย We want to know the strength of relationship between Net Income (NI) and Closing Price of Stock (PRCC_F):

Explanation:ย From the table below, we can see the correlation coefficient ofย 0.40ย betweenย NIย andย PRCC_F. Thisย 0.40ย has two aspects:

  1. The relation between Net Income and Closing Price is POSITIVE.
  2. If Net Income increases byย $1.00, then the Closing Price is more likely to increase byย $0.40.
Correlation in Excel Data analysis

4. Practice Problems:

Mr. Robert Gary, the CEO of AMEX Corporation, wants to investigate the relationship between Net Income and the following variables:

(I) ACT (II) AP (III)AT (IV) CHE (V) COGS (VI) DP (VII) SALE (VIII) TXT (IX) MKTVALT and (X) PRCC_F.

where,

TermDefinition
ACTCurrent Assets-Total
APAccounts Payable
ATTotal Assets
CHECash and Short-term Investment
COGSCost of Goods Sold
DPDepreciation and Amortization
NINet Income (Loss)
SALESales/Turnover
TXTTotal Income Taxes
MKTVALTTotal market value
PRCC_FClosing Price

Requirements:

(I) You are required to prepare the correlation table by using the Excel Analysis ToolPack.

Advertisements

(II) You are required to interpret the correlation coefficients between Net Income and other variables one by one.

[To answer the questions, you are required to use the correlation data]

  1. Excel CORREL Function
  2. Excel CORREL Function MS Office Post
  3. Multiple Correlation in Excel Analysis Tool Pack
  4. Multiple Regression in Excel
  5. Predict a Variable based on other Variables in Excel Regression

4 Responses

Leave a Reply

Your email address will not be published. Required fields are marked *

Advertisements