Reporting Multiple Regressions in APA format – Part One

So this is going to be a very different post from anything I have put up before. I am writing this because I have just spent the best part of two weeks trying to find the answer myself without much luck. Sure, I came across the odd bit of advice here and there and was able to work a lot of it out, but so many of the websites on this topic leave out a bucketload of information, making it difficult to know what they are actually going on about. So after two weeks of wading through websites and textbooks and having multiple meetings with my university supervisors, I thought I would take the time to write up some instructions on how to report multiple regressions in APA format so that the next poor sap who has this issue doesn’t have to waste all the time I did. If you have no interest in statistics then I recommend you skip the rest of this post.

Ok let’s start with some data. Here is some that I pulled off the internet that will serve our purposes nicely. Here we have a list of sales people, along with their IQ level, their extroversion level and the total amount of money they made in sales this week. We want to see if IQ level and extroversion level can be used to predict the amount of money made in a week.

Now I am not going to show you how to enter the data into SPSS, if you don’t know how to do that I recommend you find out first and then come back. However, I will show you how to calculate the regression and all of the important assumptions that go along with it.

In SPSS you need to click Analyse > Regression > Linear and you will get this box, or one very much like it depending on your version of SPSS, come up.

The first thing to do is move your Dependent Variable, in this case Sales Per Week, into the Dependent box. Next move the two Independent Variables, IQ Score and Extroversion, into the Independent(s) box. We are going to use the Enter method for this data, so leave the Method dropdown list on its default setting. We now need to make sure that we also test for the various assumptions of a multiple regression to make sure our data is suitable for this type of analysis. There are seven main assumptions when it comes to multiple regressions and we will go through each of them in turn, as well as how to write them up in your results section. These assumptions deal with outliers, collinearity of data, independent errors, random normal distribution of errors, homoscedasticity & linearity of data, and non-zero variances. But before we look at how to understand this information let’s first set SPSS up to report it.

Note: If your data fails any of these assumptions then you will need to investigate why and whether a multiple regression is really the best way to analyse it. Information on how to do this is beyond the scope of this post.

On the Linear Regression screen you will see a button labelled Save. Click this and then tick the Standardized check box under the Residuals heading. This will allow us to check for outliers. Click Continue and then click the Statistics button.

Tick the box marked Collinearity diagnostics. This, unsurprisingly, will give us information on whether the data meets the assumption of collinearity. Under the Residuals heading also tick the Durbin-Watson check box. This will allow us to check for independent errors. Click Continue and then click the Plots button.

Move the option *ZPRED into the X axis box, and the option *ZRESID into the Y axis box. Then, under the Standardized Residual Plots heading, tick both the Histogram box and the Normal probability plot box. This will allow you to check for random normally distributed errors, homoscedasticity and linearity of data. Click Continue. As the assumption of non-zero variances is tested on a different screen, I will leave explaining how to carry that out until we get to it. For now, click OK to run the tests.

Outliers

The first thing we need to check for is outliers. If we have any they will need to be dealt with before we can analyse the rest of the results. Scroll through your results until you find the box headed Residual Statistics.

Look at the Minimum and Maximum values next to the Std. Residual (Standardised Residual) subheading. If the minimum value is equal to or below -3.29, or the maximum value is equal to or above 3.29, then you have outliers. Now as you can see in this example data we don’t have any outliers, but if you do, here is what you need to do. Go back to your main data screen and you will see that SPSS has added a new column of numbers titled ZRE_1. This contains the standardised residual values for each of your participants. Go down the list and if you find any values equal to or above 3.29, or equal to or below -3.29, then that participant is an outlier and needs to be removed.

Once you have done this you will need to analyse your data again, in the same way described above, to make sure you have fixed the issue. You may find that you have new outliers when you do this and these too will need to be dealt with. In my recent experiment I had to run the check for outliers six times before I got them all and the standardised residual values all fell between -3.29 and 3.29. When it comes to writing this up, what you put depends on what results you got, but something along the lines of one of these sentences will do.

An analysis of standard residuals was carried out on the data to identify any outliers, which indicated that participants 8 and 16 needed to be removed.

An analysis of standard residuals was carried out, which showed that the data contained no outliers (Std. Residual Min = -1.90, Std. Residual Max = 1.70).
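If you would rather not scroll down the ZRE_1 column by eye, the same check can be sketched in a few lines of Python. This is only an illustration: the function names are made up for this example, the values would have to be copied out of SPSS yourself, and it assumes that row order matches participant number.

```python
def find_outliers(std_residuals, cutoff=3.29):
    """Return 1-based row numbers whose standardised residual is at or beyond the cutoff."""
    return [row for row, z in enumerate(std_residuals, start=1) if abs(z) >= cutoff]


def outlier_sentence(std_residuals, cutoff=3.29):
    """Build a write-up sentence in the style used in this post."""
    flagged = find_outliers(std_residuals, cutoff)
    if flagged:
        rows = ", ".join(str(row) for row in flagged)
        return ("An analysis of standard residuals was carried out on the data to "
                f"identify any outliers, which indicated that participants {rows} "
                "needed to be removed.")
    return ("An analysis of standard residuals was carried out, which showed that "
            "the data contained no outliers "
            f"(Std. Residual Min = {min(std_residuals):.2f}, "
            f"Std. Residual Max = {max(std_residuals):.2f}).")


# Hypothetical standardised residuals, standing in for the ZRE_1 column.
example = [-1.90, 0.35, 1.70, -0.42]
print(outlier_sentence(example))
```

Remember that after removing any flagged participants the regression has to be re-run in SPSS, because the standardised residuals change once the data changes.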

Collinearity

To see if the data meets the assumption of collinearity you need to locate the Coefficients table in your results. Here you will see the heading Collinearity Statistics, under which are two subheadings, Tolerance and VIF.

If the VIF value is greater than 10, or the Tolerance is less than 0.1, then you have concerns over multicollinearity. Otherwise, your data has met the assumption of collinearity and can be written up something like this:

Tests to see if the data met the assumption of collinearity indicated that multicollinearity was not a concern (IQ Scores, Tolerance = .96, VIF = 1.04; Extroversion, Tolerance = .96, VIF = 1.04).
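To see where these two numbers come from, here is a minimal Python sketch for the special case of exactly two predictors, as in this example. In that case Tolerance is 1 minus the squared correlation between the two predictors, and VIF is 1 divided by Tolerance, which is why both predictors show the same values in the table. The function names and the sample numbers are made up for illustration; with three or more predictors you would need the R squared from regressing each predictor on all the others, which this sketch does not do.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)


def tolerance_and_vif(x1, x2):
    """Tolerance and VIF for a model with exactly two predictors.

    Raises ZeroDivisionError if the predictors are perfectly correlated.
    """
    r = pearson_r(x1, x2)
    tolerance = 1 - r ** 2
    return tolerance, 1 / tolerance


def multicollinearity_concern(tolerance, vif):
    """Apply the rule of thumb from this post: VIF > 10 or Tolerance < 0.1."""
    return vif > 10 or tolerance < 0.1


# Hypothetical predictor scores, not the data used in this post.
iq = [1, 2, 3, 4, 5]
extroversion = [2, 1, 4, 3, 5]
tol, vif = tolerance_and_vif(iq, extroversion)
print(f"Tolerance = {tol:.2f}, VIF = {vif:.2f}")
```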

Independent Errors

To check whether your residual terms are uncorrelated you need to locate the Model Summary table and the Durbin-Watson value.

Durbin-Watson values can be anywhere between 0 and 4; what you are looking for is a value as close to 2 as you can get in order to meet the assumption of independent errors. As a rule of thumb, if the Durbin-Watson value is less than 1 or over 3 then it is counted as being significantly different from 2, and thus the assumption has not been met. If your value falls within that range, you can write it up very simply like this:

The data met the assumption of independent errors (Durbin-Watson value = 2.31).
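For the curious, the Durbin-Watson statistic is the sum of squared differences between each residual and the one before it, divided by the sum of squared residuals. The sketch below works that out in plain Python and applies the rule of thumb from this post. The function names are invented for this example, and the residuals need to be in the order the cases appear in your data file.

```python
def durbin_watson(residuals):
    """Sum of squared successive differences divided by the sum of squared residuals."""
    numerator = sum(
        (residuals[t] - residuals[t - 1]) ** 2 for t in range(1, len(residuals))
    )
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator


def independent_errors_met(dw, low=1.0, high=3.0):
    """Rule of thumb from this post: values below 1 or above 3 fail the assumption."""
    return low <= dw <= high


# Hypothetical residuals that flip sign every case, which pushes the value above 2.
dw = durbin_watson([1, -1, 1, -1])
print(f"Durbin-Watson value = {dw:.2f}, assumption met: {independent_errors_met(dw)}")
```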

Random Normally Distributed Errors & Homoscedasticity & Linearity

I’m going to deal with these three things together as all the information comes from the same place. Now it is at this point that analysing the results becomes more of an art than a science, as you need to look at some graphs and decide, pretty much for yourself, if they meet the various assumptions. We will start with the Histogram.

Now all going well this should have a nice looking normal distribution curve superimposed over a bar chart of your data. If you do then this means that your data has met the assumption of normally distributed residuals. However if you see something like the image below then you have problems.

Next you need to look at the Normal P-P Plot of Regression Standardized Residual, and yes I am aware that it says Observed Cum Prob on it and that this is highly amusing. That aside, this basically tells you the same thing as the Histogram, just in a different way.

What you are looking for is for the dots to be on, or close to, the line running diagonally across the screen. If it looks something like the image below then again you have problems.

When it comes to writing this information up you pretty much just have to describe what the two graphs look like. Something like this:

The histogram of standardised residuals indicated that the data contained approximately normally distributed errors, as did the normal P-P plot of standardised residuals, which showed points that were not completely on the line, but close.
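If it helps to understand what the P-P plot is actually drawing, here is a rough Python sketch. Each dot pairs an observed cumulative probability (based on the rank of a standardised residual) with the cumulative probability a standard normal distribution would give that residual. The plotting position (rank - 0.5) / n is an assumption made for this example and may not be the exact formula SPSS uses, so the dots can differ slightly from the SPSS chart. The function names are also made up.

```python
from statistics import NormalDist


def pp_points(std_residuals):
    """Return (observed, expected) cumulative probabilities, one pair per residual."""
    n = len(std_residuals)
    standard_normal = NormalDist()  # mean 0, standard deviation 1
    return [
        ((rank - 0.5) / n, standard_normal.cdf(z))
        for rank, z in enumerate(sorted(std_residuals), start=1)
    ]


def max_pp_gap(std_residuals):
    """Largest vertical distance between any dot and the diagonal line."""
    return max(abs(observed - expected) for observed, expected in pp_points(std_residuals))


# Hypothetical standardised residuals.
example = [-1.90, -0.80, -0.20, 0.10, 0.60, 1.70]
for observed, expected in pp_points(example):
    print(f"observed = {observed:.3f}, expected = {expected:.3f}")
```

The closer every pair is to having equal values, the closer the dots sit to the diagonal. There is no cut-off built in here on purpose, since judging the plot is still a matter of looking at it.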

Which brings us to the scatterplot, which will tell us if our data meets the assumptions of Homoscedasticity and Linearity. Now it is a bit hard to tell from the data we are using if these assumptions are met, as there are so few data points, and so I’m going to once again borrow some images from my textbook.

Basically you want your scatterplot to look something like the top left hand image. If it looks like any of the others then one or both of the assumptions has not been met. (The lines have been added to show the shape of the data; these will not appear on the actual scatterplot.) Again this is more art than science and comes down to how you interpret the image. That said, if your data has met all of the other assumptions then the chances are it will have met this one as well, so if you are a little unsure what the scatterplot is telling you, as you might be with the one produced with our data here, then look at your other results for guidance. And when it comes to writing it up, again you just say what you see.

The scatterplot of standardised predicted values (Note: You may want to call it the “scatterplot of standardised residuals” instead, either is good) showed that the data met the assumptions of homogeneity of variance and linearity.

Non-Zero Variances

As I said before I have left this one until last as you need to run a little bit of extra analysis to get the information you need. From the menus at the top select Analyse > Descriptive Statistics > Descriptives and you will get this box come up.

Add both your IVs and your DV to the Variable(s) box and then click Options.

Check the Variance box under the heading Dispersion and then click Continue. Click OK to run the analysis and you will see this new table added to your results titled Descriptive Statistics.

On this table you are looking for the heading Variance, and all you need to do is see whether the values are over zero or not. If they are then the assumption is met and can be reported like this:

The data also met the assumption of non-zero variances (IQ Scores, Variance = 122.51; Extroversion, Variance = 15.63; Sales Per Week, Variance = 152407.90).
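The same check is easy to reproduce outside SPSS. The sketch below uses the sample variance from Python's standard library (dividing by n - 1, which is the form the Descriptives table is assumed to report here) and builds the write-up sentence. The function names and the numbers are hypothetical, not the data from this post.

```python
from statistics import variance


def column_variances(columns):
    """Map each variable name to its sample variance (n - 1 in the denominator)."""
    return {name: variance(values) for name, values in columns.items()}


def non_zero_variances_met(variances):
    """The assumption holds when every variable has a variance above zero."""
    return all(value > 0 for value in variances.values())


def variance_sentence(variances):
    """Build a write-up sentence in the style used in this post."""
    parts = "; ".join(f"{name}, Variance = {value:.2f}" for name, value in variances.items())
    return f"The data also met the assumption of non-zero variances ({parts})."


# Hypothetical scores for three variables.
data = {
    "IQ Scores": [100, 110, 95, 120],
    "Extroversion": [12, 15, 9, 20],
    "Sales Per Week": [1500, 2300, 1200, 3100],
}
variances = column_variances(data)
if non_zero_variances_met(variances):
    print(variance_sentence(variances))
```

A variance of zero only happens when every participant has exactly the same score on a variable, so decimal values such as .72 still count as meeting the assumption.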

Ok, so that is all the assumptions taken care of, now we can get to actually analysing our data to see if we have found anything significant.

To be continued in Part Two.

UPDATE 20/09/2013 – When writing this post I used a number of images that I took from a powerpoint presentation on regressions that I got from my University. While I had no idea where they originally came from it has been pointed out to me that they are from Andy Field’s book Discovering Statistics Using SPSS and as such I should have acknowledged this fact when making use of them. I am now doing so and apologise for this oversight, it was never my intention to imply that the images were of my own creation. Also let me recommend that you pick up a copy of Andy Field’s book. I have been meaning to do so for some time, but have been lacking the funds to do so, as I have heard nothing but good things about it from my fellow psychology students. I have been told it is a great resource for all your SPSS and statistical needs.

64 Responses to "Reporting Multiple Regressions in APA format – Part One"

Emma says:

April 29th 2013 at 3:23 pm

Very useful indeed thank you – I look forward to part two

Reply
margie says:

May 4th 2013 at 10:36 am

your regression part one example is wonderful and has helped me with an assignment i’m doing. is part 2 available?

Reply
Sherri Elidrissi says:

June 22nd 2013 at 8:31 pm

Thank you so much. This was so helpful for my multivariate statistics course.

Reply
Kate says:

June 24th 2013 at 4:02 pm

Absolutely fantastic, I’ve been trying to figure this out for a week now and your guide just brought everything together. Thanks very much.

Reply
Muneeza says:

July 2nd 2013 at 10:33 pm

You really made it so easy for me to understand interpretation of multiple regression. Thank u so much

Reply
Andy Field says:

September 20th 2013 at 1:11 pm

I noticed that you have reproduced some images from my textbook ‘Discovering Statistics Using SPSS’ without acknowledging from where they came. Notwithstanding that fact that it’s a violation of copyright, it’s also just not very nice not to credit other people when you have used their work. I don’t mind you using the images if you acknowledge from where they came.

Andy Field

Reply
1. Andrew Dart says:
  
  September 20th 2013 at 9:10 pm
  
  Firstly let me apologise for this oversight, I was completely unaware the images came from your book. I copied them from a powerpoint presentation produced by my university and as such did not know I was violating any copyright. I have now added an update indicating where the images come from, as well as including a link to your book on Amazon. Thank you for bringing this to my attention, I will be more careful in future to find out the source of any images I use and give appropriate credit.
  
  Reply
2. Alison says:
  
  April 19th 2020 at 5:46 pm
  
  It’s not nice to come for somebody trying to help others. Check your attitude before you go on the offensive Andy. It’s easy to make mistakes when you start out. Don’t be so quick to forget you were in this position once.
  
  Reply
  1. Eric says:
    
    October 14th 2020 at 2:03 am
    
    Andy has every right to post what he did. In academia, publication is currency. Actually, Andy was nice about it just by posting. He could have very easily gotten lawyers involved and most likely the website would have been shut down and we would all be out of this knowledge.
    Andrew did the right thing by apologizing and took his apology to the next level by posting a link to the Andy Field’s book.
    
    Reply
    1. leeshy says:
      
      May 5th 2021 at 6:47 pm
      
      Yeah he’s right, but… the vibe was just off. It reiterates to me that you can be in the right and also a t0sser. Perfectly exampled by the last 4 words: ending a sentence with a preposition may be grammatically incorrect, but doing the opposite makes you sound REALLY haughty. Thanks for the walk through of the assumptions, super clear and helpful.
      
      Reply
3. Cons says:
  
  February 20th 2021 at 10:58 pm
  
  OMG ANDY your book is my long time go to bible whenever I have to write a thesis or paper or difficult statistical things. Can you please confirm the information on this page is correct and, more specifically, if the ‘how to report’ is written properly? (The ‘how to report’ is the only thing that I can not always figure out 100% from your book)
  
  Reply
Brianna says:

September 29th 2014 at 8:26 pm

Hi there,

There is no reference supporting the rules of thumb for VIF and tolerance so I was wondering, what work do you cite to support these rules?

Kind regards,
B

Reply
Ed says:

October 29th 2014 at 1:30 pm

Thank you, this helped me so much

Reply
Nurul says:

November 13th 2014 at 5:44 am

Thank you very much! it really helps, a lot. Psychologist to be! 🙂

Reply
Ruth says:

December 9th 2014 at 3:21 am

Thank you very much for this very clear and detailed explanation!

Reply
Olaya de la Iglesia says:

April 11th 2015 at 10:25 am

Thank you ever so much for this, this has saved me hours and hours in 2 different assignments. It is really well explained and illustrated. You are a godsend!

Reply
Mama says:

May 28th 2015 at 8:57 pm

Thanks thanks thanks! I have found it super useful… I missed information about Cook’s Distance. My data meets all the assumptions, except that one, which shows outliers. But I just tell you, thanks for sharing this information and be a good person reducing the effort to others.

peace and love! =)

Reply
Aleta Clegg says:

August 4th 2015 at 2:07 am

Thank you, thank you, thank you! I’m trying to write up my data analysis for my thesis and I am sorely lacking in statistics knowledge. You post was very clear and helpful, much better than most of what I’ve found online.

Reply
Ishrat Rehman says:

September 19th 2015 at 7:59 pm

its an amazing way of describing or interpreting results from multiple Regression .. thank you so much for your easiest and simple way of teaching..

Reply
Batagarawa Aminu Ibrahim says:

November 10th 2015 at 2:26 pm

You are really good. Humans need individuals like you

Reply
paul says:

February 7th 2016 at 1:32 pm

just wanted you to know that your blog post here is still helping people an awful lot! I’m totally rubbish at applying statistics to my research but thanks to you and this post I’ve been able to apply one of the most important statistical tests to my research which I found to be significant (whoop!). Thanks very much anyway!

Reply
becky says:

March 28th 2016 at 2:24 pm

This is absolutely wonderful, thank you so so so much for taking the time to write this! It has helped me enormously, taken all the stress away of searching through textbooks. You’re a star!

Reply
hamoudizi says:

May 2nd 2016 at 5:47 am

Aaaah, Thank you very much.

Reply
Mb says:

July 12th 2016 at 5:38 pm

This has helped me so much, it has made more sense than anything else I have read. You have made completing my stats analysis for my DClinPsy thesis a whole lot easier! Thank you!

Reply
Gerry says:

July 22nd 2016 at 9:37 am

My heartfelt thanks for this wonderfully clear and concise article, you have made my life as a researcher so much more easier!
Great job!

Reply
Jayanthi Thiyagarajan says:

August 27th 2016 at 6:13 am

Excellent way you have written..Very simple and clear to understand.. Thanks alot

Reply
Claudia says:

September 6th 2016 at 4:24 pm

Thank you so much – it really helped to understand the assumption for linear regression and how to interpret the SPSS outputs. I also order Andy Field’s book today 🙂

Reply
JJ says:

October 26th 2016 at 4:23 am

Hi there.

I have to say that when it comes to reporting regression in APA style, your post is the best on the internet – you have saved a lot of my time, I was looking how to report multiple regression and couldn’t find anything (well until now), even some of my core textbooks don’t go beyond explaining what is regression and how to run the analysis in the SPSS, so thank you kind Sir!

Reply
C says:

November 18th 2016 at 1:54 pm

did you always stick with the +/- 3.29 for outliers or +/-1.96?
i thought we should use +/- 1.96

Reply
1. Andrew Dart says:
  
  November 21st 2016 at 10:31 am
  
  Honest answer, I don’t know. I was taught to use the 3.29 figure, but if you have been told to use 1.96 I would go with that. In my experience if you do things the way the person reviewing your work does them you will probably be ok 🙂
  
  Reply
Sama says:

November 30th 2016 at 4:46 am

A great post! Thanks so much for providing this for us (graduate students- PhD(c) Nurse).. Almost there! Write up and analysis …

Reply
Shakera says:

December 4th 2016 at 2:30 am

THANK YOU!!

Reply
Yuanita Asri Langi says:

March 4th 2017 at 6:05 pm

Thank you….really helped

Reply
LS. says:

March 8th 2017 at 8:39 pm

This has helped me so much! Thank you

Reply
amna says:

April 20th 2017 at 3:19 pm

any idea how to report a non significant simple linear regression in apa?

Reply
LM says:

April 25th 2017 at 12:18 pm

This literally saved my dissertation! Thank you!

Reply
Zohaib Ali says:

May 1st 2017 at 5:02 pm

I’ve been stuck on conducting a regression analysis for days! This article has really helped me and i have to say that you are a god send for creating this!

Thank you

Zohaib Ali

Reply
Hiii says:

June 26th 2017 at 11:07 am

Hello! I have a question that is not necessarily about multiple linear regression. However, I have no clue where I should ask my question, so I hope you can help me out! I’m writing my thesis right now. One limitation of my study is that the sample is non independent (the sample consists of couples and they need to fill in the surveys for multiple times). However, the data is treated as dependent. I know that this could affect my results, but I don’t know exactly how. Do you perhaps have an idea? Thanks!! 🙂

Reply
Sonja says:

August 13th 2017 at 12:31 am

Hi – I am just writing up results for a third year stats paper: how would you report a Mahalanobis distance test that detected three outliers on two predictor variables? I excluded the three
Thanks – this is soooo helpful – sonja

Reply
Enideg Geremew says:

June 9th 2019 at 9:00 pm

Hi! I’m student from Ethiopia, attending MA class in Sociology. I have been doing my thesis using multiple regression as techniques of data analysis really I found this post very helpful.

Thank you indeed!

Reply
JD says:

April 17th 2020 at 4:23 pm

Amazing stuff! very helpful when nowhere else brings it all together like this. thank you!

Reply
1. Andrew Dart says:
  
  April 17th 2020 at 4:30 pm
  
  Happy to be of help 🙂
  
  Reply
CG says:

August 18th 2020 at 5:38 pm

Dear Andrew,

I have never left a review before for anything but I felt compelled to after finding this utter GEM. I also searched for hours for a comprehensive to-do list for the multiple regressions I am doing for my dissertation. I would also highly recommend Andy Field’s books but when you’re in a panic state just before hand-in books are a paperweight made entirely of stress. Thank you so much for being one of those helpful people who posts the answers to their problems.

Reply
1. Andrew Dart says:
  
  August 19th 2020 at 8:39 am
  
  Well thank you, I really appreciate the kind words 🙂 Hope the dissertation goes well
  
  Reply
Aishath Shina says:

August 26th 2020 at 1:40 pm

thank you so much. This is really very helpful

Reply
BA says:

September 28th 2020 at 11:43 pm

Thank you so much for your explanations! I did not understand this sort of analysis at all. When I started a Research Assistantship this semester, I found out how many things my beginning stats and research methods in psychology courses did not cover!

Your site offered so much insight! Thank you 🙂

– A grateful nontraditional undergrad student

Reply
Describing the impact of Smoking and Drinking Alcohol on Poor Physical or Mental health of individuals, using The Behavioral Risk Factor Surveillance System (BRFSS) dataset. – S.R. Tallie Consulting Services and The Right Hand Persons LLc says:

November 18th 2020 at 10:53 am

[…] Dart, A., (2013). Reporting Multiple Regressions in APA format – Part One. https://www.adart.myzen.co.uk/reporting-multiple-regressions-in-apa-format-part-one/ […]

Reply
sabarun says:

September 21st 2021 at 2:56 am

This is a good artcle. the language is simple and understandable.

Reply
Segs B says:

October 6th 2021 at 3:08 am

Thank you so much for this! I am a fourth year PhD student and I have never come across such a detailed and compact explanation for reporting in APA format. Yours was the only one I found online! And I almost didn’t click on it because the others were so disappointing…thanks once again, you’ll be glad to know this is still making a difference almost 10 years later!

Reply
1. Andrew Dart says:
  
  October 6th 2021 at 9:10 am
  
  Thank you for your kind words, I am really glad to hear that something I put together because I couldn’t locate anything about it either is still helping other people.
  
  Reply
Christina Lynn says:

January 7th 2022 at 3:48 pm

Thanks for taking the time to put part 1 and part 2 of MLR APA reporting together! (I’m the poor sap you helped!)

Reply
1. Andrew Dart says:
  
  January 7th 2022 at 3:50 pm
  
  Thanks for that, I am really glad that something I put together really just for my own information has been of so much use to people over the years 🙂
  
  Reply
Ruchi Singh says:

March 2nd 2022 at 6:34 am

Respected Andrew, will you please provide a multivariate regression table formate for presenting results. Actually, I am conducting a study taking two predictors and two criterion variables. I need an APA appropriate table formate for presenting the results in which two predictors predict two criterion variables. I will be really highly obliged for your kind support.

Thank you

Reply
1. Andrew Dart says:
  
  March 2nd 2022 at 11:22 am
  
  Hi Ruchi
  
  Thank you very much for your message, but the truth is I am not sure how much I can help you. It has now been almost 9 years since I have done any statistics at all, and I honestly can’t remember how to do what you are asking. What I will do, if I can find it, is post a picture of the tables from the dissertation that I wrote this blog post about. That will show you how I reported the results in table form, so hopefully will help you. Bear with me as it might take me a while to locate it, as I have moved house since then and am not completely sure where it is.
  
  Reply
  1. Ruchi Singh says:
    
    March 3rd 2022 at 4:29 am
    
    Respected Sir, thank you so so much for considering my query. Sir I would like to tell you that some time ago, in one of my studies I worked on 2IVs and 1DV and applied Multiple Regression Analysis. So, I formed a few tables taking all the three variables in a single table for my entire sample, sub-sample, and gender-wise. but here as I am working on 2 DVs. So, I was looking for such a table format in which I could report the results on both the DVs in a single table. I searched online, offline, but did not find any solution. Simultaneously, I posted this question on Research Gate also, and over there respected Rani P Ramachandran ma’am (Engineering Adjunct Professor) suggested 11 hours ago that each DV should be reported in separate tables. So, I am very happy to inform you that I have found the answer to my question. Again thank you very much for your cooperation and please give me your blessings 🙂
    
    Thank you…
    
    Reply
    1. Andrew Dart says:
      
      March 3rd 2022 at 11:15 am
      
      Hi Ruchi
      
      That is great to hear, I am very glad that you found an answer to your question. Best of luck with the rest of your paper.
      
      Reply
      1. Ruchi Singh says:
        
        March 3rd 2022 at 12:42 pm
        
        Thank you Sir ?…. thank you so much…
Ruchi Singh says:

March 3rd 2022 at 12:41 pm

Thank you Sir ?…. thank you so much…

Reply
Sarah says:

December 16th 2022 at 7:37 pm

Ok, got all the way to the end and my variance statistic is a decimal (e.g., they read .715, .430, 24.817, 45.701, 114.711). What does this mean? How should we deal with this? Thanks!

Reply
1. Andrew Dart says:
  
  December 20th 2022 at 3:27 pm
  
  Hey there, sorry for the delay in replying. It has been a long time since I actually did this, and I will be completely honest and admit I don’t really remember what it means. However, there is a section on reporting non-zero variance at the bottom of this page, hopefully that will help somewhat. Sorry I can’t be more helpful, if I work it out I will be sure to let you know.
  
  Reply
2. Andrew Dart says:
  
  December 20th 2022 at 3:33 pm
  
  Ok I found this – https://mathbitsnotebook.com/Algebra1/StatisticsData/STSD.html
  
  It seems that the variance relates to how spread out your data is. You should expect to get a non-zero variance value, unless all your data is identical, and it just tells you that there is a difference between your IVs
  
  https://www.youtube.com/watch?v=wc9NE7sMqjg
  
  Reply
Sue says:

August 1st 2023 at 2:22 pm

This was so helpful for my master’s dissertation! Thank you so much for explaining it in a clear and structured manner. You clearly have a knack for explaining things so I hope you are still writing.

Reply
1. Andrew Dart says:
  
  August 1st 2023 at 2:30 pm
  
  Thank you, I am really glad I was able to help you out. I don’t tend to do much of this type of writing any more, I mainly write novels these days, but maybe I should get back to it once in a while.
  
  Reply
James435 says:

December 8th 2023 at 7:11 pm

I believe it is a trick for observation and you should display eliminated answers.

Reply
