Free LSS Academy Guide to Lean Manufacturing

Sign up to receive a FREE copy of our 70+ page book, "LSS Academy Guide to Lean Manufacturing" and our "Insider Newsletter".

What is RSS?

Subscribe to LSS Academy

Click Here to Subscribe to Articles Subscribe By Email Below

Regression - Part 3

by Ron on March 6th, 2007

This evening we will wrap up our discussion of regression. So far we have discussed what regression is and a few ways to determine whether our model is significant.

Next up I want to discuss something called the least squares method and residuals. I will wrap it all up with a short discussion on the differences between correlation, causation, and extrapolation. Yikes, this sounds serious.

Least Squares Method

Our regression equation used to predict things is determined by a procedure known as the method of least squares. There is some math involved to sort this all out but the basic idea is simple. All we are doing is plotting the actual data points and drawing a line down the middle of them. This line is called the “best fit” line as it tries to minimize the distance of all the points to the best fit line (actually it is the total squared vertical distance for the statistics nerds out there).

So basically, we plot the actual data points and fit a line down the middle of them. That is the least squares method and I didn’t even need an entire book!

Residuals

I mentioned how the lack of a flip chart was slowing me down last night. Well I am trying out my scanner and while it is not the best it is better than nothing. As my nice little picture (compliments are very welcome by the way… hee hee) demonstrates, a residual is simply the distance between the actual data point and the predicted data point (also called the “fit”). Put another way, the residual is the leftover variation in Y after using X to predict it.

We like to look at our residuals when doing regression as it can help us spot any issues with data collection, variation issues, operator error, etc. There are a few assumptions we make with residuals, namely:

  • They are not related to the inputs
  • They don’t change over time – they are consistent
  • They are normal (bell shaped)

A nice Black Belt can help you ensure these assumptions are in check. If they are not in check you need to proceed with much caution (i.e. don’t try to predict anything).

Correlation, Causation, and Extrapolation

Typing those three words made me cringe. They sound so serious. Well don’t sweat it I will do my best to bring it down to earth for us normal people. Yes, I am normal. I swear. I am!

Correlation means that two things seem to be varying in a similar manner. If raising the temperature on our injection molding machine seems to be impacting the weight of the part we may say there is correlation.

Taking it one step further, causation means that when we change one variable the other variable in question changes too. So, in our injection molding example we may be able to prove causation by predicting what our Y will be given a specific X and then testing the theory! The 11th commandment of Six Sigma is “Thou Shall Confirm.”

Finally, the term extrapolation means that we attempt to predict Y outside the range of what was tested. So if you only tested up to 500 degrees with your injection molding machine you should not try to predict what will happen at 600 degrees. We have no data and do not know if there is a linear relationship.

Summary 

Well that about sums things up for our regression discussion. I hope you found it useful. As with anything, the best way to learn something is to give it a shot! So go collect some variable data and fit a line through it. Until next time, I wish you all the best on your journey towards continuous improvement.

Subscribe to LSS Academy

If you enjoyed this article please consider subscribing to our full feed RSS. You can also subscribe by email and have new articles sent directly to your inbox.


Did you know you can have future LSS Academy articles focused on leadership, lean, and six sigma sent directly to your email inbox for free? Just enter your email below:

3 comments...What do you think?

  1. Posted by Joe 9th March, 2007 at 10:31 am

    Hi Ron

    Very kind of for the discussion.

  2. Posted by Liquor Stores and Churches | Lean Six Sigma Academy 27th September, 2007 at 7:05 pm

    […] For more details on this correlation, causation, and even a little extrapolation fun check this post out. […]

  3. Posted by LSS Academy Series Review - Six Sigma Edition | Lean Six Sigma Academy 23rd March, 2008 at 10:26 pm

    […] values.  Finally, in part 3 we talked about residuals and the ultra important differences between correlation, causation, and extrapolation.  If you click on only only one link in this article.. click this last […]

What do you think? Join the discussion...