<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Oh Snap! We Sent Bad Product to Our Customer: Fun with the Hypergeometric Distribution</title>
	<atom:link href="http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/feed/" rel="self" type="application/rss+xml" />
	<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/</link>
	<description>Lean Manufacturing, Six Sigma, Lean Six Sigma, and Kaizen</description>
	<lastBuildDate>Wed, 01 Feb 2012 21:21:25 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Robert</title>
		<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/comment-page-1/#comment-3689</link>
		<dc:creator>Robert</dc:creator>
		<pubDate>Tue, 01 Sep 2009 13:15:48 +0000</pubDate>
		<guid isPermaLink="false">http://lssacademy.com/?p=1370#comment-3689</guid>
		<description>So why talking about fixing the inspection system? If 36 products out of 1000 are busted, there is room for improvement on the manufacturing side...
Or was this about explaining the distribution in a simple way?</description>
		<content:encoded><![CDATA[<p>So why talking about fixing the inspection system? If 36 products out of 1000 are busted, there is room for improvement on the manufacturing side&#8230;<br />
Or was this about explaining the distribution in a simple way?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sowrirajan</title>
		<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/comment-page-1/#comment-3674</link>
		<dc:creator>Sowrirajan</dc:creator>
		<pubDate>Fri, 21 Aug 2009 02:35:20 +0000</pubDate>
		<guid isPermaLink="false">http://lssacademy.com/?p=1370#comment-3674</guid>
		<description>Hi
There seem to a basic problem with the sampling method they used.
Sampling methods use certain sample size formulae that are developed for well defined static population situations.
Though random sampling can be applied to stable processes we use systematic or sub-group sampling with clear time order to represent the process behaviour precisely.
Representativeness is the most important aspect of sampling.
Ron can correct me if i have faltered in my assessment please.</description>
		<content:encoded><![CDATA[<p>Hi<br />
There seem to a basic problem with the sampling method they used.<br />
Sampling methods use certain sample size formulae that are developed for well defined static population situations.<br />
Though random sampling can be applied to stable processes we use systematic or sub-group sampling with clear time order to represent the process behaviour precisely.<br />
Representativeness is the most important aspect of sampling.<br />
Ron can correct me if i have faltered in my assessment please.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ron Pereira</title>
		<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/comment-page-1/#comment-3668</link>
		<dc:creator>Ron Pereira</dc:creator>
		<pubDate>Wed, 19 Aug 2009 15:33:52 +0000</pubDate>
		<guid isPermaLink="false">http://lssacademy.com/?p=1370#comment-3668</guid>
		<description>Gagandeep, the key to this analysis - and really any analysis that attempts to &quot;infer&quot; something from a sample of data is to ensure you do a good job sampling.  

My example here was made up... but in order to do what I describe for real you&#039;d need to have confidence in your sampling plan... also, you must know how many defects or defectives we&#039;re starting with in order to infer anything.  

So, if you don&#039;t have any data on how many defects or defectives a process has this must be your first step.  Go to gemba and observe first... Mintab second.  Make sense?</description>
		<content:encoded><![CDATA[<p>Gagandeep, the key to this analysis &#8211; and really any analysis that attempts to &#8220;infer&#8221; something from a sample of data is to ensure you do a good job sampling.  </p>
<p>My example here was made up&#8230; but in order to do what I describe for real you&#8217;d need to have confidence in your sampling plan&#8230; also, you must know how many defects or defectives we&#8217;re starting with in order to infer anything.  </p>
<p>So, if you don&#8217;t have any data on how many defects or defectives a process has this must be your first step.  Go to gemba and observe first&#8230; Mintab second.  Make sense?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gagandeep S. Datta</title>
		<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/comment-page-1/#comment-3667</link>
		<dc:creator>Gagandeep S. Datta</dc:creator>
		<pubDate>Wed, 19 Aug 2009 15:20:01 +0000</pubDate>
		<guid isPermaLink="false">http://lssacademy.com/?p=1370#comment-3667</guid>
		<description>Ron

I have a leading question to this ... suppose we did not have the defectives count in the population (which 99.9% is true!). i.e.,

- the number of defectives in the sample = 0
- the size of the sample = 100
- the number of defectives in the population = &quot;unknown&quot;
- the population size = 1000

What I have boserved is that many examples assume &quot; the number of defectives in the population = the size of the sample&quot; i.e.,

- the number of defectives in the sample = 0
- the size of the sample = 100
- the number of defectives in the population = 100
- the population size = 1000

Why is this so the case? 

For e.g., suppose we want the probability of getting 3 winning numbers in a lottery of 49 numbers where we draw 6 numbers. This is equivalent to saying, any 3 from 6 in 49 of 1/49% (2.04%).

The resulting excel formula and result is =HYPGEOMDIST(3,6,6,49) = 1.77%

The reason I am seeking this is because Minitab asks for Sample-Size, Population-Size and Population-Defectives, REALITY is that we have defectives IN samples and NOT in populations (populations which are theoretically and practically infinite!).

Seek clarity.

Gagandeep</description>
		<content:encoded><![CDATA[<p>Ron</p>
<p>I have a leading question to this &#8230; suppose we did not have the defectives count in the population (which 99.9% is true!). i.e.,</p>
<p>- the number of defectives in the sample = 0<br />
- the size of the sample = 100<br />
- the number of defectives in the population = &#8220;unknown&#8221;<br />
- the population size = 1000</p>
<p>What I have boserved is that many examples assume &#8221; the number of defectives in the population = the size of the sample&#8221; i.e.,</p>
<p>- the number of defectives in the sample = 0<br />
- the size of the sample = 100<br />
- the number of defectives in the population = 100<br />
- the population size = 1000</p>
<p>Why is this so the case? </p>
<p>For e.g., suppose we want the probability of getting 3 winning numbers in a lottery of 49 numbers where we draw 6 numbers. This is equivalent to saying, any 3 from 6 in 49 of 1/49% (2.04%).</p>
<p>The resulting excel formula and result is =HYPGEOMDIST(3,6,6,49) = 1.77%</p>
<p>The reason I am seeking this is because Minitab asks for Sample-Size, Population-Size and Population-Defectives, REALITY is that we have defectives IN samples and NOT in populations (populations which are theoretically and practically infinite!).</p>
<p>Seek clarity.</p>
<p>Gagandeep</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ron Pereira</title>
		<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/comment-page-1/#comment-3666</link>
		<dc:creator>Ron Pereira</dc:creator>
		<pubDate>Wed, 19 Aug 2009 12:34:40 +0000</pubDate>
		<guid isPermaLink="false">http://lssacademy.com/?p=1370#comment-3666</guid>
		<description>Thanks for the excellent analysis,  Gagandeep!  I, and I&#039;m sure those without Minitab access, appreciate the time you took to share this!</description>
		<content:encoded><![CDATA[<p>Thanks for the excellent analysis,  Gagandeep!  I, and I&#8217;m sure those without Minitab access, appreciate the time you took to share this!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gagandeep S. Datta</title>
		<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/comment-page-1/#comment-3664</link>
		<dc:creator>Gagandeep S. Datta</dc:creator>
		<pubDate>Wed, 19 Aug 2009 10:56:51 +0000</pubDate>
		<guid isPermaLink="false">http://lssacademy.com/?p=1370#comment-3664</guid>
		<description>When we talk of hyper-geometric distribution, we require the following data to establish a probability:
•	the number of successes in the sample
•	the size of the sample
•	the number of successes in the population
•	the population size
 
In excel the formula is entered as HYPGEOMDIST(the number of successes in the sample, the size of the sample, the number of successes in the population, the population size)

Let me first illustrate using a lottery example. For e.g., If we want the probability of getting 3 winning numbers in a lottery of 49 numbers where we draw 6 numbers, it is equivalent to saying, “any 3 from 6 in 49 of 1/49% (2.04%),” which in excel translates to: =HYPGEOMDIST(3,6,6,49) = 1.77%

So in the example stated by you, in this scenario,
-	the number of defectives in the sample = 0
-	the size of the sample = 100
-	the number of defectives in the population = 36
-	the population size = 1000

Which in excel translates to: =HYPGEOMDIST(0,100,36,1000) = 2.09676%</description>
		<content:encoded><![CDATA[<p>When we talk of hyper-geometric distribution, we require the following data to establish a probability:<br />
•	the number of successes in the sample<br />
•	the size of the sample<br />
•	the number of successes in the population<br />
•	the population size</p>
<p>In excel the formula is entered as HYPGEOMDIST(the number of successes in the sample, the size of the sample, the number of successes in the population, the population size)</p>
<p>Let me first illustrate using a lottery example. For e.g., If we want the probability of getting 3 winning numbers in a lottery of 49 numbers where we draw 6 numbers, it is equivalent to saying, “any 3 from 6 in 49 of 1/49% (2.04%),” which in excel translates to: =HYPGEOMDIST(3,6,6,49) = 1.77%</p>
<p>So in the example stated by you, in this scenario,<br />
-	the number of defectives in the sample = 0<br />
-	the size of the sample = 100<br />
-	the number of defectives in the population = 36<br />
-	the population size = 1000</p>
<p>Which in excel translates to: =HYPGEOMDIST(0,100,36,1000) = 2.09676%</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/comment-page-1/#comment-3602</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Tue, 11 Aug 2009 01:24:28 +0000</pubDate>
		<guid isPermaLink="false">http://lssacademy.com/?p=1370#comment-3602</guid>
		<description>Do defects occur completely randomly or in clusters? If the latter i would think entirely random sampling might not be the most efficient.</description>
		<content:encoded><![CDATA[<p>Do defects occur completely randomly or in clusters? If the latter i would think entirely random sampling might not be the most efficient.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/comment-page-1/#comment-3601</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Tue, 11 Aug 2009 01:20:08 +0000</pubDate>
		<guid isPermaLink="false">http://lssacademy.com/?p=1370#comment-3601</guid>
		<description>If you were making teh widgets in lots of 10, testing every 10th wouldnt be the smartest idea. 10mod10=0</description>
		<content:encoded><![CDATA[<p>If you were making teh widgets in lots of 10, testing every 10th wouldnt be the smartest idea. 10mod10=0</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Hajek</title>
		<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/comment-page-1/#comment-3574</link>
		<dc:creator>Jeff Hajek</dc:creator>
		<pubDate>Sat, 08 Aug 2009 06:56:29 +0000</pubDate>
		<guid isPermaLink="false">http://lssacademy.com/?p=1370#comment-3574</guid>
		<description>This article only talks about one shipment to one customer. The sampling plan is undoubtedly used on all of the shipments. So, out of every fifty shipments, it is likely that one will look good and actually be bad (consumer&#039;s risk, or beta risk).
If this were the only inspection that happened and it had a problem, the most likely scenario is that there is a problem with the inspection process. I would look at that first. 
On the other hand, if there were 49 other inspections that turned out right (i.e. properly categorized the lot), the sampling process might not be broken. Sampling is a shortcut, and carries risk. Every sampling plan will, eventually, have this kind of problem. Even with a sample size of 999, there is still a chance, no matter how small, that the one that isn&#039;t inspected will be the one that is defective. That&#039;s why sampling isn&#039;t used for critical functions, like airport security.
There&#039;s another important task, though. We don&#039;t know how long ago the problem happened. Since there is only a 1 in 50 (ish) chance that this happened to the very first customer, the producer probably needs to be looking at prior shipments too.
The good news, though, is that the article mentions that a permanent fix (poka yoke) was put in place. It sounds like the company is on the right track--preventing defects at the source.</description>
		<content:encoded><![CDATA[<p>This article only talks about one shipment to one customer. The sampling plan is undoubtedly used on all of the shipments. So, out of every fifty shipments, it is likely that one will look good and actually be bad (consumer&#8217;s risk, or beta risk).<br />
If this were the only inspection that happened and it had a problem, the most likely scenario is that there is a problem with the inspection process. I would look at that first.<br />
On the other hand, if there were 49 other inspections that turned out right (i.e. properly categorized the lot), the sampling process might not be broken. Sampling is a shortcut, and carries risk. Every sampling plan will, eventually, have this kind of problem. Even with a sample size of 999, there is still a chance, no matter how small, that the one that isn&#8217;t inspected will be the one that is defective. That&#8217;s why sampling isn&#8217;t used for critical functions, like airport security.<br />
There&#8217;s another important task, though. We don&#8217;t know how long ago the problem happened. Since there is only a 1 in 50 (ish) chance that this happened to the very first customer, the producer probably needs to be looking at prior shipments too.<br />
The good news, though, is that the article mentions that a permanent fix (poka yoke) was put in place. It sounds like the company is on the right track&#8211;preventing defects at the source.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ron Pereira</title>
		<link>http://lssacademy.com/2009/08/03/oh-snap-we-sent-bad-product-to-our-customer-fun-with-the-hypergeometric-distribution/comment-page-1/#comment-3557</link>
		<dc:creator>Ron Pereira</dc:creator>
		<pubDate>Tue, 04 Aug 2009 22:36:48 +0000</pubDate>
		<guid isPermaLink="false">http://lssacademy.com/?p=1370#comment-3557</guid>
		<description>Hi everyone, thanks for your great and thought provoking comments.  I really appreciate them.</description>
		<content:encoded><![CDATA[<p>Hi everyone, thanks for your great and thought provoking comments.  I really appreciate them.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

