[PDF] Normal approximation to the Binomial





Loading...








[PDF] The Normal Approximation to the Binomial Distribution

Remarkably, when n, np and nq are large, then the binomial distribution is well approximated by the normal distribution According to eq (8 3) on p 762 of Boas 




[PDF] Normal approximation to the binomial

Theorem 9 1 (Normal approximation to the binomial distribution) If Sn is a binomial variable with parameters n and p, Binom (n, p), then

[PDF] Approximating the Binomial Distribution by the Normal - DiVA Portal

21 jui 2011 · In this paper an examination is made regarding the size of the approximations errors The exact probabilities of the binomial distribution is 

[PDF] Lab Project 5: The Normal approximation to Binomial distribution

Lab Project 5: The Normal approximation to Binomial distribution Course : Introduction to Probability and Statistics, Math 113 Section 3234

[PDF] 65 The Normal Approximation to the Binomial Distribution

12 nov 2019 · Although it seems strange, under certain circumstances a (continuous) normal distribution can be used to approximate a (discrete) binomial 




[PDF] Normal approximation to the Binomial

In 1733, Abraham de Moivre presented an approximation to the Binomial distribution He later (de Moivre, 1756, page 242) appended the derivation

[PDF] Normal Distribution as Approximation to Binomial Distribution

Normal Approximation to the Binomial distribution IF np > 5 AND nq > 5, then the binomial random variable is approximately normally distributed with mean µ =np 

[PDF] NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION

For accurate values for binomial probabilities, either use computer software to do exact calculations or if n is not very large, the probability calculation 

[PDF] Section 55, Normal Approximations to Binomial Distributions

Let x represent a binomial random variable for n trials, with probability of Since the binomial distribution is discrete and the normal distribution is 




[PDF] The Normal Distribution as an Approximation to the Binomial

For a large enough number of trials (n) the area under normal curve can be used to approximate the probability of a binomial distribution Requirements:

[PDF] Normal approximation to the binomial - UConn Undergraduate

Theorem 9 1 (Normal approximation to the binomial distribution) Below is a table on how to use the continuity correction for normal approximation to a

[PDF] Normal approximation to the Binomial

Normal approximation to the Binomial 5 1 History In 1733, Abraham de Moivre presented an a + b\n expanded into a Series, from whence are deduced some display, for the Bin(n,0 4) distribution with n = 20,50,100,150,200, are typical

PDF document for free
  1. PDF document for free
[PDF] Normal approximation to the Binomial 29623_6Bin_Normal.pdf

Chapter5

Normal approximation to the

Binomial

5.1

History

In 1733, Abraham de Moivre presented an approximation to the Binomial distribution. He later ( de Moivre , 1756
, page 242) appended the derivation of his approximation to the solution of a problem asking for the calculation of an expected value for a particular game. He posed the rhetorical question of how we might show that experimental proportions should be close to their expected values: From this it follows, that if after taking a great number of Experi- ments, it should be perceived that the happenings and failings have been nearly in a certain proportion, such as of 2 to 1, it may safely be concluded that the Probabilities of happening or failing at any one time assigned will be very near in that proportion, and that the greater the number of Experiments has been, so much nearer the Truth will the conjectures be that are derived from them. But suppose it should be said, that notwithstanding the reason- ableness of building Conjectures upon Observations, still considering the great Power of Chance, Events might at long run fall out in a di erent proportion from the real Bent which they have to happen one way or the other; and that supposing for Instance that an Event might as easily happen as not happen, whether after three thousand Experiments it may not be possible it should have happened two thou- sand times and failed a thousand; and that therefore the Odds against so great a variation from Equality should be assigned, whereby the Mind would be the better disposed in the Conclusions derived from the Experiments.

Statistics 241/541 fall 2014

c

David Pollard, Sept20111

5. Normal approximation to the Binomial 2

In answer to this, I'll take the liberty to say, that this is the hardest Problem that can be proposed on the Subject of Chance, for which reason I have reserved it for the last, but I hope to be forgiven if my Solution is not tted to the capacity of all Readers; however I shall derive from it some Conclusions that may be of use to every body: in order thereto, I shall here translate a Paper of mine which was printed November 12, 1733, and communicated to some Friends, but never yet made public, reserving to myself the right of enlarging my own Thoughts, as occasion shall require. De Moivre then stated and proved what is now known as the normal approximation to the Binomial distribution. The approximation itself has subsequently been generalized to give normal approximations for many other distributions. Nevertheless, de Moivre's elegant method of proof is still worth understanding. This Chapter will explain de Moivre's approximation, using modern notation.

A Method of approximating the Sum of the Terms of the Binomiala+bnnexpanded into a Series, from whence are deduced some

practical Rules to estimate the Degree of Assent which is to be given to Experiments. Altho' the Solution of problems of Chance often requires that several Terms of the Binomiala+bnnbe added together, never- theless in very high Powers the thing appears so laborious, and of so great diculty, that few people have undertaken that Task; for besides James and Nicolas Bernouilli, two great Mathemati- cians, I know of no body that has attempted it; in which, tho' they have shown very great skill, and have the praise that is due to their Industry, yet some things were further required; for what they have done is not so much an Approximation as the deter- mining very wide limits, within which they demonstrated that the Sum of the Terms was contained. Now the method ... 5.2

Pictures of the binomial

SupposeXnhas a Bin(n;p) distribution. That is,

b n(k) :=PfXn=kg=n k p kqnkfork= 0;1;:::;n, whereq= 1p,

Statistics 241/541 fall 2014

c

David Pollard, Sept2011

5. Normal approximation to the Binomial 3

Recall that we can think ofXnas a sum of independent random variables Y

1++YnwithPfYi= 1g=pandPfYi= 0g=q. From this representation

it follows that EXn=X iEYi=nEY1=np var(Xn) =X ivar(Yi) =nvar(Y1) =npq Recall also that Tchebychev's inequality suggests the distribution should be clustered aroundnp, with a spread determined by the standard devia- tion,n:=pnpq. What does the Binomial distribution look like? The plots in the next display, for the Bin(n;0:4) distribution withn= 20;50;100;150;200, are typical. Each plot on the left shows bars of heightbn(k) and width 1, centered atk. The maxima occur nearn0:4 for each plot. Asnincreases, the spread also increases, re ecting the increase in the standard deviations  n=pnpqforp= 0:4. Each of the shaded regions on the left has area to one becausePn k=0bn(k) = 1 for eachn.0204060801001200 0.1 0.2

Bin(20,0.4)

-4-20240 0.5

0204060801001200

0.1 0.2

Bin(50,0.4)

-4-20240 0.5

0204060801001200

0.1 0.2

Bin(100,0.4)

-4-20240 0.5

0204060801001200

0.1 0.2

Bin(150,0.4)

-4-20240 0.5

0204060801001200

0.1 0.2

Bin(200,0.4)

-4-20240

0.5The plots on the right show represent the distributions of the standard-

ized random variablesZn= (Xnnp)=n. The location and scaling e ects of the increasing expected values and standard deviations (withp= 0:4 and variousn) are now removed. Each plot is shifted to bring the location of the maximum close to 0 and the horizontal scale is multiplied by a factor 1=n.

Statistics 241/541 fall 2014

c

David Pollard, Sept2011

5. Normal approximation to the Binomial 4

A bar of heightnbn(k) with width 1=nis now centered at (knp)=n. The plots all have similar shapes. Each shaded region still has area 1. 5.3

De Moivre's argumen t

Notice how the standardized plots in the last picture settle down to a sym- metric `bell-shaped' curve. You can understand this e ect by looking at the ratio of successive terms: b n(k)=bn(k1) =n!k!(nk)!pkqnk =n!(k1)!(nk+ 1)!pk1qnk+1 = (nk+ 1)p=(kq) fork= 1;2;:::;n. As a consequence,bn(k)bn(k1) if and only if (nk+ 1)pkq, that is, i (n+ 1)pk. For xedn, the probabilitybn(k) achieves its largest value atkmax=b(n+1)pc np. The probabilitiesbn(k) increase withkfor kkmaxthen decrease fork > kmax. That explains why each plot on the left has a peak nearnp. Now for the shape. At least fork=kmax+inearkmaxwe get a good approximation for the logarithm of the ratio of successive terms using the Taylor approximation: log(1 +x)xforxnear 0. Indeed, b(kmax+i)=b(kmax+i1) =(nkmaxi+ 1)p(kmax+i)q  (nqi)p(np+i)q =

1i=(nq)1 +i=(np)after dividing through bynpq:

The logarithm of the last ratio equals

log  1inq  log

1 +inp

  inq inp =inpq : By taking a product of successive ratios we get the ratio of the individual Binomial probabilities to their largest term. On a log scale the calculation

Statistics 241/541 fall 2014

c

David Pollard, Sept2011

5. Normal approximation to the Binomial 5

is even simpler. For example, ifm1 andkmax+mn, log b(kmax+m)b(kmax) = logb(kmax+ 1)b(kmax)b(kmax+ 2)b(kmax+ 1)  b(kmax+m)b(kmax+m1) = log b(kmax+ 1)b(kmax)+ logb(kmax+ 2)b(kmax+ 1)++ logb(kmax+m)b(kmax+m1)  12  mnpq  12 m2npq : The last line used the fact that 1 + 2 + 3 ++m=12 m(m+ 1)12 m2:

In summary,

PfX=kmax+mg b(kmax)exp

m22npq formnot too large. An analogous approximation holds for 0kmax+mkmax. 5.4

The largest binomial probabilit y

Using the fact that the probabilities sum to 1, forp= 1=2 de Moivre was able to show that theb(kmax) should decrease like 2=(Bpn), for a constantB that he was initially only able to express as an in nite sum. Referring to his calculation of the ratio of the maximum term in the expansion of (1 + 1) n to the sum, 2 n, he wrote (de Moivre,1756 , page 244) When I rst began that inquiry, I contented myself to deter- mine at large the Value ofB, which was done by the addition of some Terms of the above-written Series; but as I perceived that it converged but slowly, and seeing at the same time that what I had done answered my purpose tolerably well, I desisted from proceeding further till my worthy and learned Friend Mr. James Stirling, who had applied himself after me to that inquiry, found that the QuantityBdid denote the Square-root of the Cir- cumference of a Circle whose Radius is Unity, so that if that Circumference be calledc, the Ratio of the middle Term to the

Sum of all the Terms will be expressed by2pnc.

Statistics 241/541 fall 2014

c

David Pollard, Sept2011

5. Normal approximation to the Binomial 6

In modern notation, the vital fact discovered by the learned Mr. James

Stirling asserts that

n!p2 nn+1=2enforn= 1;2;::: in the sense that the ratio of both sides tends to 1 (very rapidly) asngoes to in nity. See

F eller

( 1968
, pp52-53) for an elegant, modern derivation of the Stirling formula.

By Stirling's formula, fork=kmaxnp,

b n(k) =n!k!(nk)!pkqnk  1p2n n+1=2(np)np+1=2(nq)nq+1=2pnpqnq =

1p2npq:

De Moivre's approximation becomes

PfXn=kmax+mg 1p2npqexp

m22npq ; or, substitutingnpforkmaxand writingkforkmax+m,

PfXn=kg 1p2npqexp

(knp)22npq =1 np2exp (knp)222n : That is,PfXn=kgis approximately equal to the area under the smooth curve f(x) =1 np2exp (xnp)222n ; for the intervalk1=2xk+ 1=2. (The length of the interval is 1, so it does not appear in the previous display.) Similarly, for each pair of integers with 0a < bn,

PfaXnbg=X

b k=abn(k)X b k=aZ k+1=2 k1=2f(x)dx=Z b+1=2 a1=2f(x)dx: A change of variables,y= (xnp)=n, simpli es the last integral to 1p2Z ey2=2dywhere =anp1=2 nand =bnp+ 1=2 n: Remark.It usually makes little di erence to the approximation if we omit the1=2 terms from the de nitions of and .

Statistics 241/541 fall 2014

c

David Pollard, Sept2011

5. Normal approximation to the Binomial 7

5.5

Normal appro ximations

How does one actually perform a normal approximation? Back in the olden days, I would have interpolated from a table of values for the function (x) :=1p2Z x 1 ey2=2dy; which was found in most statistics texts. For example, ifXhas a Bin(100;1=2) distribution,

Pf45X55g 55:5505

 44:5505  0:86430:1356 = 0:7287

These days, I would just calculate in R:

> pnorm(55.5, mean = 50, sd = 5) - pnorm(44.5, mean = 50, sd = 5) [1] 0.7286679 or use another very accurate, built-in approximation: > pbinom(55,size = 100, prob = 0.5) - pbinom(44,size = 100, prob = 0.5) [1] 0.728747 5.6

Con tinuousdistributions

At this point, the integral in the de nition of (x) is merely a re ection of the Calculus trick of approximating a sum by an integral. Probabilists have taken a leap into abstraction by regarding , or its derivative(y) := exp(y2=2)=p2, as a way to de ne a probability distribution <5.1>De nition.A random variableYis said to have acontinuous distribu- tion(onR) withdensity functionf()if

PfaYbg=Z

b a f(y)dyfor all intervals [a;b]R:

Equivalently, for each subsetAof the real line,

PfY2Ag=Z

A f(y)dy=Z 1 1

Ify2Agf(y)dy

Statistics 241/541 fall 2014

c

David Pollard, Sept2011

5. Normal approximation to the Binomial 8

Notice thatfshould be a nonnegative function, for otherwise it might get awkward when calculatingPfY2Agfor the setA=fy2R:f(y)<0g:

0PfY2Ag=Z

A f(y)dy0:

Remark.By puttingAequal toRwe get

1 =Pf1< Y <+1g=Z

1 1 f(y)dy That is, the integral of a density function over the whole real line equals one. I prefer to think of densities as being de ned on the whole real line, with values outside the range of the random variable being handled by setting the density function equal to zero in appropriate places. If a range of integration is not indicated explicitly, it can then always be understood as1to1, with the zero density killing o unwanted contributions. Distributions de ned by densities have both similarities to and di er- ences from the sort of distributions I have been considering up to this point in Stat 241/541. All the distributions before now werediscrete. They were described by a (countable) discrete set of possible valuesfxi:i= 1;2;:::g that could be taken by a random variableXand the probabilities with whichXtook those values:

PfX=xig=pifori= 1;2;::::

For any subsetAof the real line

PfX2Ag=X

iIfxi2AgPfX=xig=X iIfxi2Agpi Expectations, variances, and things likeEg(X) for various functionsg, could all be calculated by conditioning on the possible values forX. For a random variableXwith a continuous distribution de ned by a densityf, we have

PfX=xg=Z

x x f(y)dy= 0 for everyx2R. We cannot hope to calculate a probability by adding up (an uncountable set of) zeros. Instead, as you will see in Chapter 7, we must pass to a limit and replace sums by integrals when a random variableXhas a continuous distribution.

Statistics 241/541 fall 2014

c

David Pollard, Sept2011

5. Normal approximation to the Binomial 9

5.7

App endix:The m ysterious

p2 The p2appeared in de Moivre's approximation by way of Stirling's for- mula. It is slightly mysterious why it appears in that formula. The reason for both appearances is the fact that the constant C:=Z 1 1 exp(x2=2)dx is exactly equal to p2, as I now explain. Equivalently, the constantC2=RRexp((x2+y2)=2)dxdyequal to 2. (Here, and subsequently, the double integral runs over the whole plane.) We can evaluate this double integral by using a small Calculus trick.

Using the fact thatZ1

0

Ifrzgezdz=erforr >0;

we may rewriteC2as a triple integral: replacerby (x2+y2)=2, then substitute into the double integral to get C 2=ZZ Z1 0

Ifx2+y22zgezdz

dxdy = Z 1 0 ZZ

Ifx2+y22zgdxdy

e zdz: With the change in the order of integration, the double integral is now calculating the area of a circle centered at the origin and with radiusp2z.

The triple integral reduces toZ1

0 p2z

2ezdz=Z

1 0 2zezdz= 2:

That is,C=p2.

References

de Moivre, A. (1756).The Doctrine of Chances: or, A Method of Calculating the Probabilities of Events in Play(Third ed.). New York: Chelsea. Third edition (fuller, clearer, and more correct than the former), reprinted in

1967. First edition 1718.

Feller, W. (1968).An Introduction to Probability Theory and Its Applications (third ed.), Volume 1. New York: Wiley.

Statistics 241/541 fall 2014

c

David Pollard, Sept2011


Normal Distribution Documents PDF, PPT , Doc

[PDF] a normal distribution curve where xu003d70 and

  1. Math

  2. Statistics And Probability

  3. Normal Distribution

[PDF] a normal distribution typically has how many tails

[PDF] anti normal distribution

[PDF] ap statistics normal distribution practice test

[PDF] approximately normal distribution when

[PDF] binomial distribution into normal

[PDF] bivariate normal distribution example problems

[PDF] bivariate normal distribution exercises

[PDF] can a normal distribution across the x axis

[PDF] can normal distribution be skewed

Politique de confidentialité -Privacy policy