Solve the following, using both the binomial distribution and the normal approximation to the binomial a What is the probability that exactly 7 people will
Given a binomial distribution X with n trials, success probability p, we can approximate it using a Normal random variable N with mean np, variance np(1 ? p)
19 juil 2017 · Many things in the world are not quite distributed normally, but data scientists and computer scientists model them as normal distributions
When you take the parametric approach to inferential statistics, the values that are assumed to be normally distributed are the means across samples To be
3 3 2 Condition of Normal Distribution: i) Normal distribution is a limiting form of the binomial distribution under the following conditions
If x, y are independently distributed random variables, then V (x+y) = V (x)+V (y) But this is not true in general The variance of the binomial distribution
In those days, the binomial distribution was known as a discrete probability distribution in the way we think of discrete distributions today, but it is not
While the heights of human beings follow a normal distribution, weights do not (This linear interpolation is not strictly correct but is acceptable )
There is no closed form for the distribution function of the Normal distribution A sufficient condition on X for the Central Limit Theorem to apply is
![[PDF] The Normal Distribution [PDF] The Normal Distribution](https://pdfprof.com/EN_PDFV2/Docs/PDF_6/107567_6110_normal_distribution.pdf.jpg)
107567_6110_normal_distribution.pdf - 1 -
Will Monroe
CS 109Lecture Notes #11
July 19, 2017The Normal DistributionBased on a chapter by Chris Piech Thesinglemostimportantrandomvariabletypeisthenormal(a.k.a.Gaussian)randomvariable, parametrized by a mean () and variance (2). IfXis a normal variable, we writeXN(;2). The normal is important for many reasons: it is generated from the summation of independent random variables and as a result it occurs often in nature. Many things in the world are not quite distributed normally, but data scientists and computer scientists model them as normal distributions anyways. Why? Essentially, the normal is what we use if we know mean and variance, but nothing else. In fact, it is the most conservative1modeling decision that we can make for a random variable while still matching a particular expectation (average value) and variance (spread). The probability density function (PDF) for a normalXN(;2)is: f
X(x)=1
p2e 12 (x )2 Notice thexin the exponent of the PDF function. Whenxis equal to the mean (), theneis raised to the power of0and the PDF is maximized.
By design, a normal hasE[X]=andVar(X)=2.
Linear Transform
IfXis a normal RV such thatXN(;2)andY=aX+b(Yis a linear transform ofX), thenY is also a normal RV where:
YN(a+b;a22)
Projection to Standard Normal
For any normal RVXwe can find a linear transform fromXto thestandard normalN(0;1). That
is, if you subtract the mean () of the normal and divide by the standard deviation (), the result is
distributed according to the standard normal. We can prove this mathematically. LetW=X :
W=X
transformX: subtractand divide by = 1 X use algebra to rewrite the equation =aX+bwherea=1 ,b= N(a+b;a22)the linear transform of a normal is another normal N( ;2
2)substituting values in foraandb
N(0;1)the standard normal1Formally,ithasthehighestentropyH(X)= Rdx f(x)logf(x)ofanydistributiongiventhemeanandvariance.
- 2 - An extremely common use of this transform is to expressFX(x), the CDF ofX, in terms of the CDF ofZ,FZ(x). Since the CDF ofZis so common it gets its own Greek symbol:(x) F
X(x)=P(Xx)
=P X x ! =P
Zx
=x Whyisthisuseful?Well,inthedayswhenwecouldn"tcallscipy.stats.norm.cdf(oronexams, when one doesn"t have a calculator), people would look up values of the CDF in a table (see the last page of these notes). Using the standard normal means you only need to build a table of one distribution, rather than an indefinite number of tables for all the different values ofand! We also have an online calculator on the CS 109 website. You should learn how to use the normal table for the exams, however!
How to Remember that Crazy PDF
What is the PDF of the standard normalZ? Let"s plug it in: f
Z(z)=1
p2e 12 (z )2=11 p2e 12 z 01
2=1p2e 12
z2
This gets even better if we realize that
1p2is just a constant to make the whole thing integrate to 1.
Call that constantC:
f
Z(z)=Ce 12
z2 Not so scary anymore, is it? In fact, this equation can be a rather helpful mnemonic: the normal distribution PDF is just the exponential of a parabola. What does that look like?f(z)= 12 z2f(z)=e 12 z2 As it turns out, the exponential of a (downward) parabola is a familiar shape: the "bell curve". Now bring back the fact thatZ=X , and you can see thatdetermines where the "peak" of the bell curve will be, whiletells you how wide it is. (Don"t forget thatCchanges too!) - 3 -
Example 1
LetXN(3;16), what isP(X>0)?
P(X>0)=P X 34
>0 34 ! =P Z> 34 ! =1 P
Z 34
! =1 ( 34 )=1 (1 (34 ))=(34 )=0:7734
What isP(2 P(2