[PDF] [PDF] STAT301: Cheat Sheet - TAMU Stat





Previous PDF Next PDF



Statistics Cheat Sheet Statistics Cheat Sheet

Statistics Cheat Sheet. Population. The entire group one desires information statistic and degrees of freedom. Ha: μAμ0 → the t-statistic is likely ...



Frequently Used Statistics Formulas and Tables Frequently Used Statistics Formulas and Tables

*see table 7-2 (last page of formula sheet). Confidence Intervals. Level of Confidence z-value ( /2 z α. ) 70%. 1.04. 75%. 1.15. 80%. 1.28. 85%. 1.44. 90%.



Statistics Formula Sheet and Tables 2020 Statistics Formula Sheet and Tables 2020

critical value standard error of statistic. ±. Chi-square statistic: (. ) 2 observed expected expected χ. −. = ∑. 2. AP Statistics 2020 Formulas and Tables 



TI 83/84 Calculator – The Basics of Statistical Functions

What to do next. Clear numbers already in a list: Arrow up to L1 then hit CLEAR



Important Concepts not on the AP Statistics Formula Sheet

With systematic random sampling we create a list of every member of the population. From the list



Excel Formulas Cheat Sheet

Excel Formulas Cheat Sheet. Page 4 of 7. Statistical Formulas. Formula. Structure. Explanation. SUMIF. = SUMIF(range criteria



List MF19

List of formulae and statistical tables. Cambridge International AS & A Level FURTHER PROBABILITY & STATISTICS. Sampling and testing. Two-sample estimate of ...



STAT301: Cheat Sheet - • Algebra (i) a+z×b−a

Normal calculations. Note the normal distribution is (i) symmetric about the One of the most common statistical methods is when samples are collected from two ...



TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

The median M is located (n + 1)/2 observations from the beginning of this list. • Quartiles: The first quartile Q1 is the median of the observations whose 





Statistics Cheat Sheet

Statistics Cheat Sheet. Population Reflects the extent to which a statistic changes from sample to sample. For a mean.



Statistics Cheat Sheet

Statistics Cheat Sheet. Population Reflects the extent to which a statistic changes from sample to sample. For a mean.



Frequently Used Statistics Formulas and Tables

Math PRB nCr enter r. Note: textbooks and formula sheets interchange “r” and “x” for number of successes. Chapter 5. Discrete Probability Distributions:.



Probability Cheatsheet v2.0

4 Sept 2015 Probability Cheatsheet v2.0. Compiled by William Chen ... Uniform Order Statistics The jth order statistic of ... Formulas. Geometric Series.



Business Statistics Formula Sheet (Download Only) - lemmy.riotfest

2 days ago Getting the books business statistics formula sheet now is not type of ... Business Statistics 1 (QM161) Statistics Formulas Cheat. Sheet.



TI 83/84 Calculator – The Basics of Statistical Functions

Chapter 3 – Statistics for Describing Exploring and Comparing Data Formulas. µ = n · p. ? q = 1 - p. Using the Calculator ... sheet on 1- sentence.



Business

Recruiting Metrics Cheatsheet. 2 talent.linkedin.com. 03 Application Completion Rate ›. 04 Candidate Call Back Rate ›. 05 Candidates per Hire ›.



Important Concepts not on the AP Statistics Formula Sheet

Find the t-test statistic and p-value for the effect cry count has on IQ. From the regression analysis t = 3.07 and p = 0.004. Or. 07.3. 4870.0. 4929.1.



Statistics Formula Sheet and Tables 2020

statistic critical value standard error of statistic. ±. Chi-square statistic: (. ) 2 observed expected expected ?. ?. = ?. 2. AP Statistics 2020 Formulas 



TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

t test statistic for H0 : µ = µ0 (SRS from Normal same formula after adding 2 successes and two ... same formulas after adding one success and one.



[PDF] Statistics Cheat Sheet

Statistics Cheat Sheet Population The entire group one desires Reflects the extent to which a statistic changes from sample to sample For a mean



[PDF] Frequently Used Statistics Formulas and Tables

Frequently Used Statistics Formulas and Tables Chapter 2 Note: textbooks and formula sheets interchange *see table 7-2 (last page of formula sheet)



[PDF] STAT301: Cheat Sheet - TAMU Stat

Probability The chance of a certain event happening Example: Out of 44 calves 12 weighed less than 90 pounds The probability of randomly picking a



[PDF] Statistics Cheat Sheet - Squarespace

Statistics Cheat Sheet Ch 1: Overview Descriptive Stats Populations Samples and Processes Population: well-defined collection of objects



[PDF] Statistics Cheat Sheet - Blast Analytics and Marketing

Statistics – Practice or science of collecting and analyzing numerical data Check out Probability Correlation Cheat Sheet for more on this one!



[PDF] TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Plus four to greatly improve accuracy: use the same formula after adding 2 successes and two failures to the data • z test statistic for H0 : p = p0 (large 



Basic Statistics & Probability Formulas - PDF Download - Pinterest

Feb 1 2021 - The complete list of statistics probability functions basic formulas cheat sheet for PDF download



[PDF] Statistics Formula Sheet and Tables 2020 - AP Central

critical value standard error of statistic ± Chi-square statistic: ( ) 2 observed expected expected ? ? = ? 2 AP Statistics 2020 Formulas and 



Statistics Formula Cheat Sheet - Studocu

Helpful tool for studying statistics formulas frequently used statistics formulas and tables chapter class width highest value lowest value (increase to 

  • What are the formulas used in statistics?

    MedianIf n is odd, then M = ( n + 1 2 ) t h term If n is even, then M = ( n 2 ) t h t e r m + ( n 2 + 1 ) t h t e r m 2ModeThe value which occurs most frequentlyVariance? 2 = ? ( x ? x ¯ ) 2 nStandard DeviationS = ? = ? ( x ? x ¯ ) 2 n
  • What is a basic statistics formula?

    The important statistics formulas are listed in the chart below: Mean (¯x)¯x=?xn Median (M) If n is odd, then M=(n+12)th term If n is even, then M=(n2)th term +(n2+1)th term 2 Mode The value which occurs most frequently Variance (?2)?2=?(x?¯x)2n Standarad Deviation (S)S=?=??(x?¯x)2n.
  • What do you put on a statistical cheat sheet?

    Data Types

    1Numerical: data expressed with digits; is measurable. 2Categorical: qualitative data classified into categories. 3Mean: the average of a dataset.4Median: the middle of an ordered dataset; less susceptible to outliers.5Mode: the most common value in a dataset; only relevant for discrete data.
  • In statistics and probability theory the Greek small letter mu ? is used to denote a population mean or expected value. For example, the expected value of the mean under the null hypothesis would be denoted by ?0 while the expected value of the alternative hypothesis will be denoted by ?1.

STAT301: Cheat Sheet

Algebra(i)

a+zbab =z(ii)a(b+c) =ab+ac. (iii) 1pab =1pa 1pb (iv) apa =pa. (v)a < bmeansaislessthanb.a > bmeansaisbiggerthanb.abmeans thataisless than or the sameasb. {The point that cuts the interval [a;b] in half is(a+b)2 ProbabilityThe chance of a certain event happening. Example: Out of 44 calves, 12 weighed less than 90 pounds. The probability of randomly picking a calf from this group which weighs less than 90 pounds is 12=44. Types of studies{ObservationalRecord data on individuals without attempting an intervention. {ExperimentalDeliberatly impose a treatment on individuals. Usually this is done in a random- ized fashion, where some are given a treatment and others a placebo. ConfoundingWhen a observed factor and unobserved factor are mixed-up, making it impossible to decide what is in uencing the response. Types of variableNumerical discrete, numerical continuous and categorical. Figure 1: Shapes of distribution: Symmetric, Uniform (thick/heavy tailed), Left Skewed, Right Skewed and corresponding QQplots

Data Analysis

Checking for normality{Use QQplot, see Figure 1.

1 {A cruder method is to use the 68-95-99.7% rule (check to see whethe data is with one, two and three sample sd of the sample mean).

Measures of center given dataX1;:::;Xn

(i) (Sample) mean X=1n P n i=1Xi.

Example: Average of 1;1;1;3;4;5;6 is x= 3

(ii) Medianis the point which cuts the data in half. Example: Median of 1;1;1;3;4;5;6 is 3. Median of 1;1;3;4;5;6 is 3.5.

Measure of spread of given dataX1;:::;Xn.

(i) (Sample) standard deviations=q1 n1P n i=1(XiX)2. Example: The standard deviation of 1;1;1;3;4;5;6 iss= 2:08.

(iii) Quartiles and Interquartile rangeThe rst quartile cuts the rst half of the data in half and the

second quartile cuts the top part of the data in half. IQR = 3rd quartile - 1st quartile. Example: The rst and third quartile of 1;1;1;3;4;5;6 is 1 and 5 respectively. Linear transformationSuppose the dataX1;:::;Xnhas sample meanXand sample standard devia- tionsX. We make a linear transformation of the data set using the transformationYi=a+bXi. The sample mean and standard deviation of the new data set isY=a+bXandsY=jbjsXrespectively. Example: 0:5;1:5;2;3:2;3:8 has mean 2:2 and standard deviation 1:3. We transform it usingY=

12X. The new data set is 6:0;18:0;24:0;38:4;45:6 which has mean 122:2 and standard deviation

1:312.

Z-score calculationsSupposeXis a random variable with meanand standard deviation, the z-transform isZ=X . The mean and standard deviation of the z-transform is zero and one. The z-transform tells us how many standard deviations an observation,X, is from the mean.

Normal distribution

Normal calculations. Note the normal distribution is (i) symmetric about the mean (ii) total area is one (iii) the y-axis is positive. {Question: Suppose the random variableXis known to come from a normal distribution with mean 5 and standard deviation 2N(5;2). What is the chanceXwill be less than 6? {Answer: Make z-transformz=652 = 0:5 then look up 0:5 (from outside into the z-tables) to giveP(X6) =P(Z0:5) = 0:69:= Question: Suppose thatXis known to come from a normal distribution with mean 5 and standard deviation 2N(5;2). If an observationXis in the 85th percentile what isX? Answer: Look up 0:85 (from inside to outside) the table, which corresponds to 1:04, soX= 5 +

1:042 = 7:08.

Rule of Thumb: If data is normally distributed, the roughly speaking 68% of the data lies within one standard deviation of the mean, 95% of the data lies within two standard deviations of the mean and 99.8% of the data lies within 3 standard deviations of the mean. 2

Figure 2: The distribution of averages

The sample mean

The sample meanSuppose a random sampleX1;:::;Xnis drawn from a population, where the mean isand the standard deviation is. The average, usually called the sample mean,X= 1n (X1+X2+:::+Xn) =1n P n i=1Xiis an estimator of the sample mean.

Mean and standard error of the sample mean{The mean of the sample mean (the average of the average) is.

{The standard error (variability) of the sample mean is=pn. The standard error informs us how variable the estimator is. The smaller the standard error, the less variable it will be. {Example: A population has mean= 67 and standard deviation= 3:8. A sample of 5 is drawn and the average is taken. The average will change from sample to sample, but it is estimating the population mean. The mean of the average,X, (the average of the average) is

again= 67 (it is estimating this value, so it unbiased) and the standard error of the average,X, iss:e==pn=3:8p5

The distribution of the sample mean{Normal dataIf the distribution of the population is normal (examples include heights of one

gender) then the distribution of the sample mean (no matter how big or small the sample size) will be normal. Example 1: Female heights are normally distributed with mean 64 inches and standard deviation

2:5 inches (N(64;2:5)). A sample of size three is taken the average isN(64;2:5p3

Example 2: Female heights are normally distributed with mean 64 inches and standard deviation

2:5 inches (N(64;2:5)). A sample of size 50 is taken the average isN(64;2:5p50

{Non-normal dataIf the distribution of the population is not normal (examples include the number of M&Ms in a bag) then the distribution of the sample mean will be close to normal if the sample size is sucientlylarge. How large is large depends on how close to normal the original distribution. Example 1: The mean number of M&Ms in a bag is= 13:54 with standard deviation= 4:64. The average in 5 bags of M&Ms will have mean= 13:54 and standard error se=4:26=p5, but it will NOT be normally distributed because the original data is not normal. Example 2: The mean number of M&Ms in a bag is= 13:54 with standard deviation= 4:64. The average in 5 bags of M&Ms will be close to normal withN(13:54;4:26=p40). If the sample mean is close to normal we can use all the usual normal calculations (using the mean and standard error) to calculate probabilities. 3

Inference for the sample mean

Condence IntervalsA condence interval is an interval where we believe withC% condence the population mean lies. TypicallyC= 95%;99%;90%. To construct a condence interval using the sample meanX(which is evaluated from the data) we need to be sure that the sample mean is normally distributed (either by normality of the data or the sample size being large enough for the CLT to kick in). We consider the two cases, which depends on whether the population standard deviation in known or not. (i)Known population standard deviationIf for some reason the population standard deviation is known but the population mean is unknown the the 95% CI for the mean ishX1:96pn i (we look up 2.5% in the z-tables to get 1.96). (ii)unknown population standard deviationIf the population standard deviation is unknown then we need toestimateit from the data. If the sample size isn, we replace the normal distribution with thet-distribution with (n1) degrees of freedom. The 95% CI for the mean ishXtn1(2:5%)spn i (remember we need to look up 2.5% each side). As the sample size grows the dierence between the normal and the t-distribution becomes less. ExampleThe sample size is 30, the sample mean is 0:5 and sample standard deviations= 4, the 95% CI is [0:52:044=p30]. Margin of ErrorThis is half the length of the condence interval. ExampleThe margin of error of 95% CI [3;8] is MoE = (83)=2. Formula for Margin of ErrorIf the population standard deviation is known, then the MoE for a 95%

CI is 1:96pn

. We can use this to nd the minimum sample size to obtain a given margin of error: n= (1:96=MoE)2. Notes: {The larger the standard deviationthe larger the sample size we will need. {If the standard deviation is unknown then bounds are given say, its somewhere between1to

2. Use the largest standard deviation to get the smallest margin of error.

{To decrease the margin of error frommtom=Pyou need to increase the sample size by a factor P 2. Testing the meanDepending on what the alternative of interest is, there are three dierent possible test set-ups. To reduce algebra we will assume the mean under investigation is 5. {H0:= 5 againstHA:6= 5. {H0:5 againstHA: >5. {H0:5 againstHA: <5. Which hypothesis you use depends on the alternative that you want to `prove'. Example: The mean height of females 30 years ago was known to be 63 inches. It is believed that female heights have increased over the past 30 years, what is the hypothesis of interest? AnswerH

063 againstHA: >63.

Let us suppose thatX1;:::;Xn(these are numbers) is a random sample of sizen, drawn from a population with mean(this is what we are investigating) and standard deviation. We will assume that the sample size is large enough such that the sample mean is normally distributed with meanand standard error=pn. If the population standard deviation is unknown and is instead estimated from the data, then in all the calculation use at-distribution withn1-degrees of freedom rather than the standard normal distribution. 4 Calculating the p-valueThe p-value is always calculated under the null. This means determining the chance of the observations if the null were true (how viable is the null)? Example 1: We test the hypothesisH0:= 5 againstHA:6= 5. We collect a random sample of size 30, the sample mean based on this sample isX= 6 and the sample standard deviation iss= 3.

The t-transform ist=Xs=

pn =653=p30 = 1:825. To calculate the p-value:

1 Calculate the smallest area under the plot, in this case it is the area to the RIGHT of 1.825.

Using t-tables with 29df, we see that it is between 2.5-5%.

2. The p-value for thetwo-sided test, istwotimes this area, which is between 5-10%.

Example 2: We test the hypothesisH0:5 againstHA: >5. We collect a random sample of size 30, the sample mean based on this sample isX= 6 and the sample standard deviation iss= 3.

The t-transform ist=Xs=

pn =653=p30 = 1:825. To calculate the p-value:

1 Check to see the direction of the alternative. SinceHA: >5, the alternative is pointing

RIGHT.

2. The p-value for thisone-sided test, is the area to the RIGHT oft= 1:825. From tables this

area, is between 5-10%.

3. For the one-sided test the p-value is this area, which is between 5-10%.

Example 3: We test the hypothesisH0:5 againstHA: <5. We collect a random sample of size 30, the sample mean based on this sample isX= 6 and the sample standard deviation iss= 3.

The t-transform ist=Xs=

pn =653=p30 = 1:825. To calculate the p-value:

1 Check to see the direction of the alternative. SinceHA: <5, the alternative is pointing

LEFT.

2. The p-value for thisone-sided test, is the area to the LEFT oft= 1:825. From tables this area,

is between 90-95%.

3. For the one-sided test the p-value is this area, which is between 90-95%.

The decision processThe decision process is made at the(typically 5%) signicance level. We

reject the null and say there is evidence to suggest the alternative is true (or equivalently there is

evidence to reject the null), if the p-value islessthan%. is often called the type I error (or signicance level). The largerthe more likely we are to falsely reject the null when the null is true. ExampleIn a tomato packing plant, the mean weight of tomato boxes is tested at the% signicance

level, every hour. If the machine is working correctly for every 100 tests, on average we will falsely

reject (determine the machine faulty)times. Condence intervals and p-valuesThe (100)% (eg. 95%) condence interval and a test done at the% (%) signicance level are connected in the sense that bounds for p-values can be deduced

from the condence interval (this is because the length of the condence interval and the non-rejection

region are the same). ExampleThe 95% CI for the mean is [0:5;4]. The sample mean isX= (4 + 0:5)=2 = 2:25. Using this we can deduce the following:

1. Two-sided tests

5 (a) We test the hypothesisH0:= 0 againstHA:6= 0. Since 0 is not in the interval [0:5;4] the p-value for this two sided test is less than 5%. This means the smallest area is less than

2:5% and is the area to the RIGHT ofX= 2:25 (centered about zero).

(b) We test the hypothesisH0:= 1 againstHA:6= 1. 1 is inside the interval [0:5;4], thus it is plausible that the mean is 1. The p-value for this two-sided test is greater than 5%. This means that p-value for the smallest area is greater than 2:5% and is the area to the

RIGHT ofX= 2:25 (centered about one).

2. One-sided test pointing RIGHTWe test the hypothesisH0:0 againstHA: >0. The p-value is the area to the RIGHT

ofX= 2:25 (centered about zero), which we have shown in (a) is LESS than 2.5%. We test the hypothesisH0:1 againstHA: >1. The p-value is the area to the RIGHT ofX= 2:25 (centered about one), which we have shown in (b) is GREATER than 2.5%.

3. One-sided test pointing LEFTWe test the hypothesisH0:0 againstHA: <0. The p-value is the area to the LEFT ofX= 2:25 (centered about zero), which we have shown in (a) is GREATER than 97.5%.

We test the hypothesisH0:1 againstHA: <1. The p-value is the area to the LEFT

ofX= 2:25 (centered about one), which we have shown in (b) is less than 97.5%. But sinceX= 2:25 is on the right of 1 it has to be greater than 50%. Thus the p-value is between 50 to

97:5%.

Type I, Type II errors and PowerWhenever we do a test we always base the result on a pre-set sig- nicance level (typically 5% usually denoted as). These are features about the statistical procedure and not the data itself. Example, suppose we test for gestational diabetes and use the hypothesisH0:140 against H

A: >140.

{Type I erroris the same as the signicance level =% and is pre-set by us. What it means: If we set= 5%, then the proportion of healthy women with= 140, that we would falsely diagnose as having gestational diabetes is 5%. {Type II errorGiven an alternative of interest the type II error is the chance of rejecting the null when actually that alternative is true. What it means: If someone is said to have severe diabetes when their mean is 148 then the type II error is the probability our test does not detect

her. It can be calculated using: We see that when the sample size is 4 and the test is done atthe 5% level the chance of this happening is 10:9907 = 0:93%.

{PowerThe power is the chance of rejecting null when the alternative is true. Using the above plot we see the power for the alternative= 148 is 99:07%. Increasing the type I error increases power. Increasing sample size increases power. 6

Comparing populations

One of the most common statistical methods is when samples are collected from two dierent populations

and compared (for example does a drug work verses a placebo). Hence there are two dierent data sets.quotesdbs_dbs5.pdfusesText_9
[PDF] statistics formulas class 12

[PDF] statistics lecture notes pdf

[PDF] statistics of paramagnetism

[PDF] statistics on flexible working

[PDF] statistics using excel tutorial

[PDF] statistique exercices corrigés

[PDF] stats that prove lebron is the goat

[PDF] status of sewage treatment in india

[PDF] status signal in 8086

[PDF] statutory holidays 2020 usa

[PDF] stay in place bridge deck forms

[PDF] stayman convention audrey grant

[PDF] std 9 maths chapter 2

[PDF] steady state exercise physiology

[PDF] steel deck institute