Chapter 7: Inference for Distributions
A visual comparison of normal and paranormal distribution Lower caption says
'Paranormal Distribution' - no idea why the graphical artifact is occurring. http://stats.stackexchange.com/questions/423/what-is-your-favorite-data-analysis-cartoon 2
7.1: Inference for the Mean of a Population - Goals
• Be able to distinguish the standard deviation from the standard error of the sample mean.
• Be able to construct a level C confidence interval
(without knowing ) and interpret the results.
• Perform a one-sample t significance and summarize the results. • Be able to determine when the t procedure is valid.
3
Conditions for Inference (Chapter 6)
1. The variable we measure has a Normal distribution with mean and standard deviation σ.
2. We don’t know , but we do know σ.
3. We have an SRS from the population of interest. 4
Table A vs. Table D
Table A
Standard normal (z)
P(Z ≤ z) df not required
Table D t-distribution P(T > t) df required
7
Example: t critical values
What is the t critical value for the following:
a) Central area = 0.95, df = 10
b) Central area = 0.95, df = 60
c) Central area = 0.95, df = 100
d) Central area = 0.95, z curve
e) Upper area = 0.99, df = 10
f) lower area = 0.99, df = 10
8
Summary: CI
Confidence Interval x ± t*(df)
s n Upper Confidence Bound < x + t*(df)
Lower Confidence Bound > x - t*(df)
Sample Size
t ' s n
m
2
s n s n 9
Example: Sample size
You are in charge of quality control in your food company. You sample randomly four packs of cherry tomatoes. The average weight from your four boxes is 222 g with a sample standard deviation of 5 g.
a) What sample size is required to obtain a margin of error of 2 g at a 95% confidence level? 10
Single mean test: Summary
Null hypothesis: H0: μ = μ0 x 0
Test statistic: t s/ n
Robustness of the t-procedure
• A statistical value or procedure is robust if the calculations required are insensitive to violations of the condition.
• The t-procedure is robust against normality.
– n < 15 : population distribution should be close to normal.
– 15 < n < 40: mild skewedness is acceptable
– n > 40: procedure is usually valid.
12
Inferences for Non-Normal Distributions
• If you know what the distribution is, use the appropriate model.
• If the data is skewed, you can transform the variable. • Use a nonparametric procedure.
13
7.2: Comparing two Means - Goals
• Be able to construct a level C confidence interval for the difference between two means and interpret the results. • Perform a two-sample t significance and summarize the results. • Be able to construct a level C confidence interval for a matched pair and interpret the results.
• Perform a matched pair t significance and summarize the results.
• Be able to determine when the t procedure is valid.
14
Conditions for Inference: 2 - sample
1. Each group is considered to be a sample from a distinct population.
• We have an SRS from the population of interest for each variable.
2. The responses in each group are independent of those in the other group.
3. The variable(s) we measure has a Normal distribution with mean and standard deviation σ.
15
Continuous Probability Distributions Chapter 7 GOALS 1. 2. 3. 4. 5. 6. Understand the difference between discrete and continuous distributions. List the characteristics of the normal probability distribution. Define and calculate z values. Determine the probability an observation is between two points on a normal probability distribution. Determine the probability an observation is above (or below) a point on a normal probability distribution. Use the normal probability distribution to approximate the…
Business Statistics MGSC-372 MGSC 372 The Lognormal Distribution The Lognormal Distribution A continuous random variable X follows a lognormal distribution if its natural logarithm, ln(X), follows a normal distribution. The lognormal distribution is an asymmetric distribution with interesting applications for modeling the probability distributions of stock and other asset prices The Lognormal Distribution • Properties of the lognormal distribution – Skewed to the right – Strictly positive (i.e.…
Probability and Statistics Basic Probability concepts Most inspection and quality control theory deals with statistics to make inference about a population based on information contained in samples. The mechanism we use to make these inferences is probability. We use P E to represent the probability of any event (E) 0 PE 1 The sum of all possible events = 1 P(S) = 1, S = Sample space Definition of Probability The ratio of the chances favoring an event to the total number of chances for and against…
1 Non-Normal Distributions Reading: Christoffersen, Elements of Financial Risk Management, Chapter 6 2 Overview • Returns are conditionally normal if the dynamically standardized returns are normally distributed. A standardized return is zt = Rt/t, where t is the (estimate) of the standard deviation of the return Rt. Typically t comes from a variance forecasting model, e.g. a GARCH model. • Fig.6.1 illustrates how histograms from returns and standardized returns typicallydo not conform to…
hypothesis comparing an observed set of frequencies to an expected distribution. LO2 List and explain the characteristics of the chi-square distribution. LO3 Conduct a goodness-of-fit test for unequal expected frequencies. LO4 Conduct a test of hypothesis to verify that data grouped into a frequency distribution is a sample from a normal distribution. LO5 Use graphical methods to determine if a set of sample data is from a normal distribution. LO6 Conduct a test of hypothesis to determine whether two 17-2…
Chapter 6: Continuous Probability Distributions Study Modules (PPT presentations): Introduction to Continuous Probability Distributions Normal Probability Distribution Discrete Distributions Excel Tutorial: Computing Normal Probabilities Java Applet: Normal Distribution Areas Normal Approximation to Binomial Probabilities Continuous Random Variables: A continuous random variable can assume ____any value_______________ in an interval on the real line or in a collection of intervals…
excellent tool to keep businesses running efficiently and effectively. The normal distribution is the most important pattern of data that occurs in statistics. Rongrong Xie writes in The American Statistician that the reason the normal distribution is interesting is that it has an important use in the statistical theory of drawing conclusions from sample data about the populations from which the samples are drawn. A practical example: Suppose you must establish regulations concerning the maximum number…
Statistics Outline of lecture Purpose of descriptive statistics Describing categorical variables Frequency analysis Describing continuous variables Summary statistics Measures of central tendency, variability and normality Normal distribution Relevant SPSS commands: Descriptives, Compare means, Histograms, Explore Next lab session lbic.navitas.com navitas.com Objectives for today Understand the importance of exploring the characteristics of your data before conducting…
3.1 Measuring Location in a Distribution Where do you Stand? p. 102 Introduce percentile and zscore A) Measuring location: percentile The Pth Percentile of a distribution is the value with p percent of the observations less than or equal to it. (equal or below it) B) Measuring location: zscores 1) Standardizing: Converting observations like for example, each height, from original values to standard deviation units from the mean 2) Standardized value is the zscore…
Chapter 4 Probability and Probability Distributions Classical Interpretation (Way 1) This interpretation of probability arose from games of chance. Common sense and theory tell us that: - The probability of a HEADS from a fair coin is ½ - The probability of an ACE is 4/52 or 1/13 1 - The probability of a SEVEN on two dice is 6/36 or 1/6 First die: Second die: 1 2 3 4 5 6 1 2 3 4 5 6 7 2 3 4 5 6 7 8 3 4 5 6 4 5 6 7 5 6 7 8 6 7 8 9 7 8 9 10 8 9 10 11…