This document discusses key concepts in statistical estimation including:
- Estimation involves using sample data to infer properties of the population by calculating point estimates and interval estimates.
- A point estimate is a single value that estimates an unknown population parameter, while an interval estimate provides a range of plausible values for the parameter.
- A confidence interval gives the probability that the interval calculated from the sample data contains the true population parameter. Common confidence intervals are 95% confidence intervals.
- Formulas for confidence intervals depend on whether the population standard deviation is known or unknown, and the sample size.
This document provides an overview of key concepts related to the normal distribution, sampling distributions, estimation, and hypothesis testing. It defines important terms like the normal curve, z-scores, sampling distributions, point and interval estimates, and the steps of hypothesis testing including stating hypotheses, collecting data, and determining whether to reject the null hypothesis. It also reviews concepts like the central limit theorem, standard error, bias, confidence intervals, types of errors in hypothesis testing, and factors that influence test statistics.
This document discusses inferential statistics and confidence intervals. It introduces confidence intervals for a population mean using the t-distribution when the sample size is small (less than 30). When the population variance is known, the z-distribution can be used. It provides examples of how to calculate 95% and 99% confidence intervals for a population mean using the t-distribution and normal distribution. Formulas for the standard error and reliability coefficients are also presented.
This document discusses estimating population parameters such as proportions, means, and standard deviations from sample data. It covers how to calculate confidence intervals for a population proportion based on a sample proportion. The key steps are to determine the sample proportion, calculate the margin of error using the sample size and a critical z-value, and use these to estimate the confidence interval. An example is provided to demonstrate calculating the confidence interval for a population proportion based on survey data. The summary accurately conveys the main topic and methods discussed in the document in under 3 sentences.
Inferential statistics are often used to compare the differences between the treatment groups. Inferential statistics use measurements from the sample of subjects in the experiment to compare the treatment groups and make generalizations about the larger population of subjects.
Dr. Abhay Pratap Pandey introduces statistical inference and its key concepts. Statistical inference allows making conclusions about a population based on a sample. It involves estimation and hypothesis testing. Estimation determines population parameters using sample statistics. Hypothesis testing determines if sample data provides sufficient evidence to reject claims about population parameters. The document defines key terms like population, sample, parameter, statistic, and discusses properties of estimators like unbiasedness and consistency. It also explains hypothesis testing concepts like null and alternative hypotheses, types of errors, and steps to conduct hypothesis tests on a population mean. An example demonstrates hypothesis testing for a population mean using a z-test.
The document discusses normal and standard normal distributions. It provides examples of using a normal distribution to calculate probabilities related to bone mineral density test results. It shows how to find the probability of a z-score falling below or above certain values. It also explains how to determine the sample size needed to estimate an unknown population proportion within a given level of confidence.
The document discusses the process of sampling from a population. It explains that sampling is used because it is not always possible to study the entire population. It then outlines the 7 key steps to analyzing sample data from a population: 1) estimating population parameters, 2) estimating population variance, 3) computing standard error, 4) specifying confidence level, 5) finding critical values, 6) computing margin of error, and 7) defining the confidence interval. The document provides formulas for estimating means, variances, standard errors, and computing confidence intervals.
Biostatistics is the science of collecting, summarizing, analyzing, and interpreting data in the fields of medicine, biology, and public health. It involves both descriptive and inferential statistics. Descriptive statistics summarize data through measures of central tendency like mean, median, and mode, and measures of dispersion like range and standard deviation. Inferential statistics allow generalization from samples to populations through techniques like hypothesis testing, confidence intervals, and estimation. Sample size determination and random sampling help ensure validity and minimize errors in statistical analyses.
Confidence Intervals in the Life Sciences PresentationNamesS.docxmaxinesmith73660
Confidence Intervals in the Life Sciences Presentation
Names
Statistics for the Life Sciences STAT/167
Date
Fahad M. Gohar M.S.A.S
1
Conservation Biology of Bears
Normal Distribution
Standard normal distribution
Confidence Interval
Population Mean
Population Variance
Confidence Level
Point Estimate
Critical Value
Margin of Error
Welcome to the presentation on Confidence Intervals of Conservation Biology on Bears.
The team will define normal distribution and use an example of variables why this is important. A standard and normal distribution is discussed as well as the difference between standard and other normal distributions. Confidence interval will be defined and how it is used in Conservation Biology and Bears. We will learn how a confidence interval helps researchers estimate of population mean and population variance. The presenters defined a point estimate and try to explain how a point estimate found from a confidence interval. Confidence level is defined and a short explanation of confidence level is related to the confidence interval. Lastly, a critical value and margin of error are explained with examples from the Statdisk.
2
Normal Distribution
A normal distribution is one which has the mean, median, and mode are the same and the standard deviations are apart from the mean in the probabilities that go with the empirical rule. Not all data has the measures of central tendency, since some data sets may not have one unique value which occurs more than once. But every data set has a mean and median. The mean is only good with interval and ratio data, while the median can be used with interval, ratio and ordinal data. Mean is used when they're a lot of outliers, and median is used when there are few.
The normal distribution is continuous, and has only two parameters - mean and variance. The mean can be any positive number and variance can be any positive number (can't be negative - the mean and variance), so there are an infinite number of normal distributions. You want your data to represent the population distribution because when you make claims from the distribution of the sample you took, you want it to represent the whole entire population.
Some examples in the business world: Some industries which use normal distributions are pharmaceutical companies. They model the average blood pressure through normal distributions, and can make medicine which will help majority of the people with high blood pressure. A company can also model its average time to create something using the normal distribution. Several statistics can be calculated with the normal distribution, and hypothesis tests can be done with the normal distribution which models the average time.
Our chosen life science is BEARS. The age of the bears can be modeled by normal distributions and it is important to monitor since that tells us the average age of the bear, and can tell us a lot about the population. If the mean is high and the standard deviatio.
This document provides an overview of Module 5 on sampling distributions. It discusses key concepts like parameters vs statistics, sampling variability, and sampling distributions. It explains that the sampling distribution of a sample mean is a normal distribution with a mean equal to the population mean and standard deviation equal to the population standard deviation divided by the square root of the sample size. The central limit theorem states that as the sample size increases, the distribution of sample means will approach a normal distribution regardless of the shape of the population distribution. The module also covers binomial distributions for sample counts and proportions.
The document outlines statistical inference topics including sampling distributions, estimation, hypothesis testing, and specific statistical tests. The key objectives are to estimate parameters, conduct hypothesis tests, and test associations between variables. It discusses important statistical concepts such as point and interval estimation, properties of sampling distributions, the central limit theorem, and assumptions needed for statistical inference. Confidence intervals are presented as a method of interval estimation that accounts for sample to sample variability in estimates.
Standard Error & Confidence Intervals.pptxhanyiasimple
Certainly! Let's delve into the concept of **standard error**.
## What Is Standard Error?
The **standard error (SE)** is a statistical measure that quantifies the **variability** between a sample statistic (such as the mean) and the corresponding population parameter. Specifically, it estimates how much the sample mean would **vary** if we were to repeat the study using **new samples** from the same population. Here are the key points:
1. **Purpose**: Standard error helps us understand how well our **sample data** represents the entire population. Even with **probability sampling**, where elements are randomly selected, some **sampling error** remains. Calculating the standard error allows us to estimate the representativeness of our sample and draw valid conclusions.
2. **High vs. Low Standard Error**:
- **High Standard Error**: Indicates that sample means are **widely spread** around the population mean. In other words, the sample may not closely represent the population.
- **Low Standard Error**: Suggests that sample means are **closely distributed** around the population mean, indicating that the sample is representative of the population.
3. **Decreasing Standard Error**:
- To decrease the standard error, **increase the sample size**. Using a large, random sample minimizes **sampling bias** and provides a more accurate estimate of the population parameter.
## Standard Error vs. Standard Deviation
- **Standard Deviation (SD)**: Describes variability **within a single sample**. It can be calculated directly from sample data.
- **Standard Error (SE)**: Estimates variability across **multiple samples** from the same population. It is an **inferential statistic** that can only be estimated (unless the true population parameter is known).
### Example:
Suppose we have a random sample of 200 students, and we calculate the mean math SAT score to be 550. In this case:
- **Sample**: The 200 students
- **Population**: All test takers in the region
The standard error helps us understand how well this sample represents the entire population's math SAT scores.
Remember, the standard error is crucial for making valid statistical inferences. By understanding it, researchers can confidently draw conclusions based on sample data. 📊🔍
If you need further clarification or have additional questions, feel free to ask! 😊
---
I've provided a concise explanation of standard error, emphasizing its importance in statistical analysis. If you'd like more details or specific examples, feel free to ask! ¹²³⁴
Source: Conversation with Copilot, 5/31/2024
(1) What Is Standard Error? | How to Calculate (Guide with Examples) - Scribbr. https://www.scribbr.com/statistics/standard-error/.
(2) Standard Error (SE) Definition: Standard Deviation in ... - Investopedia. https://www.investopedia.com/terms/s/standard-error.asp.
(3) Standard error Definition & Meaning - Merriam-Webster. https://www.merriam-webster.com/dictionary/standard%20error.
(4) Standard err
📺Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 7: Estimating Parameters and Determining Sample Sizes
7.1: Estimating a Population Proportion
Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 7: Estimating Parameters and Determining Sample Sizes
7.1: Estimating a Population Proportion
The t-distribution is a probability distribution used for statistical analysis when sample sizes are small or population standard deviations are unknown. It is similar to the normal distribution but with heavier tails, accounting for more uncertainty. The t-distribution is applied in hypothesis testing and constructing confidence intervals to make inferences about population means based on small samples. Its shape depends on degrees of freedom which reflects sample size information. It assumes data is normally distributed and population variance is unknown.
The document discusses key concepts in statistical inference including estimation, confidence intervals, hypothesis testing, and types of errors. It provides examples and formulas for estimating population means from sample data, calculating confidence intervals, stating the null and alternative hypotheses, and making decisions to accept or reject the null hypothesis based on a significance level.
1. Illustrate point and interval estimations.
2. Distinguish between point and interval estimation.
Visit the website for more services it can offer:
https://cristinamontenegro92.wixsite.com/onevs
Biostatistics - the application of statistical methods in the life sciences including medicine, pharmacy, and agriculture.
An understanding is needed in practice issues requiring sound decisions.
Statistics is a decision science.
Biostatistics therefore deals with data.
Biostatistics is the science of obtaining, analyzing and interpreting data in order to understand and improve human health.
Applications of Biostatistics
Design and analysis of clinical trials
Quality control of pharmaceuticals
Pharmacy practice research
Public health, including epidemiology
Genomics and population genetics
Ecology
Biological sequence analysis
Bioinformatics etc.
This document discusses methods for estimating population parameters from sample data, including point estimation, bias, confidence intervals, sample size determination, and hypothesis testing. Key points include defining point estimates as single values representing plausible population values based on sample data, describing how to calculate confidence intervals for population proportions and means using z-tests and t-tests, and outlining how to determine necessary sample sizes to achieve a desired level of accuracy and confidence.
This document discusses the distribution of sample means when taking samples from a population. It explains that as the sample size increases, the distribution of sample means approaches a normal distribution, even if the population is not normally distributed. The mean of the distribution of sample means is equal to the population mean. The variability of the distribution is measured by the standard error, which depends on the sample size and population standard deviation. Larger sample sizes result in smaller standard errors and distributions of sample means that are nearly normal.
This document discusses confidence intervals, which are interval estimates of population parameters that indicate the reliability of sample estimates. The document defines confidence intervals and explains how they are constructed. It also discusses point estimates versus interval estimates and describes how to calculate confidence intervals for means, proportions, and when the population standard deviation is unknown using the t-distribution. Examples are provided to illustrate how to construct confidence intervals in different situations.
The document discusses key concepts related to probability and statistics, including:
- Probability is a number that reflects the likelihood of an event occurring, ranging from 0 to 1.
- Standard deviation measures how spread out values are from the mean.
- The normal distribution is a bell-shaped curve used to model naturally occurring phenomena.
- The t-distribution is similar to the normal distribution but with heavier tails, used when sample sizes are small.
- The normal probability curve is a graphical representation of the normal distribution, and is used to determine probabilities and percentiles.
The document discusses two approaches to statistical inference - the frequentist approach and the Bayesian approach. It focuses on explaining key concepts of the frequentist approach, including point estimates, sampling distributions, confidence intervals, and hypothesis testing. Specifically, it provides examples of how to calculate a 95% confidence interval and conduct hypothesis tests using confidence intervals to determine if differences between groups are statistically significant. The document is intended as an introduction to statistical inference from a frequentist perspective.
Johan Lammers from Statistics Netherlands has been a business analyst and statistical researcher for almost 30 years. In their business, processes have two faces: You can produce statistics about processes and processes are needed to produce statistics. As a government-funded office, the efficiency and the effectiveness of their processes is important to spend that public money well.
Johan takes us on a journey of how official statistics are made. One way to study dynamics in statistics is to take snapshots of data over time. A special way is the panel survey, where a group of cases is followed over time. He shows how process mining could test certain hypotheses much faster compared to statistical tools like SPSS.
GenAI for Quant Analytics: survey-analytics.aiInspirient
Pitched at the Greenbook Insight Innovation Competition as apart of IIEX North America 2025 on 30 April 2025 in Washington, D.C.
Join us at survey-analytics.ai!
Ad
More Related Content
Similar to Inferential Statistics-Part-I mtech.pptx (20)
The document discusses normal and standard normal distributions. It provides examples of using a normal distribution to calculate probabilities related to bone mineral density test results. It shows how to find the probability of a z-score falling below or above certain values. It also explains how to determine the sample size needed to estimate an unknown population proportion within a given level of confidence.
The document discusses the process of sampling from a population. It explains that sampling is used because it is not always possible to study the entire population. It then outlines the 7 key steps to analyzing sample data from a population: 1) estimating population parameters, 2) estimating population variance, 3) computing standard error, 4) specifying confidence level, 5) finding critical values, 6) computing margin of error, and 7) defining the confidence interval. The document provides formulas for estimating means, variances, standard errors, and computing confidence intervals.
Biostatistics is the science of collecting, summarizing, analyzing, and interpreting data in the fields of medicine, biology, and public health. It involves both descriptive and inferential statistics. Descriptive statistics summarize data through measures of central tendency like mean, median, and mode, and measures of dispersion like range and standard deviation. Inferential statistics allow generalization from samples to populations through techniques like hypothesis testing, confidence intervals, and estimation. Sample size determination and random sampling help ensure validity and minimize errors in statistical analyses.
Confidence Intervals in the Life Sciences PresentationNamesS.docxmaxinesmith73660
Confidence Intervals in the Life Sciences Presentation
Names
Statistics for the Life Sciences STAT/167
Date
Fahad M. Gohar M.S.A.S
1
Conservation Biology of Bears
Normal Distribution
Standard normal distribution
Confidence Interval
Population Mean
Population Variance
Confidence Level
Point Estimate
Critical Value
Margin of Error
Welcome to the presentation on Confidence Intervals of Conservation Biology on Bears.
The team will define normal distribution and use an example of variables why this is important. A standard and normal distribution is discussed as well as the difference between standard and other normal distributions. Confidence interval will be defined and how it is used in Conservation Biology and Bears. We will learn how a confidence interval helps researchers estimate of population mean and population variance. The presenters defined a point estimate and try to explain how a point estimate found from a confidence interval. Confidence level is defined and a short explanation of confidence level is related to the confidence interval. Lastly, a critical value and margin of error are explained with examples from the Statdisk.
2
Normal Distribution
A normal distribution is one which has the mean, median, and mode are the same and the standard deviations are apart from the mean in the probabilities that go with the empirical rule. Not all data has the measures of central tendency, since some data sets may not have one unique value which occurs more than once. But every data set has a mean and median. The mean is only good with interval and ratio data, while the median can be used with interval, ratio and ordinal data. Mean is used when they're a lot of outliers, and median is used when there are few.
The normal distribution is continuous, and has only two parameters - mean and variance. The mean can be any positive number and variance can be any positive number (can't be negative - the mean and variance), so there are an infinite number of normal distributions. You want your data to represent the population distribution because when you make claims from the distribution of the sample you took, you want it to represent the whole entire population.
Some examples in the business world: Some industries which use normal distributions are pharmaceutical companies. They model the average blood pressure through normal distributions, and can make medicine which will help majority of the people with high blood pressure. A company can also model its average time to create something using the normal distribution. Several statistics can be calculated with the normal distribution, and hypothesis tests can be done with the normal distribution which models the average time.
Our chosen life science is BEARS. The age of the bears can be modeled by normal distributions and it is important to monitor since that tells us the average age of the bear, and can tell us a lot about the population. If the mean is high and the standard deviatio.
This document provides an overview of Module 5 on sampling distributions. It discusses key concepts like parameters vs statistics, sampling variability, and sampling distributions. It explains that the sampling distribution of a sample mean is a normal distribution with a mean equal to the population mean and standard deviation equal to the population standard deviation divided by the square root of the sample size. The central limit theorem states that as the sample size increases, the distribution of sample means will approach a normal distribution regardless of the shape of the population distribution. The module also covers binomial distributions for sample counts and proportions.
The document outlines statistical inference topics including sampling distributions, estimation, hypothesis testing, and specific statistical tests. The key objectives are to estimate parameters, conduct hypothesis tests, and test associations between variables. It discusses important statistical concepts such as point and interval estimation, properties of sampling distributions, the central limit theorem, and assumptions needed for statistical inference. Confidence intervals are presented as a method of interval estimation that accounts for sample to sample variability in estimates.
Standard Error & Confidence Intervals.pptxhanyiasimple
Certainly! Let's delve into the concept of **standard error**.
## What Is Standard Error?
The **standard error (SE)** is a statistical measure that quantifies the **variability** between a sample statistic (such as the mean) and the corresponding population parameter. Specifically, it estimates how much the sample mean would **vary** if we were to repeat the study using **new samples** from the same population. Here are the key points:
1. **Purpose**: Standard error helps us understand how well our **sample data** represents the entire population. Even with **probability sampling**, where elements are randomly selected, some **sampling error** remains. Calculating the standard error allows us to estimate the representativeness of our sample and draw valid conclusions.
2. **High vs. Low Standard Error**:
- **High Standard Error**: Indicates that sample means are **widely spread** around the population mean. In other words, the sample may not closely represent the population.
- **Low Standard Error**: Suggests that sample means are **closely distributed** around the population mean, indicating that the sample is representative of the population.
3. **Decreasing Standard Error**:
- To decrease the standard error, **increase the sample size**. Using a large, random sample minimizes **sampling bias** and provides a more accurate estimate of the population parameter.
## Standard Error vs. Standard Deviation
- **Standard Deviation (SD)**: Describes variability **within a single sample**. It can be calculated directly from sample data.
- **Standard Error (SE)**: Estimates variability across **multiple samples** from the same population. It is an **inferential statistic** that can only be estimated (unless the true population parameter is known).
### Example:
Suppose we have a random sample of 200 students, and we calculate the mean math SAT score to be 550. In this case:
- **Sample**: The 200 students
- **Population**: All test takers in the region
The standard error helps us understand how well this sample represents the entire population's math SAT scores.
Remember, the standard error is crucial for making valid statistical inferences. By understanding it, researchers can confidently draw conclusions based on sample data. 📊🔍
If you need further clarification or have additional questions, feel free to ask! 😊
---
I've provided a concise explanation of standard error, emphasizing its importance in statistical analysis. If you'd like more details or specific examples, feel free to ask! ¹²³⁴
Source: Conversation with Copilot, 5/31/2024
(1) What Is Standard Error? | How to Calculate (Guide with Examples) - Scribbr. https://www.scribbr.com/statistics/standard-error/.
(2) Standard Error (SE) Definition: Standard Deviation in ... - Investopedia. https://www.investopedia.com/terms/s/standard-error.asp.
(3) Standard error Definition & Meaning - Merriam-Webster. https://www.merriam-webster.com/dictionary/standard%20error.
(4) Standard err
📺Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 7: Estimating Parameters and Determining Sample Sizes
7.1: Estimating a Population Proportion
Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 7: Estimating Parameters and Determining Sample Sizes
7.1: Estimating a Population Proportion
The t-distribution is a probability distribution used for statistical analysis when sample sizes are small or population standard deviations are unknown. It is similar to the normal distribution but with heavier tails, accounting for more uncertainty. The t-distribution is applied in hypothesis testing and constructing confidence intervals to make inferences about population means based on small samples. Its shape depends on degrees of freedom which reflects sample size information. It assumes data is normally distributed and population variance is unknown.
The document discusses key concepts in statistical inference including estimation, confidence intervals, hypothesis testing, and types of errors. It provides examples and formulas for estimating population means from sample data, calculating confidence intervals, stating the null and alternative hypotheses, and making decisions to accept or reject the null hypothesis based on a significance level.
1. Illustrate point and interval estimations.
2. Distinguish between point and interval estimation.
Visit the website for more services it can offer:
https://cristinamontenegro92.wixsite.com/onevs
Biostatistics - the application of statistical methods in the life sciences including medicine, pharmacy, and agriculture.
An understanding is needed in practice issues requiring sound decisions.
Statistics is a decision science.
Biostatistics therefore deals with data.
Biostatistics is the science of obtaining, analyzing and interpreting data in order to understand and improve human health.
Applications of Biostatistics
Design and analysis of clinical trials
Quality control of pharmaceuticals
Pharmacy practice research
Public health, including epidemiology
Genomics and population genetics
Ecology
Biological sequence analysis
Bioinformatics etc.
This document discusses methods for estimating population parameters from sample data, including point estimation, bias, confidence intervals, sample size determination, and hypothesis testing. Key points include defining point estimates as single values representing plausible population values based on sample data, describing how to calculate confidence intervals for population proportions and means using z-tests and t-tests, and outlining how to determine necessary sample sizes to achieve a desired level of accuracy and confidence.
This document discusses the distribution of sample means when taking samples from a population. It explains that as the sample size increases, the distribution of sample means approaches a normal distribution, even if the population is not normally distributed. The mean of the distribution of sample means is equal to the population mean. The variability of the distribution is measured by the standard error, which depends on the sample size and population standard deviation. Larger sample sizes result in smaller standard errors and distributions of sample means that are nearly normal.
This document discusses confidence intervals, which are interval estimates of population parameters that indicate the reliability of sample estimates. The document defines confidence intervals and explains how they are constructed. It also discusses point estimates versus interval estimates and describes how to calculate confidence intervals for means, proportions, and when the population standard deviation is unknown using the t-distribution. Examples are provided to illustrate how to construct confidence intervals in different situations.
The document discusses key concepts related to probability and statistics, including:
- Probability is a number that reflects the likelihood of an event occurring, ranging from 0 to 1.
- Standard deviation measures how spread out values are from the mean.
- The normal distribution is a bell-shaped curve used to model naturally occurring phenomena.
- The t-distribution is similar to the normal distribution but with heavier tails, used when sample sizes are small.
- The normal probability curve is a graphical representation of the normal distribution, and is used to determine probabilities and percentiles.
The document discusses two approaches to statistical inference - the frequentist approach and the Bayesian approach. It focuses on explaining key concepts of the frequentist approach, including point estimates, sampling distributions, confidence intervals, and hypothesis testing. Specifically, it provides examples of how to calculate a 95% confidence interval and conduct hypothesis tests using confidence intervals to determine if differences between groups are statistically significant. The document is intended as an introduction to statistical inference from a frequentist perspective.
Johan Lammers from Statistics Netherlands has been a business analyst and statistical researcher for almost 30 years. In their business, processes have two faces: You can produce statistics about processes and processes are needed to produce statistics. As a government-funded office, the efficiency and the effectiveness of their processes is important to spend that public money well.
Johan takes us on a journey of how official statistics are made. One way to study dynamics in statistics is to take snapshots of data over time. A special way is the panel survey, where a group of cases is followed over time. He shows how process mining could test certain hypotheses much faster compared to statistical tools like SPSS.
GenAI for Quant Analytics: survey-analytics.aiInspirient
Pitched at the Greenbook Insight Innovation Competition as apart of IIEX North America 2025 on 30 April 2025 in Washington, D.C.
Join us at survey-analytics.ai!
保密服务圣地亚哥州立大学英文毕业证书影本美国成绩单圣地亚哥州立大学文凭【q微1954292140】办理圣地亚哥州立大学学位证(SDSU毕业证书)毕业证书购买【q微1954292140】帮您解决在美国圣地亚哥州立大学未毕业难题(San Diego State University)文凭购买、毕业证购买、大学文凭购买、大学毕业证购买、买文凭、日韩文凭、英国大学文凭、美国大学文凭、澳洲大学文凭、加拿大大学文凭(q微1954292140)新加坡大学文凭、新西兰大学文凭、爱尔兰文凭、西班牙文凭、德国文凭、教育部认证,买毕业证,毕业证购买,买大学文凭,购买日韩毕业证、英国大学毕业证、美国大学毕业证、澳洲大学毕业证、加拿大大学毕业证(q微1954292140)新加坡大学毕业证、新西兰大学毕业证、爱尔兰毕业证、西班牙毕业证、德国毕业证,回国证明,留信网认证,留信认证办理,学历认证。从而完成就业。圣地亚哥州立大学毕业证办理,圣地亚哥州立大学文凭办理,圣地亚哥州立大学成绩单办理和真实留信认证、留服认证、圣地亚哥州立大学学历认证。学院文凭定制,圣地亚哥州立大学原版文凭补办,扫描件文凭定做,100%文凭复刻。
特殊原因导致无法毕业,也可以联系我们帮您办理相关材料:
1:在圣地亚哥州立大学挂科了,不想读了,成绩不理想怎么办???
2:打算回国了,找工作的时候,需要提供认证《SDSU成绩单购买办理圣地亚哥州立大学毕业证书范本》【Q/WeChat:1954292140】Buy San Diego State University Diploma《正式成绩单论文没过》有文凭却得不到认证。又该怎么办???美国毕业证购买,美国文凭购买,【q微1954292140】美国文凭购买,美国文凭定制,美国文凭补办。专业在线定制美国大学文凭,定做美国本科文凭,【q微1954292140】复制美国San Diego State University completion letter。在线快速补办美国本科毕业证、硕士文凭证书,购买美国学位证、圣地亚哥州立大学Offer,美国大学文凭在线购买。
美国文凭圣地亚哥州立大学成绩单,SDSU毕业证【q微1954292140】办理美国圣地亚哥州立大学毕业证(SDSU毕业证书)【q微1954292140】录取通知书offer在线制作圣地亚哥州立大学offer/学位证毕业证书样本、留信官方学历认证(永久存档真实可查)采用学校原版纸张、特殊工艺完全按照原版一比一制作。帮你解决圣地亚哥州立大学学历学位认证难题。
主营项目:
1、真实教育部国外学历学位认证《美国毕业文凭证书快速办理圣地亚哥州立大学办留服认证》【q微1954292140】《论文没过圣地亚哥州立大学正式成绩单》,教育部存档,教育部留服网站100%可查.
2、办理SDSU毕业证,改成绩单《SDSU毕业证明办理圣地亚哥州立大学成绩单购买》【Q/WeChat:1954292140】Buy San Diego State University Certificates《正式成绩单论文没过》,圣地亚哥州立大学Offer、在读证明、学生卡、信封、证明信等全套材料,从防伪到印刷,从水印到钢印烫金,高精仿度跟学校原版100%相同.
3、真实使馆认证(即留学人员回国证明),使馆存档可通过大使馆查询确认.
4、留信网认证,国家专业人才认证中心颁发入库证书,留信网存档可查.
《圣地亚哥州立大学学位证书的英文美国毕业证书办理SDSU办理学历认证书》【q微1954292140】学位证1:1完美还原海外各大学毕业材料上的工艺:水印,阴影底纹,钢印LOGO烫金烫银,LOGO烫金烫银复合重叠。文字图案浮雕、激光镭射、紫外荧光、温感、复印防伪等防伪工艺。
高仿真还原美国文凭证书和外壳,定制美国圣地亚哥州立大学成绩单和信封。毕业证网上可查学历信息SDSU毕业证【q微1954292140】办理美国圣地亚哥州立大学毕业证(SDSU毕业证书)【q微1954292140】学历认证生成授权声明圣地亚哥州立大学offer/学位证文凭购买、留信官方学历认证(永久存档真实可查)采用学校原版纸张、特殊工艺完全按照原版一比一制作。帮你解决圣地亚哥州立大学学历学位认证难题。
圣地亚哥州立大学offer/学位证、留信官方学历认证(永久存档真实可查)采用学校原版纸张、特殊工艺完全按照原版一比一制作【q微1954292140】Buy San Diego State University Diploma购买美国毕业证,购买英国毕业证,购买澳洲毕业证,购买加拿大毕业证,以及德国毕业证,购买法国毕业证(q微1954292140)购买荷兰毕业证、购买瑞士毕业证、购买日本毕业证、购买韩国毕业证、购买新西兰毕业证、购买新加坡毕业证、购买西班牙毕业证、购买马来西亚毕业证等。包括了本科毕业证,硕士毕业证。
How to regulate and control your it-outsourcing provider with process miningProcess mining Evangelist
Oliver Wildenstein is an IT process manager at MLP. As in many other IT departments, he works together with external companies who perform supporting IT processes for his organization. With process mining he found a way to monitor these outsourcing providers.
Rather than having to believe the self-reports from the provider, process mining gives him a controlling mechanism for the outsourced process. Because such analyses are usually not foreseen in the initial outsourcing contract, companies often have to pay extra to get access to the data for their own process.
Tijn van der Heijden is a business analyst with Deloitte. He learned about process mining during his studies in a BPM course at Eindhoven University of Technology and became fascinated with the fact that it was possible to get a process model and so much performance information out of automatically logged events of an information system.
Tijn successfully introduced process mining as a new standard to achieve continuous improvement for the Rabobank during his Master project. At his work at Deloitte, Tijn has now successfully been using this framework in client projects.
Dimension Data has over 30,000 employees in nine operating regions spread over all continents. They provide services from infrastructure sales to IT outsourcing for multinationals. As the Global Process Owner at Dimension Data, Jan Vermeulen is responsible for the standardization of the global IT services processes.
Jan shares his journey of establishing process mining as a methodology to improve process performance and compliance, to grow their business, and to increase the value in their operations. These three pillars form the foundation of Dimension Data's business case for process mining.
Jan shows examples from each of the three pillars and shares what he learned on the way. The growth pillar is particularly new and interesting, because Dimension Data was able to compete in a RfP process for a new customer by providing a customized offer after analyzing the customer's data with process mining.
Philipp Horn has worked in the Business Intelligence area of the Purchasing department of Volkswagen for more than 5 years. He is a front runner in adopting new techniques to understand and improve processes and learned about process mining from a friend, who in turn heard about it at a meet-up where Fluxicon had participated with other startups.
Philipp warns that you need to be careful not to jump to conclusions. For example, in a discovered process model it is easy to say that this process should be simpler here and there, but often there are good reasons for these exceptions today. To distinguish what is necessary and what could be actually improved requires both process knowledge and domain expertise on a detailed level.
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...disnakertransjabarda
Gen Z (born between 1997 and 2012) is currently the biggest generation group in Indonesia with 27.94% of the total population or. 74.93 million people.
Mitchell Cunningham is a process analyst with experience across the business process management lifecycle. He has a particular interest in process performance measurement and the integration of process performance data into existing process management methodologies.
Suncorp has an established BPM team and a single claims-processing IT platform. They have been integrating process mining into their process management methodology at a range of points across the process lifecycle. They have also explored connecting process mining results to service process outcome measures, like customer satisfaction. Mitch gives an overview of the key successes, challenges and lessons learned.
Mieke Jans is a Manager at Deloitte Analytics Belgium. She learned about process mining from her PhD supervisor while she was collaborating with a large SAP-using company for her dissertation.
Mieke extended her research topic to investigate the data availability of process mining data in SAP and the new analysis possibilities that emerge from it. It took her 8-9 months to find the right data and prepare it for her process mining analysis. She needed insights from both process owners and IT experts. For example, one person knew exactly how the procurement process took place at the front end of SAP, and another person helped her with the structure of the SAP-tables. She then combined the knowledge of these different persons.
1. Inferential Statistics
Why do you need inferential statistics?
Making an inference about the population from a sample.
Allow us to use the information learned from descriptive
statistics.
It extends beyond the immediate data.
Used to infer from sample data what is the population might
think.
Used to make judgements about the probability that an observed
difference between groups is a dependable one or one that might
have happened by chance in the study.
2. Inferential Statistics
Statistical inference is the procedure by which we reach a conclusion about a population
on the basis of the information contained in a sample drawn from that population.
Statistical Inference (Two General Areas)
Estimation
Hypothesis Testing
Estimation process use sample data to calculate some statistics that serves as an
approximation of the corresponding parameter of the population from which the
sample was drawn.
Types of estimate
Point estimate: A point estimate is a single numerical value used to estimate the
corresponding population parameter.
Interval estimate: An interval estimate consists of two numerical values defining a range
of values that, with a specified degree of confidence, most likely includes the parameter
being estimated.
3. Inferential Statistics
A single computed value is referred to as an estimate. The rule that tells us
how to compute this value, or estimate, is referred to as an estimator.
Estimators are usually presented as formulas. For example,
is an estimator of the population mean, µ. The single numerical value that
results from evaluating this formula is called an estimate of the parameter
µ.
In many cases, a parameter may be estimated by more than one estimator.
One of the desired property of a good estimator is unbiasedness.
An estimator, say, T, of the parameter θ is said to be an unbiased estimator
of θ if E(T) = θ.
4. Inferential Statistics
The sampled population is the population from which one actually
draws a sample.
The target population is the population about which one wishes to
make an inference.
Only when the target population and the sampled population are the
same it is possible for one to use statistical inference procedures to
reach conclusions about the target population. If the sampled
population and the target population are different, the researcher
can reach conclusions about the target population only on the basis
of nonstatistical considerations like extrapolating findings to the
target population.
5. Inferential Statistics
CONFIDENCE INTERVAL FOR A POPULATION MEAN
Suppose a researchers wishes to estimate the mean of
some normally distributed population.
Draw a random sample of size n from the normally
distributed population and compute , which is used as a
point estimate of µ. Random sampling inherently
involves chance, cannot be expected to be equal to µ.
It would be much more meaningful, therefore, to
estimate µ by an interval that somehow communicates
information regarding the probable magnitude of µ.
6. Inferential Statistics
CONFIDENCE INTERVAL FOR A POPULATION MEAN
If sampling is from a normally distributed
population, the sampling distribution of the sample
mean will be normally distributed with a mean
equal to the population mean µ, and a variance is
equal to . From our knowledge of normal
distributions, approximately 95% of the possible
values of constituting the distribution are within
two standard deviations of the mean. The two
points that are two standard deviations from the
mean will contain approximately 95%.
7. Inferential Statistics
Example:
Suppose a researcher, interested in obtaining an estimate of the average level of some
enzyme in a certain human population, takes a sample of 10 individuals, determines the
level of the enzyme in each, and computes a sample mean of . Suppose further it is
known that the variable of interest is approximately normally distributed with a variance of
45. We wish to estimate µ.
8. Inferential Statistics
An approximate 95% confidence interval for µ is given by
Interval Estimate Components:
The interval estimate contains in its centre the point estimate of µ. The 2 we recognize as a
value from the standard normal distribution that tells us within how many standard errors
lie approximately 95% of the possible values of . This value of z is referred to as the
reliability coefficient. The last component, , is the standard error, or standard deviation of
the sampling distribution of . In general, then, a interval estimate may be expressed as
follows:
Sampling from a normal distribution with known variance, an interval estimate for µ is
where is the value of z to the left of which lies and to the right of which lies of the area
under its curve.
9. Inferential Statistics
Interpreting Confidence Intervals:
How do we interpret the interval given by Expression ?
In the present example the reliability coefficient is equal to 2. We say
that in repeated sampling approximately 95% of the intervals
constructed by the above expression will include the population
mean. This interpretation is based on the probability of occurrence of
different values of . We may generalize this interpretation by
designating the total area under the curve of that is outside the
interval as and the area within the interval as 1- .
𝛂 𝛂
10. Inferential Statistics
Probabilistic Interpretation: In repeated sampling, from a normally distributed
population with a known standard deviation, 100(1- ) percent of all intervals of
𝛂
the form will in the long run include the population mean µ.
The quantity (1- ), in this case 0.95, is called the
𝛂 confidence coefficient (or
confidence level), and the interval is called a confidence interval for µ. When
(1- )=0.95,
𝛂 the interval is called the 95% confidence interval for µ.
Practical Interpretation: When sampling is from a normally distributed
population with known standard deviation, we are 100(1- ) percent confident
𝛂
that the single computed interval, , contains the population mean.
Precision: The quantity obtained by multiplying the reliability factor by the
standard error of the mean is called the precision of the estimate. This quantity
is also called the margin of error.
11. Inferential Statistics
Sampling from Nonnormal Populations:
As noted, it will not always be possible or prudent to assume that the
population of interest is normally distributed. Thanks to the central limit
theorem, this will not deter us if we are able to select a large enough sample.
We have learned that for large samples, the sampling distribution of is
approximately normally distributed regardless of how the parent population is
distributed.
Example: Punctuality of patients in keeping appointments is of interest to a
research team. In a study of patient flow through the offices of general
practitioners, it was found that a sample of 35 patients was 17.2 minutes late
for appointments, on average. Previous research had shown the standard
deviation to be about 8 minutes. The population distribution was felt to be
nonnormal. What is the 90% confidence interval for µ, the true mean amount
of time late for appointments?
12. Inferential Statistics
Since the sample size is fairly large (greater than 30), and since the population
standard deviation is known, we draw on the central limit theorem and
assume the sampling distribution of to be approximately normally
distributed. From the normal distribution Table, we find the reliability
coefficient corresponding to a confidence coefficient of 0.90 to be about
1.645. The standard error is
Therefore, 90% percent confidence interval for µ is
Frequently, when the sample is large enough for the application of the central
limit theorem, the population variance is unknown. In that case, we use the
sample variance as a replacement for the unknown population variance in the
formula for constructing a confidence interval for population mean.
13. t-distribution
One does not have knowledge of the population mean and
variance, and cannot use the statistic to construct Confidence
intervals of the mean. The statistic,
is normally distributed when the population is normally
distributed and is at least approximately normally distributed
when n is large, regardless of the functional form of the
population, we cannot make use of this fact because is
𝜎
unknown.
14. t-distribution
Most logical solution, we use the sample Standard Deviation (SD),
as an approximation of . This is justifiable when n ≥ 30 and also the use of
𝜎
normal distribution theory to construct confidence interval for
When we have a small sample an alternative procedure for constructing a
confidence interval is the use of Student’s t distribution or simply t
distribution.
William S Gosset was a statistician employed by the Guinness brewing
company which had stipulated that he cannot publish his own name.
(GOSSET -pseudonym of student) student’s t distribution.
15. t-distribution
Properties of t distribution:
1. It has a mean = 0
2. symmetrical about the mean.
3. In general, it has a variance greater than 1, but the
variance approaches 1 as the sample size becomes
large. For degree of freedom (df) > 2, the variance of t
distribution is df/(df-2), since here df = n - 1 for n > 3
we may write the variance of the t distribution
= (n-1)/(n-3)
4. Variable t ranges from -∞ to +∞
5. Compared to the normal distribution the t distribution
is less peaked in the centre and has higher tails.
6. The t distribution approaches the normal distribution
as n - 1 approaches infinity
16. t-distribution
Confidence interval using t
Source of reliability coefficient is different
When sampling is from a normal distribution whose standard
deviation , is unknown the 100(1-𝛼) percent confidence interval for
the population mean µ is given by
Applicable: strictly valid when the sample be drawn from a normal distribution
17. t-distribution: Applicability
When to use t distribution:
The t distribution can be used with any statistic having a bell-shaped distribution
(approximately normal).
The sampling distribution of a statistic should be bell-shaped. If any of the following
conditions apply:
The population distribution is normal.
The population distribution is symmetric, unimodal, and without outliers, and the
sample size is at least 30.
The population distribution is moderately skewed, unimodal, without outliers and the
sample size is at least 40.
The sample size is greater than 40, without outliers
The t distribution should not be used with small samples from populations that are
not approximately normal.
18. t-distribution: Example
A researcher conducted a study to evaluate the effect of on job body
mechanism instruction on the work performance of newly employed young
workers. He used two randomly selected groups of subjects, an experimental
group and a control group. The experimental group received one hour of back-
school training provided by an occupational therapist. The control group did
not receive this training. A criterion-referenced body mechanics evaluation
checklist was used to evaluate each worker’s lifting, lowering, pulling and
transferring of objects in the work environment. A correctly performed task
received a score of 1. The 15 control subjects made a mean score of 11.53 on
the evaluation with a standard deviation of 3.681. We assume that these 15
controls behave as a random sample from a population of similar subjects. We
wish to use these sample data to estimate the mean score for the population.
20. t-distribution: Example
Ans: We may use the sample mean 11.53 as a point estimate of the population mean but since
the population standard deviation is unknown, we must assume that the population of values
to be at least approximately normally distributed before constructing a confidence interval for
µ. Let us assume that such an assumption is reasonable and that a 95% confidence interval is
desired. We have our estimator and our standard error is We need now to find the
x̄
reliability coefficient, the value of t associated with a confidence coefficient of 0.95 and n-1
=14 degree of freedom. Since a 95% confidence interval leaves 0.05 of the area under the
curve of t to be equally divided between the two tails, we need the value of t to the right of
which lies 0.025 of the area.
From T distribution table at 14 degrees of freedom: t0.975 = 2.1448
Hence the 95% confidence interval
11.53 ± 2.1448 × (0.9504) = 11.53 ± 2.04 = [9.49, 13.57]
This interval may be interpreted from both the probabilistic and practical points of view. We
are 95% of confidence that the true population mean µ is somewhere between 9.49 and 13.57
because, in repeated sampling, 95% of intervals constructed in like manner will include µ.
21. Deciding between z and t
When we construct a confidence interval for a population mean, we must decide whether to use
a value of z or a value of t as the reliability factor. To make an appropriate choice we must
consider sample size, whether the sampled population is normally distributed, and whether the
population variance is known.
22. Confidence interval for the difference between two population means
Confidence interval for the difference between population means provides information
that is helpful and deciding whether or not it is likely that the two population means are
equal. When the constructed interval does not include zero, we say that the interval
provides evidence that the two-population means are not equal. When the interval
includes zero, we say that the population mean may be equal.
Sampling from Normal Populations with known variences
Population variance are known the 100(1-)% confidence interval for µ1 - µ2 is given
Sampling from Non-normal Populations
The construction of a confidence interval for the difference between two population
means when sampling is from non-normal populations proceeds in the same manner as
sampling from normal populations if the sample sizes n1 and n2 are large. Again, this is a
result of the central limit theorem. If the population variances are unknown, we use the
sample variances to estimate them.
23. Confidence interval for the difference between two population means
Example: Despite common knowledge of the adverse effects of
doing so, many women continue to smoke while pregnant. A
researcher examined the effectiveness of a smoking cessation
program for pregnant women. The mean number of cigarettes
smoked daily at the close of the program by the 328 women who
completed the program was 4.3 with a standard deviation of 5.22.
Among 64 women who did not complete the program, the mean
number of cigarettes smoked per day at the close of the program
was 13 with a standard deviation of 8.97. We wish to construct a 99
percent confidence interval for the difference between the means of
the populations from which the samples may be presumed to have
been selected.
24. Confidence interval for the difference between two population means
No information is given regarding the shape of the distribution of cigarettes smoked per day.
Since our sample sizes are large, however, the central limit theorem assures us that the sampling
distribution of the difference between sample means will be approximately normally distributed
even if the distribution of the variable in the populations is not normally distributed. We may
use this fact as justification for using the z statistic as the reliability factor in the construction of
our confidence interval. Also, since the population standard deviations are not given, we will use
the sample standard deviations to estimate them. The point estimate for the difference between
population means is the difference between sample means, 4.3 – 13.0 = - 8.7. From normal
distribution Table, we find the reliability factor to be 2.58. The estimated standard error is
Our 99 percent confidence interval for the difference between population means is
- 8.7 ± 2.58 (1.1577) = [- 11.7; - 5.7]
We are 99 percent confident that the mean number of cigarettes smoked per day for women
who complete the program is between 5.7 and 11.7 lower than the mean for women who do
not complete the program.
25. Confidence interval for the difference between two population means
t distribution and the difference between means
when population variances are unknown, and we wish to estimate
the difference between two population means with a confidence
interval we can use the t distribution as a source of reliability factor
if certain assumptions are met.
We must know or willing to assume, that the two sampled
populations are normally distributed.
Regarding unknown population variances, two situations may occur:
Situation–I: population variances are equal,
Situation–II: population variances are not equal.
26. Situation–I: population variances are equal
If the assumption of equal population variance is justified, the two samples may be
considered as the estimates of the same quantity, the common variance. Obtain a
pooled estimate of the common variance. Pooled variance is the weighted average of
the sample variances. Sample variances are weighted by their degree of freedom
100(1 - α) percent confidence interval for µ1 - µ2
degree of freedom used in determining the value of t in n1 + n2 – 2
27. Example: population variances are equal
The purpose of a researcher study was to determine the effect of long term
exercise intervention on corporate executives enrolled in a supervised
fitness program. Data were collected on 13 subjects (the exercise group)
who voluntarily entered a supervised exercise program and remained active
for an average of 13 years and 17 subjects (the secondary group) who
elected not to join the fitness program. Among the data collected on the
subjects was the maximum number of sit-ups completed in 30 seconds. The
exercise group has a mean and standard deviation for this variable of 21.0
and 4.9 respectively. The mean and standard deviation for the sedentary
group were 12.1 and 5.6 respectively. We assume that the two populations
of overall muscle condition measures are approximately normally
distributed and that the two population variances are equal. We wish to
construct a 95% confidence interval for the difference between the means
of the populations represented by these two samples.
28. Example: population variances are equal
Pooled estimate of the common population variance
from t distribution table (13+17-2) = 28 degree of freedom and desired 0.95 confidence
interval, reliability factor 2.0484
Confidence interval = (21.0 - 12.1) 2.0484 8.9 4.0085 =
We are 95% confident that the difference between the population mean is somewhere
between 4.9 and 12.9. We can say this because we know that if we were to repeat the
study many, many times and compute confidence intervals in the same way, about 95% of
the intervals would include the difference between the population mean.
Since the interval does not include zero, we conclude that the population means are not
equal.
We can interpret this interval that the difference between the two population means is
estimated to be 8.9 and we are 95% confident that the true value lies between 4.9 and
12.9.
29. Confidence interval for the difference between two population means
Population variance is not equal
When one is unable to conclude that the variances of two populations of
interest are equal even though the two populations may be assumed to
be normally distributed, it is not proper to use t distribution.
Solutions has been proposed by many researchers. But the problem
resolves around the fact that the quality
does not follow t-distribution with n1 + n2 - 2 degree of freedom when
the population variances are not equal.
30. Confidence interval for the difference between two population means
Population variance is not equal
The solution proposed by Cochran consists of completing the reliability factor by the
following formula:
Where for n1 - 1 degrees of freedom, and for n2 - 1 degrees of freedom.
An approximate 100(1- ) percent confidence interval for µ
𝛼 1 - µ2 is given by
31. Example: population variances are not equal
The purpose of a research study was to determine the effect of long-term
exercise intervention on corporate executives enrolled in a supervised
fitness program. Data were collected on 13 subjects (the exercise group)
who voluntarily entered a supervised exercise program and remained
active for an average of 13 years and 17 subjects (the secondary group)
who elected not to join the fitness program. Among the data collected on
the subjects were the maximum number of sit-ups completed in 30
seconds. The exercise group has a mean and standard deviation for this
variable of 21.0 and 4.9 respectively. The mean and standard deviation
for the sedentary group were 12.1 and 5.6 respectively. We assume that
the two populations of overall muscle condition measures are
approximately normally distributed and that the two population
variances are not equal. We wish to construct a 95% confidence interval
for the difference between the means of the populations represented by
these two samples.
32. Example: population variances are not equal
We will use Cochran reliability factor t’. From t distribution Table with 12 degrees of freedom
and
. Similarly, with 16 degrees of freedom and . We now compute
we now construct the 95 percent confidence interval for the difference between the two
population means.
Since the interval does not include zero, we conclude that the population means are not equal.
We can interpret this interval that the difference between the two population means is
estimated to be 8.9 and we are 95% confident that the true value lies between 7.9348 and
16.8348.
33. Determination of sample size for estimating mean
Planning of any survey experiment - How large a sample to take?
Larger than needed - wasteful of resource
Smaller than needed - lead to a result of no practical use
objective:- Interval estimation should have
1) a narrow interval
2) high reliability.
Total width of the interval is twice the magnitude of the quality :
Increasing reliability means a larger reliability coefficient---> increase interval.
34. Determination of sample size for estimating mean
Fixed reliability coefficient and reduce standard error
standard error = , is fixed the only way is to increase n --> take a
larger sample
How large?--> depends on the desired degree of reliability and the
desired interval width
sampling with replacement from an infinite or sufficiently large
population formula
sampling from small finite population without replacement formula
35. Determination of sample size for estimating mean
Sample size estimation formulas require the and population variance
is unknown.
The most frequently used source for estimation of are:
A pilot or preliminary sample may drawn from the population and
computed sample variance(S2
) may be used to estimate . Observations
used in the pilot sample may be computed on a part of the final
sample:
n(the computed sample size) - n1(the pilot sample size)
= n2(the number of observation needed to satisfy the total sample size
requirement)
Estimates of Sigma square may be available from previous or similar
studies.
36. Inference about population variance
Confidence interval for the variance of a normally
distributed population
Distribution normal? - used sample variance as an
approximate estimator of population variance
Wonder about the quality? - check whether the sample
variance is an unbiased estimator of population variance.
To be unbiased - average value of the sample variance over
all possible sample must be equal to the population variance.
E() =
37. Inference about population variance
Draw all possible samples of size two from the
population consisting of the values 6, 8, 10, 12 and
14.
If we compute the sample variance
For each of the possible samples, we obtain the
sample variances as shown in the table
Sampling with replacement
E(s2
), the expected value of the mean of the sample
variance, (0 + 2 + ….. + 2 + 0)/25 = 8
Hence E(s2
) =
S E C O N D D R A W
6 8 10 12 14
F
I
R
S
T
D
R
A
w
6
8
10
12
14
0
2
8
18
32
2
0
2
8
18
8
2
0
2
8
18
8
2
0
2
32
18
8
2
0
38. Inference about population variance
Sampling without replacement: the expected value of s2
,
(0 + 2 + ….. + 2 + 0)/10 = 10
Then E(s2
) = where sampling is with replacement. Results justify the use of for computing
the sample variance.
E(s2
) when sampling is without replacement.
Interval estimation of a population variance
Success depends on our ability to find an approximate sampling distribution
Confidence interval for are usually based on the sampling distribution of
If sample of size n are drawn from a normally distributed population, this quality [ ] has a
distribution known as Chi square distribution with n - 1 degrees of freedom.
To obtain a 100(1-α)% confidence interval for we select values of from the table in such a
way that α/2 is to the left of smaller value of and α/2 is to be the right of the larger values
of .
39. Inference about population variance
The 100(1-α) % confidence interval for is
Confidence interval for , population standard deviation is
Method is widely used but have some draw back. Normality of the population is crucial.
Estimator is not in the centre of the confidence interval because distribution is not
symmetric.
40. Inference about population variance
A random sample of 20 nominally measured 2 mm diameter steel ball bearings is
taken and the diameters are measured precisely. The measurements, in mm, are
as follows:
2.02 1.94 2.09 1.95 1.98 2.00 2.03 2.04 2.08 2.07
1.99 1.96 1.99 1.95 1.99 1.99 2.03 2.05 2.01 2.03
Assuming that the diameters are normally distributed with unknown mean and
unknown variance 2
,
a) Find a 2-sided 95% confidence interval for the variance 2
b) Find a 2-sided confidence interval for the standard deviation
41. Inference about population variance
From the data we calculate and , and .
Hence,
There are 19 degrees of freedom and the critical values of the Chi-square
distribution and
The confidence interval for 2
is
The confidence interval for is
43. Inference about population variances
Confidence interval for the ratio of the variances of the two normally distributed populations:
One way of comparing two variances is to compute their ratio
To use t distribution for constructing a confidence interval for the difference between population
means requires that the population variances be equal. If the two variances are equal, the ratio
will be one. If the confidence interval for the ratio of two populations variances includes 1, we
conclude that the two populations variances, may in fact, be equal.
This is a form of inference, and we must rely on some sampling distribution. This time the
distribution of is utilised provided the following assumptions are met.
Assumption:
1. and are computed from independent samples of size n1 and n2
2. Sample have been drawn from two normally distributed populations
3. let
If the assumption are met, follows a F distribution
44. Inference about population variances
F-distribution: If U and V are independent chi-square random variables with r1 and r2
degrees of freedom, respectively, then: follows an F distribution with r1 numerator degrees
of freedom and r2 denominator degrees of freedom.
This F distribution depends on two-degrees-of-freedom values, one corresponding to the
value of n1 – 1 used in computing, and the other corresponding to the value of n2 – 1 used in
computing . They are usually referred to as the numerator degree of freedom and
denominator degree of freedom.
A confidence interval for at 100(1-α) % confidence is constructed by
the values from F table to the left lies α/2 of the area
the value from F table to the right lies α/2 of the area
45. Inference about population variances
The values of at the intersection of the column headed df1 and the row labelled df2. If we
have extensive table of F distribution, finding would be no trouble. To include every
possible percentile of F would make a very lengthy table. A relationship exists to compute
46. Inference about population variances
Problem#
The variability in the thickness of oxide layers in
semiconductor wafers is a critical characteristic, where
low variability is desirable. A company is investigating two
different ways to mix gases so as to reduce the variability
of the oxide thickness. We produce 16 wafers with each
gas mixture and our results indicate that the standard
deviation is s1 = 1.96Å and s2 = 2.13Å for the two
mixtures. What is the 95% confidence interval for the ratio
between the two variances?
47. Inference about population variances
Given information:
Sample size of population 1: n1 = 16;
sample standard deviation for sample from population 1: s1 = 1.96;
Sample size of population 2: n2 = 16;
Sample standard deviation for sample from population 2: s2 = 2.13.
Since we are looking for a 95% confidence interval, we need two
f values: