Third in a series of four seminars presented to University of North Texas librarians. This presentation focuses on using basic tests that determine the association of two sets of data based on measures of central tendency and variation.
Research method ch07 statistical methods 1naranbatn
This document provides an overview of statistical methods used in health research. It discusses descriptive statistics such as mean, median and mode that are used to describe data. It also covers inferential statistics that are used to infer characteristics of populations based on samples. Specific statistical tests covered include t-tests, which are used to test differences between means, and F-tests, which are used to compare variances. The document explains key concepts in hypothesis testing such as null and alternative hypotheses, type I and type II errors, and statistical power. Parametric tests covered assume the data meet certain statistical assumptions like normality.
This document provides an overview of key concepts in inferential statistics. Inferential statistics allows researchers to make inferences about populations based on samples. It includes techniques like hypothesis testing, t-tests, analysis of variance (ANOVA), regression analysis, and more. The goal is to determine if observed differences are statistically significant rather than due to chance. Inferential statistics helps estimate parameters and analyze variability using statistical models and software.
This document discusses hypothesis testing and inferential statistics. It covers topics like hypothesis testing process, types of errors, differentiating between critical value method and probability value method, tests for one and two populations including z-test, t-test, Wilcoxon test and binomial test. It also discusses assumptions and procedures for tests like pooled t-test, paired t-test, Mann-Whitney test and paired Wilcoxon test. Examples of applying these tests on quantitative and qualitative data are provided.
This document discusses non-parametric statistical tests, which make few assumptions about the distribution of the underlying population. It provides examples of non-parametric tests like the sign test, Wilcoxon rank sum test, and Kruskal-Wallis test. These tests involve ranking all observations from different groups together and applying statistical tests to the ranks rather than the original values. Non-parametric tests are useful when assumptions of parametric tests may not hold but lack power with small samples.
This document discusses parametric tests used for statistical analysis. It introduces t-tests, ANOVA, Pearson's correlation coefficient, and Z-tests. T-tests are used to compare means of small samples and include one-sample, unpaired two-sample, and paired two-sample t-tests. ANOVA compares multiple population means and includes one-way and two-way ANOVA. Pearson's correlation measures the strength of association between two continuous variables. Z-tests compare means or proportions of large samples. Key assumptions and calculations for each test are provided along with examples. The document emphasizes the importance of choosing the appropriate statistical test for research.
This document provides an overview of various statistical analysis techniques used in inferential statistics, including t-tests, ANOVA, ANCOVA, chi-square, regression analysis, and interpreting null hypotheses. It defines key terms like alpha levels, effect sizes, and interpreting graphs. The overall purpose is to explain common statistical methods for analyzing data and determining the probability that results occurred by chance or were statistically significant.
This ppt includes basic concepts about data types, levels of measurements. It also explains which descriptive measure, graph and tests should be used for different types of data. A brief of Pivot tables and charts is also included.
The document provides an overview of inferential statistics. It defines inferential statistics as making generalizations about a larger population based on a sample. Key topics covered include hypothesis testing, types of hypotheses, significance tests, critical values, p-values, confidence intervals, z-tests, t-tests, ANOVA, chi-square tests, correlation, and linear regression. The document aims to explain these statistical concepts and techniques at a high level.
This document outlines topics related to statistics that will be covered. It is divided into 6 parts. Part 1 discusses the role of statistics in research, descriptive statistics, sampling procedures, sample size, and inferential statistics. Part 2 covers choice of statistical tests, defining variables, scales of measurements, and number of samples. Parts 3 and 4 discuss parametric and non-parametric tests. Part 5 is about goodness of fit tests. Part 6 covers choosing correct statistical tests and introduction to multiple regression. The document also provides examples and definitions of key statistical concepts like mean, median, mode, range, and different sampling methods.
The document provides an overview of chi-square tests, including chi-square tests for goodness of fit and tests of independence. It explains that chi-square tests are used with categorical or classified data rather than numerical data. For a chi-square test of goodness of fit, the null hypothesis specifies the expected proportions in different categories. Observed and expected frequencies are calculated and compared using the chi-square statistic. A chi-square test of independence examines whether two categorical variables are related by comparing observed and expected joint frequencies.
This document provides an overview of parametric and non-parametric statistical tests. Parametric tests assume the data follows a known distribution (e.g. normal) while non-parametric tests make no assumptions. Common non-parametric tests covered include chi-square, sign, Mann-Whitney U, and Spearman's rank correlation. The chi-square test is described in more detail, including how to calculate chi-square values, degrees of freedom, and testing for independence and goodness of fit.
INFERENTIAL STATISTICS: AN INTRODUCTIONJohn Labrador
For instance, we use inferential statistics to try to infer from the sample data what the population might think. Or, we use inferential statistics to make judgments of the probability that an observed difference between groups is a dependable one or one that might have happened by chance in this study.
This document discusses non-parametric tests, which are statistical tests that make fewer assumptions about the population distribution compared to parametric tests. Some key points:
1) Non-parametric tests like the chi-square test, sign test, Wilcoxon signed-rank test, Mann-Whitney U-test, and Kruskal-Wallis test are used when the population is not normally distributed or sample sizes are small.
2) They are applied in situations where data is on an ordinal scale rather than a continuous scale, the population is not well defined, or the distribution is unknown.
3) Advantages are that they are easier to compute and make fewer assumptions than parametric tests,
This document discusses descriptive and inferential statistics used in nursing research. It defines key statistical concepts like levels of measurement, measures of central tendency, descriptive versus inferential statistics, and commonly used statistical tests. Nominal, ordinal, interval and ratio are the four levels of measurement, with ratio allowing the most data manipulation. Descriptive statistics describe sample data while inferential statistics allow estimating population parameters and testing hypotheses. Common descriptive statistics include mean, median and mode, while common inferential tests are t-tests, ANOVA, chi-square and correlation. Type I errors incorrectly reject the null hypothesis.
1) The t-test is a statistical test used to determine if there are any statistically significant differences between the means of two groups, and was developed by William Gosset under the pseudonym "Student".
2) The t-distribution is used for calculating t-tests when sample sizes are small and/or variances are unknown. It has a mean of zero and variance greater than one.
3) Paired t-tests are used to compare the means of two related groups when samples are paired, while unpaired t-tests are used to compare unrelated groups or independent samples.
Parametric and non parametric test in biostatistics Mero Eye
This ppt will helpful for optometrist where and when to use biostatistic formula along with different examples
- it contains all test on parametric or non-parametric test
The document discusses hypothesis testing using parametric and non-parametric tests. It defines key concepts like the null and alternative hypotheses, type I and type II errors, and p-values. Parametric tests like the t-test, ANOVA, and Pearson's correlation assume the data follows a particular distribution like normal. Non-parametric tests like the Wilcoxon, Mann-Whitney, and chi-square tests make fewer assumptions and can be used when sample sizes are small or the data violates assumptions of parametric tests. Examples are provided of when to use parametric or non-parametric tests depending on the type of data and statistical test being performed.
Basics of Hypothesis testing for PharmacyParag Shah
This presentation will clarify all basic concepts and terms of hypothesis testing. It will also help you to decide correct Parametric & Non-Parametric test for your data
The document discusses significance tests and their role in hypothesis testing. It defines key terms like p-value, significance level, confidence level, rejection region, and classification of significance tests. The p-value represents the probability of observing the results by chance if the null hypothesis is true. The significance level is set before data collection and represents the probability of incorrectly rejecting the null hypothesis. A p-value less than the significance level leads to rejecting the null hypothesis.
A researcher tested the effectiveness of an herbal supplement on physical fitness using the Marine Physical Fitness Test. 25 college students took the supplement for 6 weeks and averaged a score of 38.68 on the test, compared to the average population score of 35. Using a t-test with α=.05 and df=24, the researcher found the average score of 38.68 was not significantly different than the population mean of 35 (t=2.041, p=.052). Therefore, there is not enough evidence to conclude the supplement had an effect on fitness levels, as the higher average score could be due to chance for this small sample.
This document provides an overview of non-parametric statistical tests. It discusses tests such as the chi-square test, Wilcoxon signed-rank test, Mann-Whitney test, Friedman test, and median test. These tests can be used with ordinal or nominal data when the assumptions of parametric tests are not met. The document explains the appropriate uses and procedures for each non-parametric test.
This document provides steps for analyzing data using a t-test in Excel. It introduces the topic of hypothesis testing and the specific problem of comparing the daily study hours of female and male students. The null and alternative hypotheses are defined, with the null being that the mean study hours are equal between groups. The document then outlines the 5 steps to perform a t-test in Excel: 1) inputting the data, 2) accessing the data analysis tool, 3) selecting the t-test option and inputting the data ranges and hypothesized mean difference, 4) viewing the output, and 5) analyzing the results. The t-test results fail to reject the null hypothesis, indicating the mean study hours are not significantly different between female
This document provides an overview of hypothesis testing including:
1) The four steps of hypothesis testing - stating hypotheses, setting criteria, collecting data, and making a decision. It also discusses types of errors.
2) Factors that influence the outcome like effect size, sample size, and variability. Larger effects, samples, and less variability make rejecting the null hypothesis more likely.
3) Direction hypotheses tests where the alternative predicts a direction of the effect. This allows rejecting the null with smaller differences but in the predicted direction.
4) Effect size measures like Cohen's d provide information beyond just significance. Statistical power is the probability of correctly rejecting a false null hypothesis.
This document provides an overview of inferential statistics and statistical tests that can be used, including correlation tests, t-tests, and how to determine which tests are appropriate. It discusses the assumptions of parametric tests like Pearson's correlation and t-tests, and how to check assumptions graphically and using statistical tests. Specific procedures for conducting correlation analyses in Excel and SPSS are outlined, along with how to interpret and report the results.
The effect birth date has on choosing to study a sports related course at an ...Carl Page
The Relative Age Effect (RAE) in the academic study of sport. The aim of this study was to investigate the relationship of those students being born on a certain date will indeed influence the decision to study a sports related course at an educational institution.
This chapter discusses inferential statistics and the concepts underlying them. It covers key topics like types of inferential statistics (parametric vs nonparametric), important perspectives like generalizing from samples to populations, underlying concepts like null/alternative hypotheses and types of errors. Specific statistical techniques are explained like t-tests, ANOVA, regression, along with key ideas like sampling distributions, standard error, degrees of freedom, and the steps to conduct statistical tests. Different types of samples and issues with gain scores are also addressed.
This document discusses confidence intervals for estimating population means from sample data. It begins by explaining how to calculate point estimates and confidence intervals when the sample size is large (n ≥ 30) using the normal distribution. It then covers calculating confidence intervals when the sample size is small (n < 30) using the t-distribution. The key steps covered are determining the appropriate distribution to use based on sample size and knowledge of the population standard deviation, finding the critical values and margin of error, and calculating the confidence interval. Examples are provided to demonstrate how to construct confidence intervals in different situations.
The document provides an overview of inferential statistics. It defines inferential statistics as making generalizations about a larger population based on a sample. Key topics covered include hypothesis testing, types of hypotheses, significance tests, critical values, p-values, confidence intervals, z-tests, t-tests, ANOVA, chi-square tests, correlation, and linear regression. The document aims to explain these statistical concepts and techniques at a high level.
This document outlines topics related to statistics that will be covered. It is divided into 6 parts. Part 1 discusses the role of statistics in research, descriptive statistics, sampling procedures, sample size, and inferential statistics. Part 2 covers choice of statistical tests, defining variables, scales of measurements, and number of samples. Parts 3 and 4 discuss parametric and non-parametric tests. Part 5 is about goodness of fit tests. Part 6 covers choosing correct statistical tests and introduction to multiple regression. The document also provides examples and definitions of key statistical concepts like mean, median, mode, range, and different sampling methods.
The document provides an overview of chi-square tests, including chi-square tests for goodness of fit and tests of independence. It explains that chi-square tests are used with categorical or classified data rather than numerical data. For a chi-square test of goodness of fit, the null hypothesis specifies the expected proportions in different categories. Observed and expected frequencies are calculated and compared using the chi-square statistic. A chi-square test of independence examines whether two categorical variables are related by comparing observed and expected joint frequencies.
This document provides an overview of parametric and non-parametric statistical tests. Parametric tests assume the data follows a known distribution (e.g. normal) while non-parametric tests make no assumptions. Common non-parametric tests covered include chi-square, sign, Mann-Whitney U, and Spearman's rank correlation. The chi-square test is described in more detail, including how to calculate chi-square values, degrees of freedom, and testing for independence and goodness of fit.
INFERENTIAL STATISTICS: AN INTRODUCTIONJohn Labrador
For instance, we use inferential statistics to try to infer from the sample data what the population might think. Or, we use inferential statistics to make judgments of the probability that an observed difference between groups is a dependable one or one that might have happened by chance in this study.
This document discusses non-parametric tests, which are statistical tests that make fewer assumptions about the population distribution compared to parametric tests. Some key points:
1) Non-parametric tests like the chi-square test, sign test, Wilcoxon signed-rank test, Mann-Whitney U-test, and Kruskal-Wallis test are used when the population is not normally distributed or sample sizes are small.
2) They are applied in situations where data is on an ordinal scale rather than a continuous scale, the population is not well defined, or the distribution is unknown.
3) Advantages are that they are easier to compute and make fewer assumptions than parametric tests,
This document discusses descriptive and inferential statistics used in nursing research. It defines key statistical concepts like levels of measurement, measures of central tendency, descriptive versus inferential statistics, and commonly used statistical tests. Nominal, ordinal, interval and ratio are the four levels of measurement, with ratio allowing the most data manipulation. Descriptive statistics describe sample data while inferential statistics allow estimating population parameters and testing hypotheses. Common descriptive statistics include mean, median and mode, while common inferential tests are t-tests, ANOVA, chi-square and correlation. Type I errors incorrectly reject the null hypothesis.
1) The t-test is a statistical test used to determine if there are any statistically significant differences between the means of two groups, and was developed by William Gosset under the pseudonym "Student".
2) The t-distribution is used for calculating t-tests when sample sizes are small and/or variances are unknown. It has a mean of zero and variance greater than one.
3) Paired t-tests are used to compare the means of two related groups when samples are paired, while unpaired t-tests are used to compare unrelated groups or independent samples.
Parametric and non parametric test in biostatistics Mero Eye
This ppt will helpful for optometrist where and when to use biostatistic formula along with different examples
- it contains all test on parametric or non-parametric test
The document discusses hypothesis testing using parametric and non-parametric tests. It defines key concepts like the null and alternative hypotheses, type I and type II errors, and p-values. Parametric tests like the t-test, ANOVA, and Pearson's correlation assume the data follows a particular distribution like normal. Non-parametric tests like the Wilcoxon, Mann-Whitney, and chi-square tests make fewer assumptions and can be used when sample sizes are small or the data violates assumptions of parametric tests. Examples are provided of when to use parametric or non-parametric tests depending on the type of data and statistical test being performed.
Basics of Hypothesis testing for PharmacyParag Shah
This presentation will clarify all basic concepts and terms of hypothesis testing. It will also help you to decide correct Parametric & Non-Parametric test for your data
The document discusses significance tests and their role in hypothesis testing. It defines key terms like p-value, significance level, confidence level, rejection region, and classification of significance tests. The p-value represents the probability of observing the results by chance if the null hypothesis is true. The significance level is set before data collection and represents the probability of incorrectly rejecting the null hypothesis. A p-value less than the significance level leads to rejecting the null hypothesis.
A researcher tested the effectiveness of an herbal supplement on physical fitness using the Marine Physical Fitness Test. 25 college students took the supplement for 6 weeks and averaged a score of 38.68 on the test, compared to the average population score of 35. Using a t-test with α=.05 and df=24, the researcher found the average score of 38.68 was not significantly different than the population mean of 35 (t=2.041, p=.052). Therefore, there is not enough evidence to conclude the supplement had an effect on fitness levels, as the higher average score could be due to chance for this small sample.
This document provides an overview of non-parametric statistical tests. It discusses tests such as the chi-square test, Wilcoxon signed-rank test, Mann-Whitney test, Friedman test, and median test. These tests can be used with ordinal or nominal data when the assumptions of parametric tests are not met. The document explains the appropriate uses and procedures for each non-parametric test.
This document provides steps for analyzing data using a t-test in Excel. It introduces the topic of hypothesis testing and the specific problem of comparing the daily study hours of female and male students. The null and alternative hypotheses are defined, with the null being that the mean study hours are equal between groups. The document then outlines the 5 steps to perform a t-test in Excel: 1) inputting the data, 2) accessing the data analysis tool, 3) selecting the t-test option and inputting the data ranges and hypothesized mean difference, 4) viewing the output, and 5) analyzing the results. The t-test results fail to reject the null hypothesis, indicating the mean study hours are not significantly different between female
This document provides an overview of hypothesis testing including:
1) The four steps of hypothesis testing - stating hypotheses, setting criteria, collecting data, and making a decision. It also discusses types of errors.
2) Factors that influence the outcome like effect size, sample size, and variability. Larger effects, samples, and less variability make rejecting the null hypothesis more likely.
3) Direction hypotheses tests where the alternative predicts a direction of the effect. This allows rejecting the null with smaller differences but in the predicted direction.
4) Effect size measures like Cohen's d provide information beyond just significance. Statistical power is the probability of correctly rejecting a false null hypothesis.
This document provides an overview of inferential statistics and statistical tests that can be used, including correlation tests, t-tests, and how to determine which tests are appropriate. It discusses the assumptions of parametric tests like Pearson's correlation and t-tests, and how to check assumptions graphically and using statistical tests. Specific procedures for conducting correlation analyses in Excel and SPSS are outlined, along with how to interpret and report the results.
The effect birth date has on choosing to study a sports related course at an ...Carl Page
The Relative Age Effect (RAE) in the academic study of sport. The aim of this study was to investigate the relationship of those students being born on a certain date will indeed influence the decision to study a sports related course at an educational institution.
This chapter discusses inferential statistics and the concepts underlying them. It covers key topics like types of inferential statistics (parametric vs nonparametric), important perspectives like generalizing from samples to populations, underlying concepts like null/alternative hypotheses and types of errors. Specific statistical techniques are explained like t-tests, ANOVA, regression, along with key ideas like sampling distributions, standard error, degrees of freedom, and the steps to conduct statistical tests. Different types of samples and issues with gain scores are also addressed.
This document discusses confidence intervals for estimating population means from sample data. It begins by explaining how to calculate point estimates and confidence intervals when the sample size is large (n ≥ 30) using the normal distribution. It then covers calculating confidence intervals when the sample size is small (n < 30) using the t-distribution. The key steps covered are determining the appropriate distribution to use based on sample size and knowledge of the population standard deviation, finding the critical values and margin of error, and calculating the confidence interval. Examples are provided to demonstrate how to construct confidence intervals in different situations.
This document defines statistical formulas for descriptive statistics like mean, variance, and standard deviation. It also defines formulas for hypothesis testing including z-scores, t-tests, confidence intervals, and proportions. Additionally, it outlines formulas for regression analysis like the least squares regression line, residuals, coefficient of determination, standard errors, confidence levels, and prediction intervals.
The median is the middlemost score when values are arranged from lowest to highest. It divides the data set into two equal groups, with scores above and below the median. The median is not affected by extreme values and can be used when the mean would be skewed. To find the median of ungrouped data, arrange values from highest to lowest and take the middle value. For grouped data, use the formula Median = Ll + cfb/f, where Ll is the lower limit of the class containing N/2, cfb is the cumulative frequency below the assumed median, and f is the corresponding frequency.
Inferential statistics allow researchers to make generalizations about populations based on samples. Some key inferential statistical techniques discussed in the document include hypothesis testing using t-tests, chi-square tests, and regression analysis. The document provides a brief history of inferential statistics and outlines the process for hypothesis testing, including defining the null and alternative hypotheses, determining the level of significance, calculating test statistics, and drawing conclusions. It also discusses types of errors that can occur in sampling and hypothesis testing.
This document provides formulas and examples for calculating the mean and median of data sets. It defines the mean as the sum of all values divided by the total number of data points. For grouped data, the mean is calculated as the sum of the frequency multiplied by the midpoint of each class, divided by the total number of data points. Examples are provided to demonstrate calculating the mean of both ungrouped and grouped data sets. The median is defined for grouped data as the lower boundary of the class containing the median plus the class width times the amount the cumulative frequency is less than the halfway point, divided by the total number of data points. An example is given to demonstrate calculating the median of a grouped data set.
This document discusses various statistical techniques used for inferential statistics, including parametric and non-parametric techniques. Parametric techniques make assumptions about the population and can determine relationships, while non-parametric techniques make few assumptions and are useful for nominal and ordinal data. Commonly used parametric tests are t-tests, ANOVA, MANOVA, and correlation analysis. Non-parametric tests mentioned include Chi-square, Wilcoxon, and Friedman tests. Examples are provided to illustrate the appropriate uses of each technique.
This document provides an introduction to inferential statistics, including key terms like test statistic, critical value, degrees of freedom, p-value, and significance. It explains that inferential statistics allow inferences to be made about populations based on samples through probability and significance testing. Different levels of measurement are discussed, including nominal, ordinal, and interval data. Common inferential tests like the Mann-Whitney U, Chi-squared, and Wilcoxon T tests are mentioned. The process of conducting inferential tests is outlined, from collecting and analyzing data to comparing test statistics to critical values to determine significance. Type 1 and Type 2 errors in significance testing are also defined.
This presentation includes an introduction to statistics, introduction to sampling methods, collection of data, classification and tabulation, frequency distribution, graphs and measures of central tendency.
Statistics involves collecting, organizing, analyzing, and interpreting data. Descriptive statistics describe characteristics of a data set through measures like central tendency and variability. Inferential statistics draw conclusions about a population based on a sample. Key terms include population, sample, parameter, statistic, data types, levels of measurement, and sampling techniques like simple random sampling. Common data gathering methods are interviews, questionnaires, and registration records. Data can be presented textually, in tables, or graphically through charts, graphs, and maps.
This document provides an overview of descriptive and inferential statistics concepts. It discusses parameters versus statistics, descriptive versus inferential statistics, measures of central tendency (mean, median, mode), variability (standard deviation, range), distributions (normal, positively/negatively skewed), z-scores, correlations, hypothesis testing, t-tests, ANOVA, chi-square tests, and presenting results. Key terms like alpha levels, degrees of freedom, effect sizes, and probabilities are also introduced at a high level.
The document defines various statistical measures and types of statistical analysis. It discusses descriptive statistical measures like mean, median, mode, and interquartile range. It also covers inferential statistical tests like the t-test, z-test, ANOVA, chi-square test, Wilcoxon signed rank test, Mann-Whitney U test, and Kruskal-Wallis test. It explains their purposes, assumptions, formulas, and examples of their applications in statistical analysis.
This document provides an overview of statistical tests of significance used to analyze data and determine whether observed differences could reasonably be due to chance. It defines key terms like population, sample, parameters, statistics, and hypotheses. It then describes several common tests including z-tests, t-tests, F-tests, chi-square tests, and ANOVA. For each test, it outlines the assumptions, calculation steps, and how to interpret the results to evaluate the null hypothesis. The goal of these tests is to determine if an observed difference is statistically significant or could reasonably be expected due to random chance alone.
1. Statistical analysis involves collecting, organizing, analyzing data, and drawing inferences about populations based on samples. It includes both descriptive and inferential statistics.
2. The document defines key terms used in statistical analysis like population, sample, statistical analysis, and discusses various statistical measures like mean, median, mode, interquartile range, and standard deviation.
3. The purposes of statistical analysis are outlined as measuring relationships, making predictions, testing hypotheses, and summarizing results. Both parametric and non-parametric statistical analyses are discussed.
Non parametric test- Muskan (M.Pharm-3rd semester)MuskanShingari
Nonparametric tests are an alternative to parametric tests like the T-test or ANOVA, which are only applicable if the data meets certain assumptions. Nonparametric tests are relatively easy to perform, but they can be difficult to use with large amounts of data.
Some examples of nonparametric tests include:
Mann-Whitney U-Test
Wilcoxon Matched Pairs Test
Sign Test
Chi-square Test
Kruskal–Wallis Test
Kolmogorov–Smirnov Test
Nonparametric tests are more statistically powerful than parametric tests when the data is not normally distributed. They are also less affected by outliers because the size of the maximum value does not affect the rank or sign.
This document discusses statistical methods for comparing means, including t-tests and analysis of variance (ANOVA). It explains how t-tests can be used to compare two means or paired samples, and how ANOVA can compare two or more means. Key assumptions and procedures are outlined for one-sample t-tests, paired t-tests, independent t-tests with equal and unequal variances, and one-way between-subjects ANOVAs.
This document discusses various statistical tests used to analyze dental research data, including parametric and non-parametric tests. It provides information on tests of significance such as the t-test, Z-test, analysis of variance (ANOVA), and non-parametric equivalents. Key points covered include the differences between parametric and non-parametric tests, assumptions and applications of the t-test, Z-test, ANOVA, and non-parametric alternatives like the Mann-Whitney U test and Kruskal-Wallis test. Examples are provided to illustrate how to perform and interpret common statistical analyses used in dental research.
The document discusses different types of t-tests used to compare means:
- One-sample t-test compares a sample mean to a predefined value.
- Paired (dependent) t-test compares means of two conditions with the same participants.
- Independent t-test compares means of two unrelated groups.
It explains how to choose the appropriate t-test based on research design, number of means being compared, and data distribution. Formulas are provided for calculating each t-test statistic. Examples are given to demonstrate applying the one-sample and paired t-tests.
This document provides an overview of common statistical tests used in dentistry research. It first describes descriptive statistics like measures of central tendency, dispersion, position, and outliers. It then discusses inferential statistics including parametric tests like t-tests and ANOVA that assume normal distributions, and non-parametric tests that make fewer assumptions. Specific parametric tests covered are the independent and paired t-tests and ANOVA. Non-parametric tests discussed include the chi-square, Wilcoxon, Mann-Whitney U, and Kruskal-Wallis tests. The document also briefly explains correlation/regression and measures of effect size like relative risk and odds ratios.
The document discusses measures of variability in statistics including range, interquartile range, standard deviation, and variance. It provides examples of calculating each measure using sample data sets. The range is the difference between the highest and lowest values, while the interquartile range is the difference between the third and first quartiles. The standard deviation represents the average amount of dispersion from the mean, and variance is the average of the squared deviations from the mean. Both standard deviation and variance increase with greater variability in the data set.
This document discusses different types of data and statistical tests used in orthodontics. It outlines categorical (qualitative) and numerical (quantitative) data, including nominal, ordinal, discrete, and continuous variables. Appropriate statistical tests are described for each data type, such as chi square tests for categorical data and t-tests or ANOVA for numerical data. Key concepts in data summarization are also covered, including measures of central tendency, variability, normal distribution, correlation, and hypothesis testing. The importance of selecting the right analysis method based on data type is emphasized.
I do not have enough information to determine what percentage of residents are asleep now versus at the beginning of this talk. As an AI assistant without direct observation of the audience, I do not have data on individual residents' states of alertness over time.
1. An independent samples t-test was conducted to determine if there were differences in anxiety scores between male and female participants before a major competition.
2. The results of the t-test showed no significant difference between the mean anxiety scores of males (M=17, SD=4.58) and females (M=18, SD=3.16), t(8)=0.41, p>0.05.
3. Therefore, the null hypothesis that there is no difference between male and female anxiety scores before a major competition was not rejected.
The document provides information about performing chi-square tests and choosing appropriate statistical tests. It discusses key concepts like the null hypothesis, degrees of freedom, and expected versus observed values. Examples are provided to illustrate chi-square tests for goodness of fit and comparison of proportions. The document also compares parametric and non-parametric tests, providing examples of when each would be used.
The document discusses several non-parametric tests that can be used as alternatives to parametric tests when the assumptions of parametric tests are violated. Specifically, it discusses:
1. The sign test and one sample median test, which can be used instead of t-tests when the data is skewed or not normally distributed.
2. Mood's median test, which compares the medians of two independent samples and is the nonparametric version of a one-way ANOVA.
3. The Kruskal-Wallis test, which determines if there are differences in medians across three or more groups and is the nonparametric version of a one-way ANOVA.
• Non parametric tests are distribution free methods, which do not rely on assumptions that the data are drawn from a given probability distribution. As such it is the opposite of parametric statistics
• In non- parametric tests we do not assume that a particular distribution is applicable or that a certain value is attached to a parameter of the population.
When to use non parametric test???
1) Sample distribution is unknown.
2) When the population distribution is abnormal
Non-parametric tests focus on order or ranking
1) Data is changed from scores to ranks or signs
2) A parametric test focuses on the mean difference, and equivalent non-parametric test focuses on the difference between medians.
1) Chi – square test
• First formulated by Helmert and then it was developed by Karl Pearson
• It is both parametric and non-parametric test but more of non - parametric test.
• The test involves calculation of a quantity called Chi square.
• Follows specific distribution known as Chi square distribution
• It is used to test the significance of difference between 2 proportions and can be used when there are more than 2 groups to be compared.
Applications
1) Test of proportion
2) Test of association
3) Test of goodness of fit
Criteria for applying Chi- square test
• Groups: More than 2 independent
• Data: Qualitative
• Sample size: Small or Large, random sample
• Distribution: Non-Normal (Distribution free)
• Lowest expected frequency in any cell should be greater than 5
• No group should contain less than 10 items
Example: If there are two groups, one of which has received oral hygiene instructions and the other has not received any instructions and if it is desired to test if the occurrence of new cavities is associated with the instructions.
2) Fischer Exact Test
• Used when one or more of the expected counts in a 2×2 table is small.
• Used to calculate the exact probability of finding the observed numbers by using the fischer exact probability test.
3) Mc Nemar Test
• Used to compare before and after findings in the same individual or to compare findings in a matched analysis (for dichotomous variables).
Example: comparing the attitudes of medical students toward confidence in statistics analysis before and after the intensive statistics course.
4) Sign Test
• Sign test is used to find out the statistical significance of differences in matched pair comparisons.
• Its based on + or – signs of observations in a sample and not on their numerical magnitudes.
• For each subject, subtract the 2nd score from the 1st, and write down the sign of the difference.
It can be used
a. in place of a one-sample t-test
b. in place of a paired t-test or
c. for ordered categorial data where a numerical scale is inappropriate but where it is possible to rank the observations.
5) Wilcoxon signed rank test
• Analogous to paired ‘t’ test
6) Mann Whitney Test
• similar to the student’s t test
7) Spearman’s rank correlation - similar to pearson's correlation.
Keeping it real: A comprehensive and transparent evaluation of electronic res...University of North Texas
Presentation for pre-conference workshop at the Charleston Conference, 2014. There will be a time when your library will need to evaluate all of your electronic resources. How would you do it? In response to a cut to our materials budget, we have developed a method that condenses a large amount of information into a few select criteria. In this day-long workshop, we will walk through the process using the Decision Grid process developed by at the University of Maryland at College Park (Foudy and McManus 533-538) as a starting point. The workshop leaders will first demonstrate each step of our process, and then the participants will work in small groups (5-7) using their own experiences and a sample data set of their own. The steps covered will include selecting and defining the criteria, gathering and analyzing the data, and determining how to make final decisions. We will cover some technical aspects of gathering and analyzing data, including using Excel functions.
Planning for the budget-ocalypse: The evolution of a serials/ER cancellation ...University of North Texas
The University of North Texas Libraries are funded almost entirely by undergraduate student use fees. As the undergraduate enrollment has plateaued in recent years, the libraries' have not been able to keep up with rising costs, resulting in a series of cuts to the materials budget totaling nearly $4 million. While some of these cuts took the form of reductions in firm orders and dissolution of approval plans, for the past three years the bulk have come from cancellations of serials and electronic resources. With each year's cuts, the UNT Collection Development department has been forced to modify and refine their deselection process. This presentation will show the development of UNT's strategy for determining cancellations using a variety of methods (overlap analysis, usage statistics, faculty input) and tools (EBSCO Usage Consolidation, Serials Solutions 360).
The quick & the dirty: The good, the bad & the ugly of database overlap at th...University of North Texas
Given the decreasing budgets for collections at many libraries, librarians are looking at abstracting and indexing (A&I) resources and full-text databases with a more skeptic eye. In addition to traditional evaluation measures, such as costs, usage and faculty input, we looked at the overlap of indexing and/or full-text coverage. Those who have conducted such overlap studies have approached it at either the journal or article level. Article-level overlap studies demonstrate coverage of selected articles in the databases under study. Conversely, journal-level studies examine the extent of indexing of journals among the selected databases. Both methods are very time-consuming and require extensive resources. A simplification of the journal-level method is to compare lists of journals indexed. Two tools, Cufts and JISC's Academic Database Assessment Tool (ADAT) are very useful for this purpose, but do not include all databases. Many databases need to be manually collected. This presentation will describe the background to such a project, the specific tools and procedures used, how the results were used to address budget reductions, and the limitations of the results. Members of the audience will be able to consider using this method for evaluating their own abstract and index databases for budgetary purposes.
The second in a series of four seminars presented to University of North Texas librarians. This presentation focuses on organizing and presenting basic descriptive statistics, including measures of central tendency and variation.
Final session in a series of four seminars presented to University of North Texas librarians. This presentation brings together some best practices for gathering, organizing, analyzing, and presenting statistics and data.
Statistics for Librarians, Session 1: What is statistics & Why is it important?University of North Texas
This document provides an overview of key concepts in statistics including:
- The goals of statistics which are to make sense of data, explain what happens, make sound decisions, and determine how close estimates are to the truth.
- Key terms like variables, scales of measurement, validity of measures, sampling, bias, and statistical analysis techniques.
- The importance of valid and reliable data collection and analysis to produce valid results and insights.
Lagos School of Programming Final Project Updated.pdfbenuju2016
A PowerPoint presentation for a project made using MySQL, Music stores are all over the world and music is generally accepted globally, so on this project the goal was to analyze for any errors and challenges the music stores might be facing globally and how to correct them while also giving quality information on how the music stores perform in different areas and parts of the world.
vMix Pro Crack + Serial Number Torrent free Downloadeyeskye547
vMix is a comprehensive live 2025 video production and streaming software designed for Windows PCs, catering to a wide range of users from hobbyists to professional broadcasters. It enables users to create, mix, switch, record, and stream high-quality live productions with ease.
⬇️⬇️COPY & PASTE IN BROWSER TO DOWNLOAD⬇️⬇️
https://precrackfiles.com/download-setup/
Decision Trees in Artificial-Intelligence.pdfSaikat Basu
Have you heard of something called 'Decision Tree'? It's a simple concept which you can use in life to make decisions. Believe you me, AI also uses it.
Let's find out how it works in this short presentation. #AI #Decisionmaking #Decisions #Artificialintelligence #Data #Analysis
https://saikatbasu.me
保密服务圣地亚哥州立大学英文毕业证书影本美国成绩单圣地亚哥州立大学文凭【q微1954292140】办理圣地亚哥州立大学学位证(SDSU毕业证书)毕业证书购买【q微1954292140】帮您解决在美国圣地亚哥州立大学未毕业难题(San Diego State University)文凭购买、毕业证购买、大学文凭购买、大学毕业证购买、买文凭、日韩文凭、英国大学文凭、美国大学文凭、澳洲大学文凭、加拿大大学文凭(q微1954292140)新加坡大学文凭、新西兰大学文凭、爱尔兰文凭、西班牙文凭、德国文凭、教育部认证,买毕业证,毕业证购买,买大学文凭,购买日韩毕业证、英国大学毕业证、美国大学毕业证、澳洲大学毕业证、加拿大大学毕业证(q微1954292140)新加坡大学毕业证、新西兰大学毕业证、爱尔兰毕业证、西班牙毕业证、德国毕业证,回国证明,留信网认证,留信认证办理,学历认证。从而完成就业。圣地亚哥州立大学毕业证办理,圣地亚哥州立大学文凭办理,圣地亚哥州立大学成绩单办理和真实留信认证、留服认证、圣地亚哥州立大学学历认证。学院文凭定制,圣地亚哥州立大学原版文凭补办,扫描件文凭定做,100%文凭复刻。
特殊原因导致无法毕业,也可以联系我们帮您办理相关材料:
1:在圣地亚哥州立大学挂科了,不想读了,成绩不理想怎么办???
2:打算回国了,找工作的时候,需要提供认证《SDSU成绩单购买办理圣地亚哥州立大学毕业证书范本》【Q/WeChat:1954292140】Buy San Diego State University Diploma《正式成绩单论文没过》有文凭却得不到认证。又该怎么办???美国毕业证购买,美国文凭购买,【q微1954292140】美国文凭购买,美国文凭定制,美国文凭补办。专业在线定制美国大学文凭,定做美国本科文凭,【q微1954292140】复制美国San Diego State University completion letter。在线快速补办美国本科毕业证、硕士文凭证书,购买美国学位证、圣地亚哥州立大学Offer,美国大学文凭在线购买。
美国文凭圣地亚哥州立大学成绩单,SDSU毕业证【q微1954292140】办理美国圣地亚哥州立大学毕业证(SDSU毕业证书)【q微1954292140】录取通知书offer在线制作圣地亚哥州立大学offer/学位证毕业证书样本、留信官方学历认证(永久存档真实可查)采用学校原版纸张、特殊工艺完全按照原版一比一制作。帮你解决圣地亚哥州立大学学历学位认证难题。
主营项目:
1、真实教育部国外学历学位认证《美国毕业文凭证书快速办理圣地亚哥州立大学办留服认证》【q微1954292140】《论文没过圣地亚哥州立大学正式成绩单》,教育部存档,教育部留服网站100%可查.
2、办理SDSU毕业证,改成绩单《SDSU毕业证明办理圣地亚哥州立大学成绩单购买》【Q/WeChat:1954292140】Buy San Diego State University Certificates《正式成绩单论文没过》,圣地亚哥州立大学Offer、在读证明、学生卡、信封、证明信等全套材料,从防伪到印刷,从水印到钢印烫金,高精仿度跟学校原版100%相同.
3、真实使馆认证(即留学人员回国证明),使馆存档可通过大使馆查询确认.
4、留信网认证,国家专业人才认证中心颁发入库证书,留信网存档可查.
《圣地亚哥州立大学学位证书的英文美国毕业证书办理SDSU办理学历认证书》【q微1954292140】学位证1:1完美还原海外各大学毕业材料上的工艺:水印,阴影底纹,钢印LOGO烫金烫银,LOGO烫金烫银复合重叠。文字图案浮雕、激光镭射、紫外荧光、温感、复印防伪等防伪工艺。
高仿真还原美国文凭证书和外壳,定制美国圣地亚哥州立大学成绩单和信封。毕业证网上可查学历信息SDSU毕业证【q微1954292140】办理美国圣地亚哥州立大学毕业证(SDSU毕业证书)【q微1954292140】学历认证生成授权声明圣地亚哥州立大学offer/学位证文凭购买、留信官方学历认证(永久存档真实可查)采用学校原版纸张、特殊工艺完全按照原版一比一制作。帮你解决圣地亚哥州立大学学历学位认证难题。
圣地亚哥州立大学offer/学位证、留信官方学历认证(永久存档真实可查)采用学校原版纸张、特殊工艺完全按照原版一比一制作【q微1954292140】Buy San Diego State University Diploma购买美国毕业证,购买英国毕业证,购买澳洲毕业证,购买加拿大毕业证,以及德国毕业证,购买法国毕业证(q微1954292140)购买荷兰毕业证、购买瑞士毕业证、购买日本毕业证、购买韩国毕业证、购买新西兰毕业证、购买新加坡毕业证、购买西班牙毕业证、购买马来西亚毕业证等。包括了本科毕业证,硕士毕业证。
保密服务多伦多都会大学英文毕业证书影本加拿大成绩单多伦多都会大学文凭【q微1954292140】办理多伦多都会大学学位证(TMU毕业证书)成绩单VOID底纹防伪【q微1954292140】帮您解决在加拿大多伦多都会大学未毕业难题(Toronto Metropolitan University)文凭购买、毕业证购买、大学文凭购买、大学毕业证购买、买文凭、日韩文凭、英国大学文凭、美国大学文凭、澳洲大学文凭、加拿大大学文凭(q微1954292140)新加坡大学文凭、新西兰大学文凭、爱尔兰文凭、西班牙文凭、德国文凭、教育部认证,买毕业证,毕业证购买,买大学文凭,购买日韩毕业证、英国大学毕业证、美国大学毕业证、澳洲大学毕业证、加拿大大学毕业证(q微1954292140)新加坡大学毕业证、新西兰大学毕业证、爱尔兰毕业证、西班牙毕业证、德国毕业证,回国证明,留信网认证,留信认证办理,学历认证。从而完成就业。多伦多都会大学毕业证办理,多伦多都会大学文凭办理,多伦多都会大学成绩单办理和真实留信认证、留服认证、多伦多都会大学学历认证。学院文凭定制,多伦多都会大学原版文凭补办,扫描件文凭定做,100%文凭复刻。
特殊原因导致无法毕业,也可以联系我们帮您办理相关材料:
1:在多伦多都会大学挂科了,不想读了,成绩不理想怎么办???
2:打算回国了,找工作的时候,需要提供认证《TMU成绩单购买办理多伦多都会大学毕业证书范本》【Q/WeChat:1954292140】Buy Toronto Metropolitan University Diploma《正式成绩单论文没过》有文凭却得不到认证。又该怎么办???加拿大毕业证购买,加拿大文凭购买,【q微1954292140】加拿大文凭购买,加拿大文凭定制,加拿大文凭补办。专业在线定制加拿大大学文凭,定做加拿大本科文凭,【q微1954292140】复制加拿大Toronto Metropolitan University completion letter。在线快速补办加拿大本科毕业证、硕士文凭证书,购买加拿大学位证、多伦多都会大学Offer,加拿大大学文凭在线购买。
加拿大文凭多伦多都会大学成绩单,TMU毕业证【q微1954292140】办理加拿大多伦多都会大学毕业证(TMU毕业证书)【q微1954292140】学位证书电子图在线定制服务多伦多都会大学offer/学位证offer办理、留信官方学历认证(永久存档真实可查)采用学校原版纸张、特殊工艺完全按照原版一比一制作。帮你解决多伦多都会大学学历学位认证难题。
主营项目:
1、真实教育部国外学历学位认证《加拿大毕业文凭证书快速办理多伦多都会大学毕业证书不见了怎么办》【q微1954292140】《论文没过多伦多都会大学正式成绩单》,教育部存档,教育部留服网站100%可查.
2、办理TMU毕业证,改成绩单《TMU毕业证明办理多伦多都会大学学历认证定制》【Q/WeChat:1954292140】Buy Toronto Metropolitan University Certificates《正式成绩单论文没过》,多伦多都会大学Offer、在读证明、学生卡、信封、证明信等全套材料,从防伪到印刷,从水印到钢印烫金,高精仿度跟学校原版100%相同.
3、真实使馆认证(即留学人员回国证明),使馆存档可通过大使馆查询确认.
4、留信网认证,国家专业人才认证中心颁发入库证书,留信网存档可查.
《多伦多都会大学学位证购买加拿大毕业证书办理TMU假学历认证》【q微1954292140】学位证1:1完美还原海外各大学毕业材料上的工艺:水印,阴影底纹,钢印LOGO烫金烫银,LOGO烫金烫银复合重叠。文字图案浮雕、激光镭射、紫外荧光、温感、复印防伪等防伪工艺。
高仿真还原加拿大文凭证书和外壳,定制加拿大多伦多都会大学成绩单和信封。学历认证证书电子版TMU毕业证【q微1954292140】办理加拿大多伦多都会大学毕业证(TMU毕业证书)【q微1954292140】毕业证书样本多伦多都会大学offer/学位证学历本科证书、留信官方学历认证(永久存档真实可查)采用学校原版纸张、特殊工艺完全按照原版一比一制作。帮你解决多伦多都会大学学历学位认证难题。
多伦多都会大学offer/学位证、留信官方学历认证(永久存档真实可查)采用学校原版纸张、特殊工艺完全按照原版一比一制作【q微1954292140】Buy Toronto Metropolitan University Diploma购买美国毕业证,购买英国毕业证,购买澳洲毕业证,购买加拿大毕业证,以及德国毕业证,购买法国毕业证(q微1954292140)购买荷兰毕业证、购买瑞士毕业证、购买日本毕业证、购买韩国毕业证、购买新西兰毕业证、购买新加坡毕业证、购买西班牙毕业证、购买马来西亚毕业证等。包括了本科毕业证,硕士毕业证。
How to regulate and control your it-outsourcing provider with process miningProcess mining Evangelist
Oliver Wildenstein is an IT process manager at MLP. As in many other IT departments, he works together with external companies who perform supporting IT processes for his organization. With process mining he found a way to monitor these outsourcing providers.
Rather than having to believe the self-reports from the provider, process mining gives him a controlling mechanism for the outsourced process. Because such analyses are usually not foreseen in the initial outsourcing contract, companies often have to pay extra to get access to the data for their own process.
Frank van Geffen is a Process Innovator at the Rabobank. He realized that it took a lot of different disciplines and skills working together to achieve what they have achieved. It's not only about knowing what process mining is and how to operate the process mining tool. Instead, a lot of emphasis needs to be placed on the management of stakeholders and on presenting insights in a meaningful way for them.
The results speak for themselves: In their IT service desk improvement project, they could already save 50,000 steps by reducing rework and preventing incidents from being raised. In another project, business expense claim turnaround time has been reduced from 11 days to 1.2 days. They could also analyze their cross-channel mortgage customer journey process.
Tijn van der Heijden is a business analyst with Deloitte. He learned about process mining during his studies in a BPM course at Eindhoven University of Technology and became fascinated with the fact that it was possible to get a process model and so much performance information out of automatically logged events of an information system.
Tijn successfully introduced process mining as a new standard to achieve continuous improvement for the Rabobank during his Master project. At his work at Deloitte, Tijn has now successfully been using this framework in client projects.
3. SESSION OBJECTIVES
Purpose of Inferential Statistics
Probability
Elements of Significance Testing
Three key tests
• T-test
• Chi-squared
• Correlation (or binomial)
Effect Measures
4. PURPOSE OF INFERENTIAL STATISTICS
• Infer results
• Draw conclusions
• Increase the
Signal-Noise ratio
Signal
Noise
5. INFERENTIAL STATISTICS
Tests of hypotheses
• Expectations
• Associations
Accounts for uncertainty
• Random error
• Confidence interval
10. LEVELS OF MEASUREMENT (NOIR)
Nominal
•Counts by category
•Binary (Yes/No)
•No meaning
between the
categories (Blue is
not better than
Red)
Ordinal
•Ranks
•Scales
•Space between
ranks is subjective
Interval
•Integers
•Zero is just another
value – doesn’t
mean “absence
of”
•Space between
values is equal and
objective, but
discrete
Ratio
•Interval data with a
baseline
•Zero (0) means
“absence of”
•Space between is
continuous
•Includes simple
counts
22. Set the mean to 0
Standard Deviations above and
below the mean
23. DEMONSTRATION OF DISTRIBUTIONS
Distribution of the
Population
The “Truth”
N is the # of samples
n is the number of
items in each sample
Watch the cumulative mean & medians
slowly merge to the population
25. CASE STUDY
• Background: Info-Lit course is meeting
resistance from skeptical faculty.
• Research Questions:
• Does the IL course improve grades on final
papers?
• Can the IL course improve passing rates for the
course?
• Do students in different majors respond differently
to the IL training?
• Is the final score related to the number of credit
hours enrolled for each student?
26. METHODOLOGY
Selection
• Two sections of same course
with different instructors.
• Random Assignment
Outcome
• Blinded scoring by 2 TAs
• Scores range from 1-100
• Passing grade: 70
29. DISTRIBUTION OF SCORES
Table 1
•Distribution of
scores
Table 2
•Distribution of
passing rates
by broad
field of major
Table 3
•Correlation of
scores &
credit hours
31. SIGNIFICANCE TESTING
• Groups against each other
• A group against the population or
standard
Comparing
significance of
differences
• Risk of being wrong
• Alpha (α)
• Set in advance
What is
“significant”?
• The value that the statistic must meet
or exceed to be statistically significant.
• Based on statistic and α
Critical Value
32. STEPS IN SIGNIFICANCE TESTING
Which
Test?
Calculate
Statistic
Critical
Value of
Statistic?
Probability
(p-value)
33. KEY ELEMENTS OF SIGNIFICANCE
TESTING
Null Hypothesis
Measure of Central Tendency
Standard deviations
Risk of being wrong (alpha)
• Usually .05 or .025 or .01 or .001
Degrees of freedom (df)
35. DEGREES OF FREEDOM EXPLAINED
• All these have a
mean of 5:
• 5, 5, 5
• 2, 8, 5
• 3, 2, 10
• 7, 4, & ?
• If 2 values are
known and the
mean is known,
then the 3rd value is
also known.
• Only 2 of the 3
values are free to
vary.
36. CALCULATING DEGREES OF FREEDOM
(DF)
For a single sample:
• Degrees of freedom (df) for t-test = n-1
For more than one group:
• df=∑(n-1) for all groups (k)
• OR, ∑ n-k
For comparing proportions in categories (k):
• df= ∑k-1 (# of categories minus 1)
38. T-TEST
Used with interval or ratio data
Based on normal distribution
Four Decisions
• Paired or un-paired samples?
• Equal or unequal variances (standard deviations)?
• Risk?
• One- or two-tail?
• Direction of expected difference
• Best to bet on difference in both directions (2-tail)
40. T-TEST FORMULA FOR UNPAIRED
SAMPLES
𝑡 =
𝑥1 − 𝑥2
𝑆 𝑥1−𝑥2
Signal
Noise
Difference Between Group Means
𝑉𝑎𝑟𝑖𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑜𝑓 𝐺𝑟𝑜𝑢𝑝𝑠
41. ELEMENTS OF T-TEST USING EXCEL
DATA ANALYSIS TOOLPAK
•UnpairedPaired or Unpaired
samples?
•Equal*Equal or Unequal
Variances?
•Data for intervention group
•Data for control group
Data
•0Hypothesized
difference
•0.025 (for a 2-tail test)
Alpha
46. PEARSON’S CHI-SQUARED (Χ2)
GOODNESS OF FIT TEST
Does an observed frequency distribution
differ from an expected distribution
• Observed is the sample or the intervention.
• Expected is the population or the control or a
theoretical distribution.
• Will depend on your Null Hypothesis
Nominal or categorical data
• Counts by category
47. EXPECTED RATIOS FOR CASE STUDY
Research Question:
•Do students in different majors
respond differently to the IL training?
Null Hypothesis
•The ratio of students who passed will
be the same for all majors.
48. WHEN TO USE PEARSON’S CHI-
SQUARED GOODNESS OF FIT TEST
Nominal Data
Sample Size
• Not too large:
• Sample is at most 1/10th of population
• Not too small:
• At least five in each of the categories
for the expected group.
49. OBSERVED PASSING RATES BY MAJOR
Major Passed Not Passed Grand Total
Arts 6 7 13
Humanities 8 5 13
Social
Sciences 17 10 27
STEM 20 5 25
Undeclared 16 7 23
Total 67 34 101
50. EXPECTED RATIOS OF PASSING RATES
BY MAJOR
• H0: Rates of passing will be the same for all majors.
• Expected rates: 70% of class passes.
• Expected ratios: 70% of each major passes.
Major Passed Not Passed Grand Total
Arts 11.2 (16*.7) 4.8 16
Humanities 11.2 (16*.7) 4.8 16
Social
Sciences 18.2 (26*.7) 7.8 26
STEM 16.1 (23*.7) 6.9 23
Undeclared 14 (20*.7) 6 20
51. CHI-SQUARED GOF TEST FORMULA
•χ2
=
𝑂𝑏𝑠𝑒𝑟𝑣𝑒𝑑−𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 2
𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑
• Critical value of Chi-squared depends
on degrees of freedom.
• Degrees of freedom
• Based on the number of categories or
table cells (k)
• df=k-1
52. CHI-SQUARED IN EXCEL
What is Null
Hypothesis?
There is no difference between the majors regarding
passing rates.
What is your
alpha (risk)?
0.05
Data in a
summary
tables?
Actual Ratios
Expected Ratios
Excel
function:
=CHISQ.TEST(actual range1,expected range2)
Provides a
p-value
0.0000172
Is p-value
<= alpha?
Yes
54. STATISTICAL CORRELATION
Quantitative value of relationship of 2 variables
•-1 represents a perfect indirect correlation
•0 represents no correlation
•+1 represents a perfect direct correlation
Expressed in range of -1 to +1
•How much two variables change together
Based on co-variance
55. PEARSON’S PRODUCT MOMENT
CORRELATION COEFFICIENT
Most commonly used
statistic
Normally distributed
interval or ratio data only
Labeled as r
Multiplication =
Interaction
Signal
Noise
𝑟𝑥𝑦 =
𝑥 − 𝑥 𝑦 − 𝑦
𝑛 − 1 𝑠 𝑥 𝑠 𝑦
56. CORRELATION IN EXCEL
•No correlation
Null Hypothesis?
•=PEARSON(range1,range2)
Coefficient function (r):
Does NOT have a single function to test for
significance
Calculate Probability:
n # in sample 101
df # in sample - 2 99
alpha 0.025 for 2-Tail Test 0.025
r =PEARSON(range1,range2) 0.362287
t =r*SQRT(alpha)/SQRT(1-r^2) 3.867434
p =T.DIST.2T(t,df) 0.000197
57. CORRELATIONS FOR ORDINAL DATA
Spearman’s ϱ (rho)
•Use if there are limited ties in
rank.
Kendall’s τ (tau)
•Use if you have a number of ties.
60. FACTORS ASSOCIATED WITH CHOICE
OF STATISTICAL METHOD
Level of
Measurement
What is being
compared
Independence
of units
Underlying
variance in the
population
Distribution Sample size
Number of
comparison
groups
65. EFFECT SIZES OF QUANTITATIVE DATA
Differences from
the mean
• Standardized
• weighted against the
pooled (average)
standard deviation
• Cohen’s d
Correlations
• Cohen’s guidelines for
Pearson’s r
• r = 0.362
Effect Size r>
Small .10
Medium .30
Large .50
𝑑 =
𝑥1 − 𝑥2
𝑠 𝑥1,𝑥2
𝑑 =
79.47 − 69.56
11.8036
𝑑 = 0.8392
66. EFFECT SIZES OF QUALITATIVE DATA
Based on
Contingency
table
• Uses probabilities
• 𝑅𝑅 =
𝑎 𝑎+𝑏
𝑐 𝑐+𝑑
Relative risk
• 𝑅𝑅 =
41∗65
10∗36
•RR = 1.608
•The passing rate for the intervention
group was 1.6 times the passing rate for
control group.
RR of Case
Study
Pass No Pass Total
Intervention a (41) b (24) a+b (65)
Control c (26) d (10) c+d (36)
Totals a+c (67) b+d (34) a+b+c+d (101)
67. CONFIDENCE INTERVALS
Point estimates
Intervals
Based on
Expressed as:
•Single value
•Mean
•Degree of uncertainty
•Range of certainty around the
point estimate
•Point estimate (e.g. mean)
•Confidence level (usually .95)
•Standard deviation
•The mean score of the students
who had the IL training was 79.5
with a 95% CI of 76.4 and 82.5.
68. CASE STUDY CONCLUSIONS
• Research Questions:
• Could the IL course improve grades on final
papers?
• Could the IL course improve passing rates for the
course?
• Do students in different majors respond differently
to the IL training?
• Is the final score related to the number of credit
hours enrolled for each student?
• Control for external variables