SlideShare a Scribd company logo
Basic Statistics
AIJAZ SOHAG
MSc (Env:Sc),M.A.S(H.S.A.),MBA(Health
Mgt),MPH,PhD
Preface
• The purpose of this presentation is to help you
determine which statistical tests are appropriate
for analyzing your data for research project.
• Statistical tests that are presented here focuses
on the most common techniques.
Outline
• Descriptive Statistics
– Frequencies & percentages
– Means & standard deviations
• Inferential Statistics
– Correlation
– T-tests
– Chi-square
– Logistic Regression
Types of Statistics/Analyses
Descriptive Statistics
– Frequencies
– Basic measurements
Inferential Statistics
– Hypothesis Testing
– Correlation
– Confidence Intervals
– Significance Testing
– Prediction
Describing a phenomena
How many? How much?
BP, HR, BMI, IQ, etc.
Inferences about a phenomena
Proving or disproving theories
Associations between phenomena
If sample relates to the larger
population
E.g., Diet and health
Descriptive Statistics
Descriptive statistics can be used to summarize
and describe a single variable
• Frequencies (counts) & Percentages
– Use with categorical (nominal) data
• Levels, types, groupings, yes/no, Drug A vs. Drug B
• Means & Standard Deviations
– Use with continuous data
• Height, weight, cholesterol, scores on a test
Frequencies & Percentages
Look at the different ways we can display frequencies and
percentages for this data:
Table
Bar chart
Pie chart
Good if more
than 20
observations
AKA frequency
distributions –
good if more
than 20
observations
Continuous  Categorical
It is possible to take
continuous data
(such as hemoglobin
levels) and turn it
into categorical data
by grouping values
together. Then we
can calculate
frequencies and
percentages for each
group.
Continuous  Categorical
Distribution of
Glasgow Coma
Scale Scores
Even though
this is
continuous
data, it is
being treated
as “nominal”
as it is broken
down into
groups or
categoriesTip: It is usually better to collect continuous data and then break it
down into categories for data analysis as opposed to collecting data
that fits into preconceived categories.
Ordinal Level Data
Frequencies and percentages can be computed
for ordinal data
– Examples: Likert Scales (Strongly Disagree to Strongly
Agree); High School/Some College/College
Graduate/Graduate School
0
10
20
30
40
50
60
Strongly
Agree
Agree Disagree Strongly
Disagree
INFERENTIAL STATISTICS
Inferential statistics can be used to prove or
disprove theories, determine associations between
variables, and determine if findings are significant
and whether or not we can generalize from our
sample to the entire population
The types of inferential statistics we will go over:
• Correlation
• T-tests/ANOVA
• Chi-square
• Logistic Regression
Correlation
• When to use it?
– When you want to know about the association or relationship
between two continuous variables
• Ex) food intake and weight; drug dosage and blood pressure; air temperature and
metabolic rate, etc.
• What does it tell you?
– If a linear relationship exists between two variables, and how strong that
relationship is
• What do the results look like?
– The correlation coefficient = Pearson’s r
– Ranges from -1 to +1
– See next slide for examples of correlation results
Correlation
Guide for interpreting
strength of correlations:
 0 – 0.25 = Little or no
relationship
 0.25 – 0.50 = Fair degree of
relationship
 0.50 - 0.75 = Moderate
degree of relationship
 0.75 – 1.0 = Strong
relationship
 1.0 = perfect correlation
Correlation
• How do you interpret it?
– If r is positive, high values of one variable are associated with high values
of the other variable (both go in SAME direction - ↑↑ OR ↓↓)
• Ex) Diastolic blood pressure tends to rise with age, thus the two variables are
positively correlated
– If r is negative, low values of one variable are associated with high values
of the other variable (opposite direction - ↑↓ OR ↓ ↑)
• Ex) Heart rate tends to be lower in persons who exercise
frequently, the two variables correlate negatively
– Correlation of 0 indicates NO linear relationship
• How do you report it?
– “Diastolic blood pressure was positively correlated with age (r = .75, p < . 05).”
Tip: Correlation does NOT equal causation!!! Just because two variables are highly correlated, this
does NOT mean that one CAUSES the other!!!
T-tests
• When to use them?
– Paired t-tests: When comparing the MEANS of a continuous variable in
two non-independent samples (i.e., measurements on the same people
before and after a treatment)
• Ex) Is diet X effective in lowering serum cholesterol levels in a sample of 12
people?
• Ex) Do patients who receive drug X have lower blood pressure after
treatment then they did before treatment?
– Independent samples t-tests: To compare the MEANS of a
continuous variable in TWO independent samples (i.e., two different
groups of people)
• Ex) Do people with diabetes have the same Systolic Blood Pressure as
people without diabetes?
• Ex) Do patients who receive a new drug treatment have lower blood
pressure than those who receive a placebo?
Tip: if you have > 2 different groups, you use ANOVA, which compares the means of 3 or more groups
T-tests
• What does a t-test tell you?
– If there is a statistically significant difference between
the mean score (or value) of two groups (either the
same group of people before and after or two
different groups of people)
• What do the results look like?
– Student’s t
• How do you interpret it?
– By looking at corresponding p-value
• If p < .05, means are significantly different from each other
• If p > 0.05, means are not significantly different from each
other
How do you report t-tests results?
“As can be seen in Figure 1, specialty candidates had significantly
higher scores on questions dealing with treatment than residency
candidates (t = [insert t-value from stats output], p < .001).
“As can be seen in Figure 1, children’s mean reading
performance was significantly higher on the post-tests in
all four grades, ( t = [insert from stats output], p < .05)”
Chi-square
• When to use it?
– When you want to know if there is an association between two
categorical (nominal) variables (i.e., between an exposure and
outcome)
• Ex) Smoking (yes/no) and lung cancer (yes/no)
• Ex) Obesity (yes/no) and diabetes (yes/no)
• What does a chi-square test tell you?
– If the observed frequencies of occurrence in each group are
significantly different from expected frequencies (i.e., a
difference of proportions)
Chi-square
• What do the results look like?
– Chi-square test statistics = X2
• How do you interpret it?
– Usually, the higher the chi-square statistic, the
greater likelihood the finding is significant, but you
must look at the corresponding p-value to
determine significance
Tip: Chi square requires that there be 5 or more in each cell of a 2x2 table and 5 or more in 80% of
cells in larger tables. No cells can have a zero count.
How do you report chi-square?
“Distribution of obesity by gender showed
that 171 (38.9%) and 75 (17%) of women
were overweight and obese (Type I
&II), respectively. Whilst 118 (37.3%) and 12
(3.8%) of men were overweight and obese
(Type I & II), respectively (Table-II).
The Chi square test shows that these
differences are statistically significant
(p<0.001).”
“248 (56.4%) of women and 52
(16.6%) of men had abdominal
obesity (Fig-2). The Chi square
test shows that these differences
are statistically significant
(p<0.001).”
Logistic Regression
• When to use it?
– When you want to measure the strength and direction of
the association between two variables, where the
dependent or outcome variable is categorical (e.g., yes/no)
– When you want to predict the likelihood of an outcome
while controlling for confounders
• Ex) examine the relationship between health behavior
(smoking, exercise, low-fat diet) and arthritis (arthritis vs. no
arthritis)
• Ex) Predict the probability of stroke in relation to gender while
controlling for age or hypertension
• What does it tell you?
– The odds of an event occurring The probability of the
outcome event occurring divided by the probability of it
not occurring
Summary of Statistical Tests
Statistic Test Type of Data Needed Test Statistic Example
Correlation Two continuous
variables
Pearson’s r Are blood pressure and
weight correlated?
T-tests/ANOVA Means from a
continuous variable
taken from two or
more groups
Student’s t Do normal weight (group 1)
patients have lower blood
pressure than obese
patients (group 2)?
Chi-square Two categorical
variables
Chi-square X2 Are obese individuals
(obese vs. not obese)
significantly more likely to
have a stroke (stroke vs. no
stroke)?
Summary
• Descriptive statistics can be used with nominal and ordinal data
• Frequencies and percentages describe categorical data and
means and standard deviations describe continuous variables
• Inferential statistics can be used to determine associations
between variables and predict the likelihood of outcomes or
events
• Inferential statistics tell us if our findings are significant
Next Steps
• Think about the data that you have collected
or will collect as part of your research project
– What is your research question?
– What are you trying to get your data to “say”?
– Which statistical tests will best help you answer
your research question?
– Contact the bio-statistician research coordinator
to discuss how to analyze your data!

More Related Content

What's hot (20)

PPTX
Introduction to medical statistics
Mohamed Alhelaly
 
PDF
Mantel Haenszel methods in epidemiology (Stratification)
Rizwan S A
 
PDF
ANOVA test and correlation
Dr Lipilekha Patnaik
 
PPTX
Medical Statistics Pt 2
Fastbleep
 
PPTX
P-values the gold measure of statistical validity are not as reliable as many...
David Pratap
 
PPTX
Parmetric and non parametric statistical test in clinical trails
Vinod Pagidipalli
 
PPTX
Commonly used statistical tests in research
Naqeeb Ullah Khan
 
PDF
Statistical methods for the life sciences lb
priyaupm
 
PPTX
Commonly Used Statistics in Medical Research Part I
Pat Barlow
 
PDF
Non parametrict test
dobhalshiv
 
PPTX
Correlation Studies - Descriptive Studies
SalmaAsghar4
 
PPT
Chosing the appropriate_statistical_test
BRAJESH KUMAR PARASHAR
 
PPTX
Stratification and Mantel-Haenszel estimation
Vignesh Loganathan
 
PPTX
Statistic in research
Dalia El-Shafei
 
PPTX
Non parametric tests
Raghavendra Huchchannavar
 
PPTX
4.3.2. controlling confounding stratification
A M
 
PPT
Lesson 6 Nonparametric Test 2009 Ta
Sumit Prajapati
 
PPTX
Vergoulas Choosing the appropriate statistical test (2019 Hippokratia journal)
Vaggelis Vergoulas
 
PPT
Medical statistics2
Amany El-seoud
 
PPTX
Analysis 101
Brian Wells, MD, MS, MPH
 
Introduction to medical statistics
Mohamed Alhelaly
 
Mantel Haenszel methods in epidemiology (Stratification)
Rizwan S A
 
ANOVA test and correlation
Dr Lipilekha Patnaik
 
Medical Statistics Pt 2
Fastbleep
 
P-values the gold measure of statistical validity are not as reliable as many...
David Pratap
 
Parmetric and non parametric statistical test in clinical trails
Vinod Pagidipalli
 
Commonly used statistical tests in research
Naqeeb Ullah Khan
 
Statistical methods for the life sciences lb
priyaupm
 
Commonly Used Statistics in Medical Research Part I
Pat Barlow
 
Non parametrict test
dobhalshiv
 
Correlation Studies - Descriptive Studies
SalmaAsghar4
 
Chosing the appropriate_statistical_test
BRAJESH KUMAR PARASHAR
 
Stratification and Mantel-Haenszel estimation
Vignesh Loganathan
 
Statistic in research
Dalia El-Shafei
 
Non parametric tests
Raghavendra Huchchannavar
 
4.3.2. controlling confounding stratification
A M
 
Lesson 6 Nonparametric Test 2009 Ta
Sumit Prajapati
 
Vergoulas Choosing the appropriate statistical test (2019 Hippokratia journal)
Vaggelis Vergoulas
 
Medical statistics2
Amany El-seoud
 

Viewers also liked (20)

PPT
Aed1222 lesson 2
nurun2010
 
PPT
Chapter 01 power point
Ahmed El-Gendy
 
PPSX
Introduction to Statistics presentation
fazli khaliq
 
PDF
4. six sigma descriptive statistics
Hakeem-Ur- Rehman
 
PPTX
MD Paediatrics (Part 1) - Overview of Basic Statistics
Bernard Deepal W. Jayamanne
 
PPT
Introduction to statistics
Kapil Dev Ghante
 
PPT
Psychology Overview PowerPoint
KRyder
 
PDF
Introduction to Statistics - Basic Statistical Terms
sheisirenebkm
 
PPTX
Basic Statistics Presentation
IUBAT
 
PPTX
Introduction to Health GIS
Bernard Deepal W. Jayamanne
 
PDF
Data analysis
Nursing Path
 
PPT
Classification of caesarean section
limgengyan
 
PPTX
Inferential statistics powerpoint
kellula
 
PPTX
Statistical analysis using spss
jpcagphil
 
PPSX
Inferential statistics.ppt
Nursing Path
 
PPT
Quantitative analysis using SPSS
Alaa Sadik
 
PPT
Caesarean section
Jitendra Ingole
 
PPTX
Introduction to statistics
madan kumar
 
PPT
Statistical ppt
feminaargonza09
 
PPSX
Introduction to statistics...ppt rahul
Rahul Dhaker
 
Aed1222 lesson 2
nurun2010
 
Chapter 01 power point
Ahmed El-Gendy
 
Introduction to Statistics presentation
fazli khaliq
 
4. six sigma descriptive statistics
Hakeem-Ur- Rehman
 
MD Paediatrics (Part 1) - Overview of Basic Statistics
Bernard Deepal W. Jayamanne
 
Introduction to statistics
Kapil Dev Ghante
 
Psychology Overview PowerPoint
KRyder
 
Introduction to Statistics - Basic Statistical Terms
sheisirenebkm
 
Basic Statistics Presentation
IUBAT
 
Introduction to Health GIS
Bernard Deepal W. Jayamanne
 
Data analysis
Nursing Path
 
Classification of caesarean section
limgengyan
 
Inferential statistics powerpoint
kellula
 
Statistical analysis using spss
jpcagphil
 
Inferential statistics.ppt
Nursing Path
 
Quantitative analysis using SPSS
Alaa Sadik
 
Caesarean section
Jitendra Ingole
 
Introduction to statistics
madan kumar
 
Statistical ppt
feminaargonza09
 
Introduction to statistics...ppt rahul
Rahul Dhaker
 
Ad

Similar to Very good statistics-overview rbc (1) (20)

PPTX
Statistics.pptx
Subha322492
 
PPTX
statistic
Pwalmiki
 
PPTX
STATISTICAL TESTS USED IN VARIOUS STUDIES
ashishbharti990
 
PPTX
Biostatistics.pptx
SagarAGavankar
 
PPT
PARAMETRIC TEST in Public health dentistry.ppt
DrPARVATHYVINOD
 
PDF
Lecture notes on basic research statistics dr habibullah
Habibullah ADAMU
 
PPTX
Introduction to Statistics for future Biologists
RebecaBayon
 
PPTX
Overview of different statistical tests used in epidemiological
shefali jain
 
PPT
Statistics basics for oncologist kiran
Kiran Ramakrishna
 
PPTX
Inferential Statistics
Neny Isharyanti
 
PPTX
When to use, What Statistical Test for data Analysis modified.pptx
Asokan R
 
PPTX
Presentation 7.pptx
MuhammadUsman653449
 
PDF
Parametric & Non-Parametric tests SPSS WORKSHOPpdf
jyotshnasahoo5
 
PPTX
REVIEWCOMPREHENSIVE-EXAM. BY bjohn MBpptx
bjohnbagasala1
 
PPTX
Statistical test in spss
Bipin Neupane
 
PPTX
BIOSTATISTICS OVERALL JUNE 20241234567.pptx
anasabdulmajeed3sker
 
PDF
Statistical Tools that you can learn it.
joshuaandreicsabella
 
PPTX
TEST OF SIGNIFICANCE.pptx
JoicePjiji
 
PPT
Overview-of-Biostatistics-Jody-Kriemanpt
AbdirahmanIbrahimkad
 
PPT
Overview-of-Biostatistics-Jody-Krieman-5-6-09 (1).ppt
gebeyehu5
 
Statistics.pptx
Subha322492
 
statistic
Pwalmiki
 
STATISTICAL TESTS USED IN VARIOUS STUDIES
ashishbharti990
 
Biostatistics.pptx
SagarAGavankar
 
PARAMETRIC TEST in Public health dentistry.ppt
DrPARVATHYVINOD
 
Lecture notes on basic research statistics dr habibullah
Habibullah ADAMU
 
Introduction to Statistics for future Biologists
RebecaBayon
 
Overview of different statistical tests used in epidemiological
shefali jain
 
Statistics basics for oncologist kiran
Kiran Ramakrishna
 
Inferential Statistics
Neny Isharyanti
 
When to use, What Statistical Test for data Analysis modified.pptx
Asokan R
 
Presentation 7.pptx
MuhammadUsman653449
 
Parametric & Non-Parametric tests SPSS WORKSHOPpdf
jyotshnasahoo5
 
REVIEWCOMPREHENSIVE-EXAM. BY bjohn MBpptx
bjohnbagasala1
 
Statistical test in spss
Bipin Neupane
 
BIOSTATISTICS OVERALL JUNE 20241234567.pptx
anasabdulmajeed3sker
 
Statistical Tools that you can learn it.
joshuaandreicsabella
 
TEST OF SIGNIFICANCE.pptx
JoicePjiji
 
Overview-of-Biostatistics-Jody-Kriemanpt
AbdirahmanIbrahimkad
 
Overview-of-Biostatistics-Jody-Krieman-5-6-09 (1).ppt
gebeyehu5
 
Ad

More from Abdul Wasay Baloch (14)

PDF
Key to UHS dip card part 2 UQs
Abdul Wasay Baloch
 
PDF
Quick revision notes cardiology part 1 (1)
Abdul Wasay Baloch
 
PDF
Food and nutrition
Abdul Wasay Baloch
 
PDF
Behavioral Sciences Medicine
Abdul Wasay Baloch
 
PPT
Demography Com Medicine
Abdul Wasay Baloch
 
PPT
Occupational Hazards
Abdul Wasay Baloch
 
PDF
Disaster managment
Abdul Wasay Baloch
 
PPT
Introduction to Biostatistics
Abdul Wasay Baloch
 
PPT
Terminologies communicable diseases
Abdul Wasay Baloch
 
PDF
Pearls of ophthalmology
Abdul Wasay Baloch
 
PDF
Data Statistics
Abdul Wasay Baloch
 
PDF
behavioral Sciences Easy Notes MBBS
Abdul Wasay Baloch
 
PDF
Cohort and case con revised
Abdul Wasay Baloch
 
Key to UHS dip card part 2 UQs
Abdul Wasay Baloch
 
Quick revision notes cardiology part 1 (1)
Abdul Wasay Baloch
 
Food and nutrition
Abdul Wasay Baloch
 
Behavioral Sciences Medicine
Abdul Wasay Baloch
 
Demography Com Medicine
Abdul Wasay Baloch
 
Occupational Hazards
Abdul Wasay Baloch
 
Disaster managment
Abdul Wasay Baloch
 
Introduction to Biostatistics
Abdul Wasay Baloch
 
Terminologies communicable diseases
Abdul Wasay Baloch
 
Pearls of ophthalmology
Abdul Wasay Baloch
 
Data Statistics
Abdul Wasay Baloch
 
behavioral Sciences Easy Notes MBBS
Abdul Wasay Baloch
 
Cohort and case con revised
Abdul Wasay Baloch
 

Recently uploaded (20)

PPTX
PPT on the Development of Education in the Victorian England
Beena E S
 
PPTX
LEGAL ASPECTS OF PSYCHIATRUC NURSING.pptx
PoojaSen20
 
PDF
1, 2, 3… E MAIS UM CICLO CHEGA AO FIM!.pdf
Colégio Santa Teresinha
 
PPTX
2025 Winter SWAYAM NPTEL & A Student.pptx
Utsav Yagnik
 
PPTX
nutriquiz grade 4.pptx...............................................
ferdinandsanbuenaven
 
PPTX
CONVULSIVE DISORDERS: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PPTX
Mrs Mhondiwa Introduction to Algebra class
sabinaschimanga
 
PPTX
Optimizing Cancer Screening With MCED Technologies: From Science to Practical...
i3 Health
 
PPTX
How to Manage Access Rights & User Types in Odoo 18
Celine George
 
PPTX
Presentation: Climate Citizenship Digital Education
Karl Donert
 
PPTX
SAMPLING: DEFINITION,PROCESS,TYPES,SAMPLE SIZE, SAMPLING ERROR.pptx
PRADEEP ABOTHU
 
PPTX
The Human Eye and The Colourful World Class 10 NCERT Science.pptx
renutripathibharat
 
PPTX
How to Define Translation to Custom Module And Add a new language in Odoo 18
Celine George
 
PPTX
Blanket Order in Odoo 17 Purchase App - Odoo Slides
Celine George
 
PPTX
Explorando Recursos do Summer '25: Dicas Essenciais - 02
Mauricio Alexandre Silva
 
PPTX
classroom based quiz bee.pptx...................
ferdinandsanbuenaven
 
PPTX
SCHOOL-BASED SEXUAL HARASSMENT PREVENTION AND RESPONSE WORKSHOP
komlalokoe
 
PPTX
Pyhton with Mysql to perform CRUD operations.pptx
Ramakrishna Reddy Bijjam
 
PPTX
Nutri-QUIZ-Bee-Elementary.pptx...................
ferdinandsanbuenaven
 
PPTX
Gall bladder, Small intestine and Large intestine.pptx
rekhapositivity
 
PPT on the Development of Education in the Victorian England
Beena E S
 
LEGAL ASPECTS OF PSYCHIATRUC NURSING.pptx
PoojaSen20
 
1, 2, 3… E MAIS UM CICLO CHEGA AO FIM!.pdf
Colégio Santa Teresinha
 
2025 Winter SWAYAM NPTEL & A Student.pptx
Utsav Yagnik
 
nutriquiz grade 4.pptx...............................................
ferdinandsanbuenaven
 
CONVULSIVE DISORDERS: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
Mrs Mhondiwa Introduction to Algebra class
sabinaschimanga
 
Optimizing Cancer Screening With MCED Technologies: From Science to Practical...
i3 Health
 
How to Manage Access Rights & User Types in Odoo 18
Celine George
 
Presentation: Climate Citizenship Digital Education
Karl Donert
 
SAMPLING: DEFINITION,PROCESS,TYPES,SAMPLE SIZE, SAMPLING ERROR.pptx
PRADEEP ABOTHU
 
The Human Eye and The Colourful World Class 10 NCERT Science.pptx
renutripathibharat
 
How to Define Translation to Custom Module And Add a new language in Odoo 18
Celine George
 
Blanket Order in Odoo 17 Purchase App - Odoo Slides
Celine George
 
Explorando Recursos do Summer '25: Dicas Essenciais - 02
Mauricio Alexandre Silva
 
classroom based quiz bee.pptx...................
ferdinandsanbuenaven
 
SCHOOL-BASED SEXUAL HARASSMENT PREVENTION AND RESPONSE WORKSHOP
komlalokoe
 
Pyhton with Mysql to perform CRUD operations.pptx
Ramakrishna Reddy Bijjam
 
Nutri-QUIZ-Bee-Elementary.pptx...................
ferdinandsanbuenaven
 
Gall bladder, Small intestine and Large intestine.pptx
rekhapositivity
 

Very good statistics-overview rbc (1)

  • 1. Basic Statistics AIJAZ SOHAG MSc (Env:Sc),M.A.S(H.S.A.),MBA(Health Mgt),MPH,PhD
  • 2. Preface • The purpose of this presentation is to help you determine which statistical tests are appropriate for analyzing your data for research project. • Statistical tests that are presented here focuses on the most common techniques.
  • 3. Outline • Descriptive Statistics – Frequencies & percentages – Means & standard deviations • Inferential Statistics – Correlation – T-tests – Chi-square – Logistic Regression
  • 4. Types of Statistics/Analyses Descriptive Statistics – Frequencies – Basic measurements Inferential Statistics – Hypothesis Testing – Correlation – Confidence Intervals – Significance Testing – Prediction Describing a phenomena How many? How much? BP, HR, BMI, IQ, etc. Inferences about a phenomena Proving or disproving theories Associations between phenomena If sample relates to the larger population E.g., Diet and health
  • 5. Descriptive Statistics Descriptive statistics can be used to summarize and describe a single variable • Frequencies (counts) & Percentages – Use with categorical (nominal) data • Levels, types, groupings, yes/no, Drug A vs. Drug B • Means & Standard Deviations – Use with continuous data • Height, weight, cholesterol, scores on a test
  • 6. Frequencies & Percentages Look at the different ways we can display frequencies and percentages for this data: Table Bar chart Pie chart Good if more than 20 observations AKA frequency distributions – good if more than 20 observations
  • 7. Continuous  Categorical It is possible to take continuous data (such as hemoglobin levels) and turn it into categorical data by grouping values together. Then we can calculate frequencies and percentages for each group.
  • 8. Continuous  Categorical Distribution of Glasgow Coma Scale Scores Even though this is continuous data, it is being treated as “nominal” as it is broken down into groups or categoriesTip: It is usually better to collect continuous data and then break it down into categories for data analysis as opposed to collecting data that fits into preconceived categories.
  • 9. Ordinal Level Data Frequencies and percentages can be computed for ordinal data – Examples: Likert Scales (Strongly Disagree to Strongly Agree); High School/Some College/College Graduate/Graduate School 0 10 20 30 40 50 60 Strongly Agree Agree Disagree Strongly Disagree
  • 10. INFERENTIAL STATISTICS Inferential statistics can be used to prove or disprove theories, determine associations between variables, and determine if findings are significant and whether or not we can generalize from our sample to the entire population The types of inferential statistics we will go over: • Correlation • T-tests/ANOVA • Chi-square • Logistic Regression
  • 11. Correlation • When to use it? – When you want to know about the association or relationship between two continuous variables • Ex) food intake and weight; drug dosage and blood pressure; air temperature and metabolic rate, etc. • What does it tell you? – If a linear relationship exists between two variables, and how strong that relationship is • What do the results look like? – The correlation coefficient = Pearson’s r – Ranges from -1 to +1 – See next slide for examples of correlation results
  • 12. Correlation Guide for interpreting strength of correlations:  0 – 0.25 = Little or no relationship  0.25 – 0.50 = Fair degree of relationship  0.50 - 0.75 = Moderate degree of relationship  0.75 – 1.0 = Strong relationship  1.0 = perfect correlation
  • 13. Correlation • How do you interpret it? – If r is positive, high values of one variable are associated with high values of the other variable (both go in SAME direction - ↑↑ OR ↓↓) • Ex) Diastolic blood pressure tends to rise with age, thus the two variables are positively correlated – If r is negative, low values of one variable are associated with high values of the other variable (opposite direction - ↑↓ OR ↓ ↑) • Ex) Heart rate tends to be lower in persons who exercise frequently, the two variables correlate negatively – Correlation of 0 indicates NO linear relationship • How do you report it? – “Diastolic blood pressure was positively correlated with age (r = .75, p < . 05).” Tip: Correlation does NOT equal causation!!! Just because two variables are highly correlated, this does NOT mean that one CAUSES the other!!!
  • 14. T-tests • When to use them? – Paired t-tests: When comparing the MEANS of a continuous variable in two non-independent samples (i.e., measurements on the same people before and after a treatment) • Ex) Is diet X effective in lowering serum cholesterol levels in a sample of 12 people? • Ex) Do patients who receive drug X have lower blood pressure after treatment then they did before treatment? – Independent samples t-tests: To compare the MEANS of a continuous variable in TWO independent samples (i.e., two different groups of people) • Ex) Do people with diabetes have the same Systolic Blood Pressure as people without diabetes? • Ex) Do patients who receive a new drug treatment have lower blood pressure than those who receive a placebo? Tip: if you have > 2 different groups, you use ANOVA, which compares the means of 3 or more groups
  • 15. T-tests • What does a t-test tell you? – If there is a statistically significant difference between the mean score (or value) of two groups (either the same group of people before and after or two different groups of people) • What do the results look like? – Student’s t • How do you interpret it? – By looking at corresponding p-value • If p < .05, means are significantly different from each other • If p > 0.05, means are not significantly different from each other
  • 16. How do you report t-tests results? “As can be seen in Figure 1, specialty candidates had significantly higher scores on questions dealing with treatment than residency candidates (t = [insert t-value from stats output], p < .001). “As can be seen in Figure 1, children’s mean reading performance was significantly higher on the post-tests in all four grades, ( t = [insert from stats output], p < .05)”
  • 17. Chi-square • When to use it? – When you want to know if there is an association between two categorical (nominal) variables (i.e., between an exposure and outcome) • Ex) Smoking (yes/no) and lung cancer (yes/no) • Ex) Obesity (yes/no) and diabetes (yes/no) • What does a chi-square test tell you? – If the observed frequencies of occurrence in each group are significantly different from expected frequencies (i.e., a difference of proportions)
  • 18. Chi-square • What do the results look like? – Chi-square test statistics = X2 • How do you interpret it? – Usually, the higher the chi-square statistic, the greater likelihood the finding is significant, but you must look at the corresponding p-value to determine significance Tip: Chi square requires that there be 5 or more in each cell of a 2x2 table and 5 or more in 80% of cells in larger tables. No cells can have a zero count.
  • 19. How do you report chi-square? “Distribution of obesity by gender showed that 171 (38.9%) and 75 (17%) of women were overweight and obese (Type I &II), respectively. Whilst 118 (37.3%) and 12 (3.8%) of men were overweight and obese (Type I & II), respectively (Table-II). The Chi square test shows that these differences are statistically significant (p<0.001).” “248 (56.4%) of women and 52 (16.6%) of men had abdominal obesity (Fig-2). The Chi square test shows that these differences are statistically significant (p<0.001).”
  • 20. Logistic Regression • When to use it? – When you want to measure the strength and direction of the association between two variables, where the dependent or outcome variable is categorical (e.g., yes/no) – When you want to predict the likelihood of an outcome while controlling for confounders • Ex) examine the relationship between health behavior (smoking, exercise, low-fat diet) and arthritis (arthritis vs. no arthritis) • Ex) Predict the probability of stroke in relation to gender while controlling for age or hypertension • What does it tell you? – The odds of an event occurring The probability of the outcome event occurring divided by the probability of it not occurring
  • 21. Summary of Statistical Tests Statistic Test Type of Data Needed Test Statistic Example Correlation Two continuous variables Pearson’s r Are blood pressure and weight correlated? T-tests/ANOVA Means from a continuous variable taken from two or more groups Student’s t Do normal weight (group 1) patients have lower blood pressure than obese patients (group 2)? Chi-square Two categorical variables Chi-square X2 Are obese individuals (obese vs. not obese) significantly more likely to have a stroke (stroke vs. no stroke)?
  • 22. Summary • Descriptive statistics can be used with nominal and ordinal data • Frequencies and percentages describe categorical data and means and standard deviations describe continuous variables • Inferential statistics can be used to determine associations between variables and predict the likelihood of outcomes or events • Inferential statistics tell us if our findings are significant
  • 23. Next Steps • Think about the data that you have collected or will collect as part of your research project – What is your research question? – What are you trying to get your data to “say”? – Which statistical tests will best help you answer your research question? – Contact the bio-statistician research coordinator to discuss how to analyze your data!