SlideShare a Scribd company logo
Introduction to Statistics
Measures of Central Tendency
and Dispersion
• The phrase “descriptive statistics” is used
generically in place of measures of central
tendency and dispersion for inferential
statistics.
• These statistics describe or summarize the
qualities of data.
• Another name is “summary statistics”, which
are univariate:
– Mean, Median, Mode, Range, Standard Deviation,
Variance, Min, Max, etc.
Measures of Central Tendency
• These measures tap into the average
distribution of a set of scores or values in
the data.
– Mean
– Median
– Mode
What do you “Mean”?
The “mean” of some data is the average
score or value, such as the average
age of an MPA student or average
weight of professors that like to eat
donuts.
Inferential mean of a sample: X=(X)/n
Mean of a population: =(X)/N
Problem of being “mean”
• The main problem associated with the
mean value of some data is that it is
sensitive to outliers.
• Example, the average weight of political
science professors might be affected if
there was one in the department that
weighed 600 pounds.
Donut-Eating Professors
Professor Weight Weight
Schmuggles 165 165
Bopsey 213 213
Pallitto 189 410
Homer 187 610
Schnickerson 165 165
Levin 148 148
Honkey-Doorey 251 251
Zingers 308 308
Boehmer 151 151
Queenie 132 132
Googles-Boop 199 199
Calzone 227 227
194.6 248.3
The Median (not the cement in the middle
of the road)
• Because the mean average can be
sensitive to extreme values, the median
is sometimes useful and more
accurate.
• The median is simply the middle value
among some scores of a variable. (no
standard formula for its computation)
What is the Median?
Professor Weight
Schmuggles 165
Bopsey 213
Pallitto 189
Homer 187
Schnickerson 165
Levin 148
Honkey-Doorey 251
Zingers 308
Boehmer 151
Queenie 132
Googles-Boop 199
Calzone 227
194.6
Weight
132
148
151
165
165
187
189
199
213
227
251
308
Rank order
and choose
middle value.
If even then
average
between two
in the middle
Percentiles
• If we know the median, then we can go up
or down and rank the data as being above
or below certain thresholds.
• You may be familiar with standardized
tests. 90th percentile, your score was
higher than 90% of the rest of the sample.
The Mode (hold the pie and the ala)
(What does ‘ala’ taste like anyway??)
• The most frequent response or value
for a variable.
• Multiple modes are possible: bimodal
or multimodal.
Figuring the Mode
Professor Weight
Schmuggles 165
Bopsey 213
Pallitto 189
Homer 187
Schnickerson 165
Levin 148
Honkey-Doorey 251
Zingers 308
Boehmer 151
Queenie 132
Googles-Boop 199
Calzone 227
What is the mode?
Answer: 165
Important descriptive
information that may help
inform your research and
diagnose problems like lack
of variability.
Measures of Dispersion (not something
you cast…)
• Measures of dispersion tell us about
variability in the data. Also univariate.
• Basic question: how much do values differ
for a variable from the min to max, and
distance among scores in between. We
use:
– Range
– Standard Deviation
– Variance (standard deviation squared)
• To glean information from data, i.e. to
make an inference, we need to see
variability in our variables.
• Measures of dispersion give us
information about how much our
variables vary from the mean, because if
they don’t it makes it difficult infer
anything from the data. Dispersion is
also known as the spread or range of
variability.
The Range (no Buffalo roaming!!)
• r = h – l
– Where h is high and l is low
• In other words, the range gives us the
value between the minimum and maximum
values of a variable.
• Understanding this statistic is important in
understanding your data, especially for
management and diagnostic purposes.
The Normal Curve
• Bell-shaped distribution or curve
• Perfectly symmetrical about the mean.
Mean = median = mode
• Tails are asymptotic: closer and closer to
horizontal axis but never reach it.
Sample Distribution
• What does Andre do
to the sample
distribution?
• What is the probability
of finding someone
like Andre in the
population?
• Are you ready for
more inferential
statistics?
Normal curves and probability
Andre would be here
Dr. Boehmer would be here
The Standard Deviation
• A standardized measure of distance from
the mean.
• In other words, it allows you to know how far
some cases are located from the mean.
How extreme our your data?
• 68% of cases fall within one standard
deviation from the mean, 97% for two
deviations.
=square root
=sum (sigma)
X=score for each point in data
_
X=mean of scores for the variable
n=sample size (number of
observations or cases
S =
Formula for Standard Deviation
1)
-
(n
2
)
( X
X 

We can see that the Standard Deviation equals 165.2
pounds. The weight of Zinger is still likely skewing this
calculation (indirectly through the mean).
X X- mean x-mean squared
Smuggle 165 -29.6 875.2
Bopsey 213 18.4 339.2
Pallitto 189 -5.6 31.2
Homer 187 -7.6 57.5
Schnickerson 165 -29.6 875.2
Levin 148 -46.6 2170.0
Honkey-Doorey 251 56.4 3182.8
Zingers 308 113.4 12863.3
Boehmer 151 -43.6 1899.5
Queeny 132 -62.6 3916.7
Googles-boop 199 4.4 19.5
Calzone 227 32.4 1050.8
Mean 194.6 2480.1 49.8
Std. Deviation practice
• What is the value of Democracy one std.
deviation above and below the mean?
Descriptive Statistics
319 -10.00 10.00 3.4859 6.71282
319
Democ
Valid N (listwise)
N Minimum Maximum Mean Std. Deviation
The answer is 10.20872 and -3.22692
What percentage of all the cases fall within 10.2 and -
3.2?
Roughly 68%
Std. Deviation practice
Descriptive Statistics
139 19.77 97.12 66.1166 17.74849
139
Urbanpop
Valid N (listwise)
N Minimum Maximum Mean Std. Deviation
What is the value of Urban population one std. deviation
above and below the mean?
The answer is 83.86509 and 48.36811
What percentage of all the cases fall within 83.86 and 48.36?
Roughly 68%
Organizing and Graphing
Data
Goal of Graphing?
1. Presentation of Descriptive Statistics
2. Presentation of Evidence
3. Some people understand subject
matter better with visual aids
4. Provide a sense of the underlying
data generating process (scatter-
plots)
What is the Distribution?
• Gives us a picture of
the variability and
central tendency.
• Can also show the
amount of skewness
and Kurtosis.
Graphing Data: Types
Creating Frequencies
• We create frequencies by sorting data
by value or category and then
summing the cases that fall into those
values.
• How often do certain scores occur?
This is a basic descriptive data
question.
Ranking of Donut-eating Profs.
(most to least)
Zingers 308
Honkey-Doorey 251
Calzone 227
Bopsey 213
Googles-boop 199
Pallitto 189
Homer 187
Schnickerson 165
Smuggle 165
Boehmer 151
Levin 148
Queeny 132
Weight Class Intervals of Donut-Munching Professors
0
0.5
1
1.5
2
2.5
3
3.5
130-150 151-185 186-210 211-240 241-270 271-310 311+
Number
Here we have placed the Professors into
weight classes and depict with a histogram in
columns.
Weight Class Intervals of Donut-Munching Professors
0 0.5 1 1.5 2 2.5 3 3.5
130-150
151-185
186-210
211-240
241-270
271-310
311+
Number
Here it is another histogram depicted
as a bar graph.
Pie Charts:
Proportions of Donut-Eating Professors by Weight Class
130-150
151-185
186-210
211-240
241-270
271-310
311+
Actually, why not use a donut
graph. Duh!
Proportions of Donut-Eating Professors by Weight Class
130-150
151-185
186-210
211-240
241-270
271-310
311+
See Excel for other options!!!!
Line Graphs: A Time Series
0
10
20
30
40
50
60
70
80
90
100
1
9
8
1
1
9
8
2
1
9
8
3
1
9
8
4
1
9
8
5
1
9
8
6
1
9
8
7
1
9
8
8
1
9
8
9
1
9
9
0
1
9
9
1
1
9
9
2
1
9
9
3
1
9
9
4
1
9
9
5
1
9
9
6
1
9
9
7
1
9
9
8
1
9
9
9
2
0
0
0
2
0
0
1
Month
Approval
Approval
Economic approval
Scatter Plot (Two variable)
Presidential Approval and Unemployment
0
20
40
60
80
100
0 2 4 6 8 10 12
Unemployment
Approval
Approve

More Related Content

Similar to Introduction to Statistics23122223.ppt (20)

PPTX
Biostatistics Basics Descriptive and Estimation Methods
INAAMUL HAQ
 
PPTX
Basics of Stats (2).pptx
madihamaqbool6
 
PPT
Looking at data
pcalabri
 
PDF
Summarizing data
Dr Lipilekha Patnaik
 
PPT
Chapter 3
guest3720ca
 
PPT
Chapter 3
Rose Jenkins
 
PDF
Descriptive statistics and graphs
Avjinder (Avi) Kaler
 
PDF
Biostatistic ( descriptive statistics) MOHS
leocanon82
 
PPTX
Basic biostatistics dr.eezn
EhealthMoHS
 
PPTX
descriptive statistics.pptx
Teddyteddy53
 
PPT
Bio statistics
Nc Das
 
PDF
Statistics and permeability engineering reports
wwwmostafalaith99
 
PPTX
STATISTICS.pptx for the scholars and students
ssuseref12b21
 
PPT
data_management_review_descriptive_statistics.ppt
RestyLlagas1
 
PPTX
Statistics (GE 4 CLASS).pptx
YollyCalamba
 
PPTX
Basic statistics
Seth Anandaram Jaipuria College
 
PPTX
Biostatistics community medicine or Psm.pptx
eastmusings
 
PPTX
Descriptive Statistics.pptx
Shashank Mishra
 
PPTX
Measures of Central Tendency With Variance and Ranges.pptx
NecroManXer
 
PPTX
Complete Biostatistics (Descriptive and Inferential analysis)
DrAbdiwaliMohamedAbd
 
Biostatistics Basics Descriptive and Estimation Methods
INAAMUL HAQ
 
Basics of Stats (2).pptx
madihamaqbool6
 
Looking at data
pcalabri
 
Summarizing data
Dr Lipilekha Patnaik
 
Chapter 3
guest3720ca
 
Chapter 3
Rose Jenkins
 
Descriptive statistics and graphs
Avjinder (Avi) Kaler
 
Biostatistic ( descriptive statistics) MOHS
leocanon82
 
Basic biostatistics dr.eezn
EhealthMoHS
 
descriptive statistics.pptx
Teddyteddy53
 
Bio statistics
Nc Das
 
Statistics and permeability engineering reports
wwwmostafalaith99
 
STATISTICS.pptx for the scholars and students
ssuseref12b21
 
data_management_review_descriptive_statistics.ppt
RestyLlagas1
 
Statistics (GE 4 CLASS).pptx
YollyCalamba
 
Biostatistics community medicine or Psm.pptx
eastmusings
 
Descriptive Statistics.pptx
Shashank Mishra
 
Measures of Central Tendency With Variance and Ranges.pptx
NecroManXer
 
Complete Biostatistics (Descriptive and Inferential analysis)
DrAbdiwaliMohamedAbd
 

Recently uploaded (20)

PPTX
Artificial Intelligence in Gastroentrology: Advancements and Future Presprec...
AyanHossain
 
PDF
Tips for Writing the Research Title with Examples
Thelma Villaflores
 
PPTX
Unlock the Power of Cursor AI: MuleSoft Integrations
Veera Pallapu
 
PPTX
quizbeenutirtion-230726075512-0387d08e.pptx
domingoriahlyne
 
PDF
Stepwise procedure (Manually Submitted & Un Attended) Medical Devices Cases
MUHAMMAD SOHAIL
 
PPTX
MALABSORPTION SYNDROME: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PPTX
TOP 10 AI TOOLS YOU MUST LEARN TO SURVIVE IN 2025 AND ABOVE
digilearnings.com
 
PDF
Exploring-the-Investigative-World-of-Science.pdf/8th class curiosity/1st chap...
Sandeep Swamy
 
PPTX
HERNIA: INGUINAL HERNIA, UMBLICAL HERNIA.pptx
PRADEEP ABOTHU
 
PPTX
INTESTINALPARASITES OR WORM INFESTATIONS.pptx
PRADEEP ABOTHU
 
PDF
FULL DOCUMENT: Read the full Deloitte and Touche audit report on the National...
Kweku Zurek
 
PPTX
THE HUMAN INTEGUMENTARY SYSTEM#MLT#BCRAPC.pptx
Subham Panja
 
PPTX
Cybersecurity: How to Protect your Digital World from Hackers
vaidikpanda4
 
PDF
TOP 10 AI TOOLS YOU MUST LEARN TO SURVIVE IN 2025 AND ABOVE
digilearnings.com
 
PPTX
YSPH VMOC Special Report - Measles Outbreak Southwest US 7-20-2025.pptx
Yale School of Public Health - The Virtual Medical Operations Center (VMOC)
 
PPTX
Electrophysiology_of_Heart. Electrophysiology studies in Cardiovascular syste...
Rajshri Ghogare
 
PPTX
Nutrition Quiz bee for elementary 2025 1.pptx
RichellMarianoPugal
 
PPTX
Top 10 AI Tools, Like ChatGPT. You Must Learn In 2025
Digilearnings
 
PPTX
Orientation MOOCs on SWAYAM for Teachers
moocs1
 
PPTX
HEALTH CARE DELIVERY SYSTEM - UNIT 2 - GNM 3RD YEAR.pptx
Priyanshu Anand
 
Artificial Intelligence in Gastroentrology: Advancements and Future Presprec...
AyanHossain
 
Tips for Writing the Research Title with Examples
Thelma Villaflores
 
Unlock the Power of Cursor AI: MuleSoft Integrations
Veera Pallapu
 
quizbeenutirtion-230726075512-0387d08e.pptx
domingoriahlyne
 
Stepwise procedure (Manually Submitted & Un Attended) Medical Devices Cases
MUHAMMAD SOHAIL
 
MALABSORPTION SYNDROME: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
TOP 10 AI TOOLS YOU MUST LEARN TO SURVIVE IN 2025 AND ABOVE
digilearnings.com
 
Exploring-the-Investigative-World-of-Science.pdf/8th class curiosity/1st chap...
Sandeep Swamy
 
HERNIA: INGUINAL HERNIA, UMBLICAL HERNIA.pptx
PRADEEP ABOTHU
 
INTESTINALPARASITES OR WORM INFESTATIONS.pptx
PRADEEP ABOTHU
 
FULL DOCUMENT: Read the full Deloitte and Touche audit report on the National...
Kweku Zurek
 
THE HUMAN INTEGUMENTARY SYSTEM#MLT#BCRAPC.pptx
Subham Panja
 
Cybersecurity: How to Protect your Digital World from Hackers
vaidikpanda4
 
TOP 10 AI TOOLS YOU MUST LEARN TO SURVIVE IN 2025 AND ABOVE
digilearnings.com
 
YSPH VMOC Special Report - Measles Outbreak Southwest US 7-20-2025.pptx
Yale School of Public Health - The Virtual Medical Operations Center (VMOC)
 
Electrophysiology_of_Heart. Electrophysiology studies in Cardiovascular syste...
Rajshri Ghogare
 
Nutrition Quiz bee for elementary 2025 1.pptx
RichellMarianoPugal
 
Top 10 AI Tools, Like ChatGPT. You Must Learn In 2025
Digilearnings
 
Orientation MOOCs on SWAYAM for Teachers
moocs1
 
HEALTH CARE DELIVERY SYSTEM - UNIT 2 - GNM 3RD YEAR.pptx
Priyanshu Anand
 
Ad

Introduction to Statistics23122223.ppt

  • 1. Introduction to Statistics Measures of Central Tendency and Dispersion
  • 2. • The phrase “descriptive statistics” is used generically in place of measures of central tendency and dispersion for inferential statistics. • These statistics describe or summarize the qualities of data. • Another name is “summary statistics”, which are univariate: – Mean, Median, Mode, Range, Standard Deviation, Variance, Min, Max, etc.
  • 3. Measures of Central Tendency • These measures tap into the average distribution of a set of scores or values in the data. – Mean – Median – Mode
  • 4. What do you “Mean”? The “mean” of some data is the average score or value, such as the average age of an MPA student or average weight of professors that like to eat donuts. Inferential mean of a sample: X=(X)/n Mean of a population: =(X)/N
  • 5. Problem of being “mean” • The main problem associated with the mean value of some data is that it is sensitive to outliers. • Example, the average weight of political science professors might be affected if there was one in the department that weighed 600 pounds.
  • 6. Donut-Eating Professors Professor Weight Weight Schmuggles 165 165 Bopsey 213 213 Pallitto 189 410 Homer 187 610 Schnickerson 165 165 Levin 148 148 Honkey-Doorey 251 251 Zingers 308 308 Boehmer 151 151 Queenie 132 132 Googles-Boop 199 199 Calzone 227 227 194.6 248.3
  • 7. The Median (not the cement in the middle of the road) • Because the mean average can be sensitive to extreme values, the median is sometimes useful and more accurate. • The median is simply the middle value among some scores of a variable. (no standard formula for its computation)
  • 8. What is the Median? Professor Weight Schmuggles 165 Bopsey 213 Pallitto 189 Homer 187 Schnickerson 165 Levin 148 Honkey-Doorey 251 Zingers 308 Boehmer 151 Queenie 132 Googles-Boop 199 Calzone 227 194.6 Weight 132 148 151 165 165 187 189 199 213 227 251 308 Rank order and choose middle value. If even then average between two in the middle
  • 9. Percentiles • If we know the median, then we can go up or down and rank the data as being above or below certain thresholds. • You may be familiar with standardized tests. 90th percentile, your score was higher than 90% of the rest of the sample.
  • 10. The Mode (hold the pie and the ala) (What does ‘ala’ taste like anyway??) • The most frequent response or value for a variable. • Multiple modes are possible: bimodal or multimodal.
  • 11. Figuring the Mode Professor Weight Schmuggles 165 Bopsey 213 Pallitto 189 Homer 187 Schnickerson 165 Levin 148 Honkey-Doorey 251 Zingers 308 Boehmer 151 Queenie 132 Googles-Boop 199 Calzone 227 What is the mode? Answer: 165 Important descriptive information that may help inform your research and diagnose problems like lack of variability.
  • 12. Measures of Dispersion (not something you cast…) • Measures of dispersion tell us about variability in the data. Also univariate. • Basic question: how much do values differ for a variable from the min to max, and distance among scores in between. We use: – Range – Standard Deviation – Variance (standard deviation squared)
  • 13. • To glean information from data, i.e. to make an inference, we need to see variability in our variables. • Measures of dispersion give us information about how much our variables vary from the mean, because if they don’t it makes it difficult infer anything from the data. Dispersion is also known as the spread or range of variability.
  • 14. The Range (no Buffalo roaming!!) • r = h – l – Where h is high and l is low • In other words, the range gives us the value between the minimum and maximum values of a variable. • Understanding this statistic is important in understanding your data, especially for management and diagnostic purposes.
  • 15. The Normal Curve • Bell-shaped distribution or curve • Perfectly symmetrical about the mean. Mean = median = mode • Tails are asymptotic: closer and closer to horizontal axis but never reach it.
  • 16. Sample Distribution • What does Andre do to the sample distribution? • What is the probability of finding someone like Andre in the population? • Are you ready for more inferential statistics?
  • 17. Normal curves and probability Andre would be here Dr. Boehmer would be here
  • 18. The Standard Deviation • A standardized measure of distance from the mean. • In other words, it allows you to know how far some cases are located from the mean. How extreme our your data? • 68% of cases fall within one standard deviation from the mean, 97% for two deviations.
  • 19. =square root =sum (sigma) X=score for each point in data _ X=mean of scores for the variable n=sample size (number of observations or cases S = Formula for Standard Deviation 1) - (n 2 ) ( X X  
  • 20. We can see that the Standard Deviation equals 165.2 pounds. The weight of Zinger is still likely skewing this calculation (indirectly through the mean). X X- mean x-mean squared Smuggle 165 -29.6 875.2 Bopsey 213 18.4 339.2 Pallitto 189 -5.6 31.2 Homer 187 -7.6 57.5 Schnickerson 165 -29.6 875.2 Levin 148 -46.6 2170.0 Honkey-Doorey 251 56.4 3182.8 Zingers 308 113.4 12863.3 Boehmer 151 -43.6 1899.5 Queeny 132 -62.6 3916.7 Googles-boop 199 4.4 19.5 Calzone 227 32.4 1050.8 Mean 194.6 2480.1 49.8
  • 21. Std. Deviation practice • What is the value of Democracy one std. deviation above and below the mean? Descriptive Statistics 319 -10.00 10.00 3.4859 6.71282 319 Democ Valid N (listwise) N Minimum Maximum Mean Std. Deviation The answer is 10.20872 and -3.22692 What percentage of all the cases fall within 10.2 and - 3.2? Roughly 68%
  • 22. Std. Deviation practice Descriptive Statistics 139 19.77 97.12 66.1166 17.74849 139 Urbanpop Valid N (listwise) N Minimum Maximum Mean Std. Deviation What is the value of Urban population one std. deviation above and below the mean? The answer is 83.86509 and 48.36811 What percentage of all the cases fall within 83.86 and 48.36? Roughly 68%
  • 24. Goal of Graphing? 1. Presentation of Descriptive Statistics 2. Presentation of Evidence 3. Some people understand subject matter better with visual aids 4. Provide a sense of the underlying data generating process (scatter- plots)
  • 25. What is the Distribution? • Gives us a picture of the variability and central tendency. • Can also show the amount of skewness and Kurtosis.
  • 27. Creating Frequencies • We create frequencies by sorting data by value or category and then summing the cases that fall into those values. • How often do certain scores occur? This is a basic descriptive data question.
  • 28. Ranking of Donut-eating Profs. (most to least) Zingers 308 Honkey-Doorey 251 Calzone 227 Bopsey 213 Googles-boop 199 Pallitto 189 Homer 187 Schnickerson 165 Smuggle 165 Boehmer 151 Levin 148 Queeny 132
  • 29. Weight Class Intervals of Donut-Munching Professors 0 0.5 1 1.5 2 2.5 3 3.5 130-150 151-185 186-210 211-240 241-270 271-310 311+ Number Here we have placed the Professors into weight classes and depict with a histogram in columns.
  • 30. Weight Class Intervals of Donut-Munching Professors 0 0.5 1 1.5 2 2.5 3 3.5 130-150 151-185 186-210 211-240 241-270 271-310 311+ Number Here it is another histogram depicted as a bar graph.
  • 31. Pie Charts: Proportions of Donut-Eating Professors by Weight Class 130-150 151-185 186-210 211-240 241-270 271-310 311+
  • 32. Actually, why not use a donut graph. Duh! Proportions of Donut-Eating Professors by Weight Class 130-150 151-185 186-210 211-240 241-270 271-310 311+ See Excel for other options!!!!
  • 33. Line Graphs: A Time Series 0 10 20 30 40 50 60 70 80 90 100 1 9 8 1 1 9 8 2 1 9 8 3 1 9 8 4 1 9 8 5 1 9 8 6 1 9 8 7 1 9 8 8 1 9 8 9 1 9 9 0 1 9 9 1 1 9 9 2 1 9 9 3 1 9 9 4 1 9 9 5 1 9 9 6 1 9 9 7 1 9 9 8 1 9 9 9 2 0 0 0 2 0 0 1 Month Approval Approval Economic approval
  • 34. Scatter Plot (Two variable) Presidential Approval and Unemployment 0 20 40 60 80 100 0 2 4 6 8 10 12 Unemployment Approval Approve