SlideShare a Scribd company logo
Descriptive statistics
Statistics
• Many studies generate large numbers of
data points, and to make sense of all that
data, researchers use statistics that
summarize the data, providing a better
understanding of overall tendencies within
the distributions of scores.
Types of statistics
• Types of statistics:
1. descriptive (which summarize some characteristic of a
sample)
• Measures of central tendency
• Measures of dispersion
• Measures of skewness
2. inferential (which test for significant differences
between groups and/or significant relationships
among variables within the sample
• t-ratio, chi-square, beta-value
Reasons for using statistics
• aids in summarizing the results
• helps us recognize underlying trends and
tendencies in the data
• aids in communicating the results to others
Descriptive_statistics - Sample 1.pptx
Descriptive statistics
• If we wanted to characterize the students in this class we
would find that they are:
– Young
– From Kentucky
– Fit
– Male
• How young?
• How Kentuckian is this class?
• How fit is this class?
• What is the distribution of males and females?
Frequency distribution
• The frequency with which observations are
assigned to each category or point on a
measurement scale.
– Most basic form of descriptive statistics
– May be expressed as a percentage of the total
sample found in each category
Source : Reasoning with Statistics, by Frederick Williams &
Peter Monge, fifth edition, Harcourt College Publishers.
Frequency distribution
• The distribution is “read” differently
depending upon the measurement level
– Nominal scales are read as discrete
measurements at each level
– Ordinal measures show tendencies, but
categories should not be compared
– Interval and ratio scales allow for comparison
among categories
Ancestry of US residents
Source: Protecting Children from Harmful Television: TV Ratings and the V-chip
Amy I. Nathanson, PhD Lecturer, University of California at Santa Barbara
Joanne Cantor, PhD Professor, Communication Arts, University of Wisconsin-Madison
Source: UCLA International Institute
Source: www.cit.cornell.edu/computer/students/bandwidth/charts.html
Source: www.cit.cornell.edu/computer/students/bandwidth/charts.html
Source: Verisign
Descriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptx
Normal distribution
• Many characteristics are distributed through the
population in a ‘normal’ manner
– Normal curves have well-defined statistical properties
– Parametric statistics are based on the assumption that
the variables are distributed normally
• Most commonly used statistics
• This is the famous “Bell curve” where many cases
fall near the middle of the distribution and few fall
very high or very low
– I.Q.
I.Q. distribution
Measures of central tendency
• These measures give us an idea what the ‘typical’
case in a distribution is like
• Mode (Mo): the most frequent score in a distribution
• good for nominal data
• Median (Mdn): the midpoint or midscore in a
distribution.
• (50% cases above/50% cases below)
– insensitive to extreme cases
--Interval or ratio
Source : Reasoning with Statistics, by Frederick Williams & Peter Monge, fifth edition, Harcourt College Publishers.
Measures of central tendency
• Mean
– The ‘average’ score—sum of all individual
scores divided by the number of scores
– has a number of useful statistical properties
• however, can be sensitive to extreme scores
(“outliers”)
– many statistics are based on the mean
Source: www.wilderdom.com/.../L2-1UnderstandingIQ.html
Statistics estimating dispersion
• Some statistics look at how widely scattered over
the scale the individual scores are
• Groups with identical means can be more or less
widely dispersed
• To find out how the group is distributed, we need
to know how far from or close to the mean
individual scores are
• Like the mean, these statistics are only meaningful
for interval or ratio-level measures
Estimates of dispersion
• Range
• Distance between the highest and lowest scores in
a distribution;
• sensitive to extreme scores;
• Can compensate by calculating interquartile range
(distance between the 25th and 75th percentile
points) which represents the range of scores for the
middle half of a distribution
Usually used in combination with other measures of
dispersion.
Range
Source: www.animatedsoftware.com/ statglos/sgrange.htm
Source: http://pse.cs.vt.edu/SoSci/converted/Dispersion_I/box_n_hist.gif
Variance (S2)
– Average of squared distances of individual points from
the mean
• sample variance
– High variance means that most scores are far away from
the mean. Low variance indicates that most scores cluster
tightly about the mean.
– The amount that one score differs from the mean is called
its deviation score (deviate)
– The sum of all deviation scores in a sample is called the
sum of squares
Estimates of dispersion
A summary statistic of how much scores vary
from the mean
Square root of the Variance
– expressed in the original units of measurement
– Represents the average amount of dispersion in
a sample
– Used in a number of inferential statistics
Standard Deviation (SD)
Skewness of distributions
• Measures look at how lopsided distributions are—how far
from the ideal of the normal curve they are
• When the median and the mean are different, the
distribution is skewed. The greater the difference, the
greater the skew.
• Distributions that trail away to the left are negatively
skewed and those that trail away to the right are positively
skewed
• If the skewness is extreme, the researcher should either
transform the data to make them better resemble a normal
curve or else use a different set of statistics—
nonparametric statistics—to carry out the analysis
Descriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptx
Distribution of posting frequency on Usenet
So
• Descriptive statistics are used to summarize
data from individual respondents, etc.
– They help to make sense of large numbers of
individual responses, to communicate the
essence of those responses to others
• They focus on typical or average scores, the
dispersion of scores over the available
responses, and the shape of the response
curve
Ad

More Related Content

Similar to Descriptive_statistics - Sample 1.pptx (20)

Presentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlalPresentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlal
IstiqlalEid
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Sarfraz Ahmad
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Attaullah Khan
 
Introduction to Statistics53004300.ppt
Introduction to Statistics53004300.pptIntroduction to Statistics53004300.ppt
Introduction to Statistics53004300.ppt
TripthiDubey
 
Biostatistics.pptx
Biostatistics.pptxBiostatistics.pptx
Biostatistics.pptx
Tawhid4
 
MIDTERM-2_PEC-8-ASSESSMENT-OF-LEARNING-1.pptx
MIDTERM-2_PEC-8-ASSESSMENT-OF-LEARNING-1.pptxMIDTERM-2_PEC-8-ASSESSMENT-OF-LEARNING-1.pptx
MIDTERM-2_PEC-8-ASSESSMENT-OF-LEARNING-1.pptx
AdonaRomasanta
 
Applied statistics lecture_2
Applied statistics lecture_2Applied statistics lecture_2
Applied statistics lecture_2
Daria Bogdanova
 
Ch5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxCh5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptx
zerihunnana
 
Stats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.pptStats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.ppt
DiptoKumerSarker1
 
Statr sessions 4 to 6
Statr sessions 4 to 6Statr sessions 4 to 6
Statr sessions 4 to 6
Ruru Chowdhury
 
06 samples and-populations
06 samples and-populations06 samples and-populations
06 samples and-populations
Todd Bill
 
Intro statistics
Intro statisticsIntro statistics
Intro statistics
Arash Kamrani
 
RM7.ppt
RM7.pptRM7.ppt
RM7.ppt
HanaKassahun1
 
Introduction to Statistics2312.ppt
Introduction to Statistics2312.pptIntroduction to Statistics2312.ppt
Introduction to Statistics2312.ppt
pathianithanaidu
 
Introduction to Statistics23122223.ppt
Introduction to Statistics23122223.pptIntroduction to Statistics23122223.ppt
Introduction to Statistics23122223.ppt
pathianithanaidu
 
Measure of central tendency grouped data.pptx
Measure of central tendency grouped data.pptxMeasure of central tendency grouped data.pptx
Measure of central tendency grouped data.pptx
SandeAlotaBoco
 
Review of Basic Statistics and Terminology
Review of Basic Statistics and TerminologyReview of Basic Statistics and Terminology
Review of Basic Statistics and Terminology
aswhite
 
MEASURE-OF-VARIABILITY- for students. Ppt
MEASURE-OF-VARIABILITY- for students. PptMEASURE-OF-VARIABILITY- for students. Ppt
MEASURE-OF-VARIABILITY- for students. Ppt
PrincessjaynoviaKali
 
Planning-Data-Analysis-CHOOSING-STATISTICAL-TOOL.docx
Planning-Data-Analysis-CHOOSING-STATISTICAL-TOOL.docxPlanning-Data-Analysis-CHOOSING-STATISTICAL-TOOL.docx
Planning-Data-Analysis-CHOOSING-STATISTICAL-TOOL.docx
emmanuelangelof
 
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docxANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
cullenrjzsme
 
Presentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlalPresentation research- chapter 10-11 istiqlal
Presentation research- chapter 10-11 istiqlal
IstiqlalEid
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Sarfraz Ahmad
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Attaullah Khan
 
Introduction to Statistics53004300.ppt
Introduction to Statistics53004300.pptIntroduction to Statistics53004300.ppt
Introduction to Statistics53004300.ppt
TripthiDubey
 
Biostatistics.pptx
Biostatistics.pptxBiostatistics.pptx
Biostatistics.pptx
Tawhid4
 
MIDTERM-2_PEC-8-ASSESSMENT-OF-LEARNING-1.pptx
MIDTERM-2_PEC-8-ASSESSMENT-OF-LEARNING-1.pptxMIDTERM-2_PEC-8-ASSESSMENT-OF-LEARNING-1.pptx
MIDTERM-2_PEC-8-ASSESSMENT-OF-LEARNING-1.pptx
AdonaRomasanta
 
Applied statistics lecture_2
Applied statistics lecture_2Applied statistics lecture_2
Applied statistics lecture_2
Daria Bogdanova
 
Ch5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxCh5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptx
zerihunnana
 
Stats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.pptStats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.ppt
DiptoKumerSarker1
 
06 samples and-populations
06 samples and-populations06 samples and-populations
06 samples and-populations
Todd Bill
 
Introduction to Statistics2312.ppt
Introduction to Statistics2312.pptIntroduction to Statistics2312.ppt
Introduction to Statistics2312.ppt
pathianithanaidu
 
Introduction to Statistics23122223.ppt
Introduction to Statistics23122223.pptIntroduction to Statistics23122223.ppt
Introduction to Statistics23122223.ppt
pathianithanaidu
 
Measure of central tendency grouped data.pptx
Measure of central tendency grouped data.pptxMeasure of central tendency grouped data.pptx
Measure of central tendency grouped data.pptx
SandeAlotaBoco
 
Review of Basic Statistics and Terminology
Review of Basic Statistics and TerminologyReview of Basic Statistics and Terminology
Review of Basic Statistics and Terminology
aswhite
 
MEASURE-OF-VARIABILITY- for students. Ppt
MEASURE-OF-VARIABILITY- for students. PptMEASURE-OF-VARIABILITY- for students. Ppt
MEASURE-OF-VARIABILITY- for students. Ppt
PrincessjaynoviaKali
 
Planning-Data-Analysis-CHOOSING-STATISTICAL-TOOL.docx
Planning-Data-Analysis-CHOOSING-STATISTICAL-TOOL.docxPlanning-Data-Analysis-CHOOSING-STATISTICAL-TOOL.docx
Planning-Data-Analysis-CHOOSING-STATISTICAL-TOOL.docx
emmanuelangelof
 
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docxANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
cullenrjzsme
 

Recently uploaded (20)

Customer Segmentation using K-Means clustering
Customer Segmentation using K-Means clusteringCustomer Segmentation using K-Means clustering
Customer Segmentation using K-Means clustering
Ingrid Nyakerario
 
L1_Slides_Foundational Concepts_508.pptx
L1_Slides_Foundational Concepts_508.pptxL1_Slides_Foundational Concepts_508.pptx
L1_Slides_Foundational Concepts_508.pptx
38NoopurPatel
 
vMix Pro Crack + Serial Number Torrent free Download
vMix Pro Crack + Serial Number Torrent free DownloadvMix Pro Crack + Serial Number Torrent free Download
vMix Pro Crack + Serial Number Torrent free Download
eyeskye547
 
Lagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdfLagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdf
benuju2016
 
Process Mining and Official Statistics - CBS
Process Mining and Official Statistics - CBSProcess Mining and Official Statistics - CBS
Process Mining and Official Statistics - CBS
Process mining Evangelist
 
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
OlhaTatokhina1
 
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
Taqyea
 
problem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursingproblem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursing
vishnudathas123
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
Taqyea
 
定制(意大利Rimini毕业证)布鲁诺马代尔纳嘉雷迪米音乐学院学历认证
定制(意大利Rimini毕业证)布鲁诺马代尔纳嘉雷迪米音乐学院学历认证定制(意大利Rimini毕业证)布鲁诺马代尔纳嘉雷迪米音乐学院学历认证
定制(意大利Rimini毕业证)布鲁诺马代尔纳嘉雷迪米音乐学院学历认证
Taqyea
 
HershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistributionHershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistribution
hershtara1
 
hersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distributionhersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distribution
hershtara1
 
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docxAnalysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
hershtara1
 
4. Multivariable statistics_Using Stata_2025.pdf
4. Multivariable statistics_Using Stata_2025.pdf4. Multivariable statistics_Using Stata_2025.pdf
4. Multivariable statistics_Using Stata_2025.pdf
axonneurologycenter1
 
Adopting Process Mining at the Rabobank - use case
Adopting Process Mining at the Rabobank - use caseAdopting Process Mining at the Rabobank - use case
Adopting Process Mining at the Rabobank - use case
Process mining Evangelist
 
Suncorp - Integrating Process Mining at Australia's Largest Insurer
Suncorp - Integrating Process Mining at Australia's Largest InsurerSuncorp - Integrating Process Mining at Australia's Largest Insurer
Suncorp - Integrating Process Mining at Australia's Largest Insurer
Process mining Evangelist
 
AWS-AIML-PRESENTATION RELATED TO DATA SCIENCE TO DATA
AWS-AIML-PRESENTATION RELATED TO DATA SCIENCE TO DATAAWS-AIML-PRESENTATION RELATED TO DATA SCIENCE TO DATA
AWS-AIML-PRESENTATION RELATED TO DATA SCIENCE TO DATA
SnehaBoja
 
RAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit FrameworkRAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit Framework
apanneer
 
Process Mining at Dimension Data - Jan vermeulen
Process Mining at Dimension Data - Jan vermeulenProcess Mining at Dimension Data - Jan vermeulen
Process Mining at Dimension Data - Jan vermeulen
Process mining Evangelist
 
Customer Segmentation using K-Means clustering
Customer Segmentation using K-Means clusteringCustomer Segmentation using K-Means clustering
Customer Segmentation using K-Means clustering
Ingrid Nyakerario
 
L1_Slides_Foundational Concepts_508.pptx
L1_Slides_Foundational Concepts_508.pptxL1_Slides_Foundational Concepts_508.pptx
L1_Slides_Foundational Concepts_508.pptx
38NoopurPatel
 
vMix Pro Crack + Serial Number Torrent free Download
vMix Pro Crack + Serial Number Torrent free DownloadvMix Pro Crack + Serial Number Torrent free Download
vMix Pro Crack + Serial Number Torrent free Download
eyeskye547
 
Lagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdfLagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdf
benuju2016
 
Process Mining and Official Statistics - CBS
Process Mining and Official Statistics - CBSProcess Mining and Official Statistics - CBS
Process Mining and Official Statistics - CBS
Process mining Evangelist
 
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
OlhaTatokhina1
 
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
Taqyea
 
problem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursingproblem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursing
vishnudathas123
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
Taqyea
 
定制(意大利Rimini毕业证)布鲁诺马代尔纳嘉雷迪米音乐学院学历认证
定制(意大利Rimini毕业证)布鲁诺马代尔纳嘉雷迪米音乐学院学历认证定制(意大利Rimini毕业证)布鲁诺马代尔纳嘉雷迪米音乐学院学历认证
定制(意大利Rimini毕业证)布鲁诺马代尔纳嘉雷迪米音乐学院学历认证
Taqyea
 
HershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistributionHershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistribution
hershtara1
 
hersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distributionhersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distribution
hershtara1
 
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docxAnalysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
hershtara1
 
4. Multivariable statistics_Using Stata_2025.pdf
4. Multivariable statistics_Using Stata_2025.pdf4. Multivariable statistics_Using Stata_2025.pdf
4. Multivariable statistics_Using Stata_2025.pdf
axonneurologycenter1
 
Adopting Process Mining at the Rabobank - use case
Adopting Process Mining at the Rabobank - use caseAdopting Process Mining at the Rabobank - use case
Adopting Process Mining at the Rabobank - use case
Process mining Evangelist
 
Suncorp - Integrating Process Mining at Australia's Largest Insurer
Suncorp - Integrating Process Mining at Australia's Largest InsurerSuncorp - Integrating Process Mining at Australia's Largest Insurer
Suncorp - Integrating Process Mining at Australia's Largest Insurer
Process mining Evangelist
 
AWS-AIML-PRESENTATION RELATED TO DATA SCIENCE TO DATA
AWS-AIML-PRESENTATION RELATED TO DATA SCIENCE TO DATAAWS-AIML-PRESENTATION RELATED TO DATA SCIENCE TO DATA
AWS-AIML-PRESENTATION RELATED TO DATA SCIENCE TO DATA
SnehaBoja
 
RAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit FrameworkRAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit Framework
apanneer
 
Process Mining at Dimension Data - Jan vermeulen
Process Mining at Dimension Data - Jan vermeulenProcess Mining at Dimension Data - Jan vermeulen
Process Mining at Dimension Data - Jan vermeulen
Process mining Evangelist
 
Ad

Descriptive_statistics - Sample 1.pptx

  • 2. Statistics • Many studies generate large numbers of data points, and to make sense of all that data, researchers use statistics that summarize the data, providing a better understanding of overall tendencies within the distributions of scores.
  • 3. Types of statistics • Types of statistics: 1. descriptive (which summarize some characteristic of a sample) • Measures of central tendency • Measures of dispersion • Measures of skewness 2. inferential (which test for significant differences between groups and/or significant relationships among variables within the sample • t-ratio, chi-square, beta-value
  • 4. Reasons for using statistics • aids in summarizing the results • helps us recognize underlying trends and tendencies in the data • aids in communicating the results to others
  • 6. Descriptive statistics • If we wanted to characterize the students in this class we would find that they are: – Young – From Kentucky – Fit – Male • How young? • How Kentuckian is this class? • How fit is this class? • What is the distribution of males and females?
  • 7. Frequency distribution • The frequency with which observations are assigned to each category or point on a measurement scale. – Most basic form of descriptive statistics – May be expressed as a percentage of the total sample found in each category Source : Reasoning with Statistics, by Frederick Williams & Peter Monge, fifth edition, Harcourt College Publishers.
  • 8. Frequency distribution • The distribution is “read” differently depending upon the measurement level – Nominal scales are read as discrete measurements at each level – Ordinal measures show tendencies, but categories should not be compared – Interval and ratio scales allow for comparison among categories
  • 9. Ancestry of US residents
  • 10. Source: Protecting Children from Harmful Television: TV Ratings and the V-chip Amy I. Nathanson, PhD Lecturer, University of California at Santa Barbara Joanne Cantor, PhD Professor, Communication Arts, University of Wisconsin-Madison
  • 17. Normal distribution • Many characteristics are distributed through the population in a ‘normal’ manner – Normal curves have well-defined statistical properties – Parametric statistics are based on the assumption that the variables are distributed normally • Most commonly used statistics • This is the famous “Bell curve” where many cases fall near the middle of the distribution and few fall very high or very low – I.Q.
  • 19. Measures of central tendency • These measures give us an idea what the ‘typical’ case in a distribution is like • Mode (Mo): the most frequent score in a distribution • good for nominal data • Median (Mdn): the midpoint or midscore in a distribution. • (50% cases above/50% cases below) – insensitive to extreme cases --Interval or ratio Source : Reasoning with Statistics, by Frederick Williams & Peter Monge, fifth edition, Harcourt College Publishers.
  • 20. Measures of central tendency • Mean – The ‘average’ score—sum of all individual scores divided by the number of scores – has a number of useful statistical properties • however, can be sensitive to extreme scores (“outliers”) – many statistics are based on the mean
  • 22. Statistics estimating dispersion • Some statistics look at how widely scattered over the scale the individual scores are • Groups with identical means can be more or less widely dispersed • To find out how the group is distributed, we need to know how far from or close to the mean individual scores are • Like the mean, these statistics are only meaningful for interval or ratio-level measures
  • 23. Estimates of dispersion • Range • Distance between the highest and lowest scores in a distribution; • sensitive to extreme scores; • Can compensate by calculating interquartile range (distance between the 25th and 75th percentile points) which represents the range of scores for the middle half of a distribution Usually used in combination with other measures of dispersion.
  • 26. Variance (S2) – Average of squared distances of individual points from the mean • sample variance – High variance means that most scores are far away from the mean. Low variance indicates that most scores cluster tightly about the mean. – The amount that one score differs from the mean is called its deviation score (deviate) – The sum of all deviation scores in a sample is called the sum of squares Estimates of dispersion
  • 27. A summary statistic of how much scores vary from the mean Square root of the Variance – expressed in the original units of measurement – Represents the average amount of dispersion in a sample – Used in a number of inferential statistics Standard Deviation (SD)
  • 28. Skewness of distributions • Measures look at how lopsided distributions are—how far from the ideal of the normal curve they are • When the median and the mean are different, the distribution is skewed. The greater the difference, the greater the skew. • Distributions that trail away to the left are negatively skewed and those that trail away to the right are positively skewed • If the skewness is extreme, the researcher should either transform the data to make them better resemble a normal curve or else use a different set of statistics— nonparametric statistics—to carry out the analysis
  • 31. Distribution of posting frequency on Usenet
  • 32. So • Descriptive statistics are used to summarize data from individual respondents, etc. – They help to make sense of large numbers of individual responses, to communicate the essence of those responses to others • They focus on typical or average scores, the dispersion of scores over the available responses, and the shape of the response curve