SlideShare a Scribd company logo
Diversity in Datasets: (d) e constructing Descriptive Statistics and Data Visualization  Douglas James Joubert National Institutes of Health Library
Outline Types of Scale Levels of Measurement Descriptive vs. Inferential Statistics Univariate Analysis Graphical Methods for Displaying Data
Before you Survey Consult with a Statistician Vital to your success Great way to collaborate
Analysis Always Follows Design Johnson (2005) Question Hypothesis Experimental Design Samples Data Analysis
Descriptive Statistics Location Spread (Dispersion) Shape of the Distribution Mean Mode Median SD Variance COV Skewness (+ or -) Kurtosis
Levels of Measurement The questions you ask are just as important as what is being measured Consult, confer, and pick apart your hypothesis Results are only as good as your poorest measurement Your measurement will never provide the absolute truth Try to control as much as possible to reduce error Random error – due to chance – either direction Systematic error – due to bias – one direction
Reducing Measurement Error Triangulate Different measures for same construct X2 X1
Types of Scale Nominal or Categorical Mutually exclusive group: gender, sick vs. healthy, remote user vs. library user Used for identification purposes only Cannot be ranked from smallest to largest Ordinal Mutually exclusive group that is also ordered in a meaningful manner Distance between categories is unknown—you cannot say that a person with a job satisfaction of 2 is twice as satisfied as a person rated as a 1
Types of Scale Interval Ordered groups with equal intervals between any two pairs of adjacent classes No absolute zero and you cannot compute ratios, for example, temperature Ratio Interval scale with a true absolute zero, for example, weight You can tell how much larger or smaller one value is compared with another
Hierarchy of Measurement Ratio Interval Ordinal Nominal Trochim (2001) Absolute Zero Distance is meaningful Characteristics can be ordered Classification is arbitrary
Descriptive vs. Inferential Statistics Descriptive (Summary) statistics describe or characterize data in such a way that none of the original information is  lost or distorted 1 Inferential statistics allow one to draw conclusions about a population based on data obtained from a sample Munro (2002) S1 S2 S3 S4 S5 S6 ? ? ? ? ? ? Sample Population
Univariate Descriptive Analysis Allows one to examine each variable separately to check for data inconsistencies, variability of variables Also allows one to check statistical assumptions about the shape of the distribution before moving on to more complex analysis Univariate descriptive statistics can also be used to determine central tendency, variability, skewness, and kurtosis
Graphical Methods for Displaying Data Frequency Distributions Histograms Plots Pareto Charts Boxplots Error Bar Charts
Frequency distributions Frequency distributions are a nice tool for categorizing data into meaningful groups Organizing data in tabular form using classes or frequencies Two main types: Categorical: qualitative data such as gender, treatment group or not, religious affiliation Ungrouped or grouped quantitative data
Categorical Frequency distributions A O B A AB AB A A B B O O O A B AB 16 Total 3 AB 4 O 4 B 5 A Frequency  f Class
Ungrouped Frequency distributions 161 155 103 103 Birth weight data in (oz) 101 100 98 98 89 94 94 93 91 88 88 67 64 64 58 32
Ungrouped Frequency distributions … 1 93 1 91 2 88 1 67 2 64 1 58 1 32 Count (Frequency  f) Birth weight
Grouped Frequency Distribution Grouped frequency distribution is obtained by constructing classes (intervals) for the data If the difference between minimum and maximum values exceed 15 then you need to divide the data into classes Should have a minimum of 5 classes and a maximum of 20 Histogram is a graphical representation of a frequency distribution
Grouped Frequency Distribution Typically grouped frequency distributions will contain: The frequency of the value within each category Relative frequency: The percentage of values within each category based on the total number of cases Valid percent is the percentage of cases in each category based on non-missing scores Cumulative frequency: sum of the frequencies for all values at or below the given value Cumulative relative frequency: sum of the relative frequencies for all values at or below the given value
Grouped Frequency Distribution of CA patients  *=(E2/ $E$ 8)*100, in Excel to force absolute reference 1.00 .1463 .1498 .2439 .2055 .2473 0.0696 rf* 287 245 202 132 73 2 cf 287 Total .9997 42 More .8534 43 40 – 50 .7036 70 30 – 40 .4597 59 20 – 30 .2542 71 10 – 20 .0696 2 0 – 10 crf Frequency Age
Table Tips Use tables to highlight major facts Keep it simple – tables are usually intended to demystify your data, not make it more difficult to understand If you are using a software program to create class intervals make sure the default works with you data Think of your audience – how can I convey my message without losing important data
Table Tips The clustering that best describes the data should be the ultimate guide Too few or too many class intervals will obscure important information about your data Tables used to analyzed data are rarely published
Charts Effective way to give the reader a snapshot of the differences and patterns in a set of data Primary disadvantage to charts is that you lose the details Things to consider when constructing charts Does my data represent a single moment in time (cross sectional) or does my data occur over time (time series) Do I have a qualitative or quantitative variables? If my variable is quantitative, is the variable discrete or continuous? Munro (2002)
Bar Charts For nominal or ordinal data use simple bar charts Simple bar charts you will have spaces between categories Cluster bar charts can be used to represent univariate distributions Cluster bar charts can also be stacked
Simple Bar Chart Nominal data
Stacked Bar Chart You are really just stacking two or more columns into a single new column Compares the percentage that each group contributes to the total across categories Want to have 100% stacked columns so you can compare the percentages in each group
Stacked Bar Chart
Histograms Best for interval and ratio data Represent percentages rather than counts Each histogram has total area of 100% Since this is a range of values no gaps between bars From a descriptive standpoint allows one to look at the distribution of variables Consider grouping the data if range > 15 Height of the vertical axis is important
Histogram of Family Terms
Histogram Std Err Bars   Normal Dist Fit
Histogram: SEM and Normal Distributions Standard error of the mean is the estimate of how much we would expect the mean to vary in a population, given repeated samples Fit distribution (Normal) estimates the parameters of the normal distribution based on the analysis sample
Pareto Charts Pareto chart is a special type of histogram that is arranged from largest to smallest Allows one to determine which values are least important and which values are more important Pareto charts combines a bar chart displaying percentages of categories in the data with a line plot showing cumulative percentages of the categories
Pareto Chart SAS (1990)
2-Way Comparative Pareto Chart  SAS (1990)
Overlay Chart Similar to a scatterplot but…your are only looking at one variable SAS (1989–2004)
Plots Scatterplots look at the relationship between two or more variables Great way to identify outliers Typically the Y-axis is the DV and X-axis the IV Using a control variable allows one to identify different groups For example, the relationship between bp and weight, and controlling for smoking vs. non-smoking
Plots Scatterplots look at the relationship between two or more variables Great way to identify outliers Typically the Y-axis is the DV and X-axis the IV Using a  control variable  allows one to identify different groups For example, the relationship between bp and weight, and controlling for smoking vs. non-smoking Why? Because we are controlling for some factor
Simple Scatterplot SAS (1989–2004)
Simple Scatterplot In correlation, this is the least-square line (scary math, but very important) SAS (1989–2004)
Box-and-Whisker Plots A graphical method based on percentiles Useful for visualizing the distribution of a variable Simultaneously displays the median, the IQR, and the smallest and largest values for a group More compact than a histogram but less revealing Good tool for identifying outliers and extreme values Two common types: Outlier Box Plot and a Quantile Box Plot
Outlier Box Plot Possible Outliers IQR Largest value not an outlier  Smallest value not an outlier  75th 25th 50 th  (median)
Quantile Box Plot
Contact Information Douglas J. Joubert, MLIS Biomedical Informationist National Institutes of Health Library Bldg. 10, Room 1L09A Bethesda, MD 20906-1150 Phone: 301.594.6282 Fax: 301.402.0254 E-mail: joubertd@ors.od.nih.gov E-mail: joubertd@helix.nih.gov http://nihlibrary.nih.gov/
References Johnson, Laura Lee Ph.D (2004). Principles and Practices of Clinical Research (Lecture), NIH. SAS (1990). Common causes of failure during the fabrication of integrated circuits. Data from "Selected SAS/QC Software Examples, Release 6.06, SAS Users Group International Conference, April 2, 1990 pg 383. Munro, B. H. (2001). Statistical methods for health care research (4th ed.). Philadelphia: Lippincott Williams & Wilkins. SAS Institute Inc. (1989-2004). SAS Help Files. Cary: North Carolina.
Ad

More Related Content

What's hot (20)

Data Analysis & Visualization using MS. Excel
Data Analysis & Visualization using MS. ExcelData Analysis & Visualization using MS. Excel
Data Analysis & Visualization using MS. Excel
Frehiwot Mulugeta
 
Normality tests
Normality testsNormality tests
Normality tests
Dr Lipilekha Patnaik
 
"A basic guide to SPSS"
"A basic guide to SPSS""A basic guide to SPSS"
"A basic guide to SPSS"
Bashir7576
 
Introduction to Descriptive Statistics
Introduction to Descriptive StatisticsIntroduction to Descriptive Statistics
Introduction to Descriptive Statistics
Sanju Rusara Seneviratne
 
Data management in Stata
Data management in StataData management in Stata
Data management in Stata
izahn
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
Kapil Dev Ghante
 
Basic Statistics & Data Analysis
Basic Statistics & Data AnalysisBasic Statistics & Data Analysis
Basic Statistics & Data Analysis
Ajendra Sharma
 
Sampling and statistical inference
Sampling and statistical inferenceSampling and statistical inference
Sampling and statistical inference
Bhavik A Shah
 
Introduction To Statistics
Introduction To StatisticsIntroduction To Statistics
Introduction To Statistics
albertlaporte
 
Inferential statistics
Inferential statisticsInferential statistics
Inferential statistics
Dalia El-Shafei
 
Data presentation 2
Data presentation 2Data presentation 2
Data presentation 2
Rawalpindi Medical College
 
(Manual spss)
(Manual spss)(Manual spss)
(Manual spss)
Enas Ahmed
 
Full Lecture Presentation on ANOVA
Full Lecture Presentation on ANOVAFull Lecture Presentation on ANOVA
Full Lecture Presentation on ANOVA
StevegellKololi
 
Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)
YesAnalytics
 
Statistical inference concept, procedure of hypothesis testing
Statistical inference   concept, procedure of hypothesis testingStatistical inference   concept, procedure of hypothesis testing
Statistical inference concept, procedure of hypothesis testing
AmitaChaudhary19
 
Data Analysis and Statistics
Data Analysis and StatisticsData Analysis and Statistics
Data Analysis and Statistics
T.S. Lim
 
General Statistics boa
General Statistics boaGeneral Statistics boa
General Statistics boa
raileeanne
 
Correlation Coefficient
Correlation CoefficientCorrelation Coefficient
Correlation Coefficient
SaadSaif6
 
Statistics:Fundamentals Of Statistics
Statistics:Fundamentals Of StatisticsStatistics:Fundamentals Of Statistics
Statistics:Fundamentals Of Statistics
St Mary's College,Thrissur,Kerala
 
Introduction to Rstudio
Introduction to RstudioIntroduction to Rstudio
Introduction to Rstudio
Olga Scrivner
 
Data Analysis & Visualization using MS. Excel
Data Analysis & Visualization using MS. ExcelData Analysis & Visualization using MS. Excel
Data Analysis & Visualization using MS. Excel
Frehiwot Mulugeta
 
"A basic guide to SPSS"
"A basic guide to SPSS""A basic guide to SPSS"
"A basic guide to SPSS"
Bashir7576
 
Data management in Stata
Data management in StataData management in Stata
Data management in Stata
izahn
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
Kapil Dev Ghante
 
Basic Statistics & Data Analysis
Basic Statistics & Data AnalysisBasic Statistics & Data Analysis
Basic Statistics & Data Analysis
Ajendra Sharma
 
Sampling and statistical inference
Sampling and statistical inferenceSampling and statistical inference
Sampling and statistical inference
Bhavik A Shah
 
Introduction To Statistics
Introduction To StatisticsIntroduction To Statistics
Introduction To Statistics
albertlaporte
 
Full Lecture Presentation on ANOVA
Full Lecture Presentation on ANOVAFull Lecture Presentation on ANOVA
Full Lecture Presentation on ANOVA
StevegellKololi
 
Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)
YesAnalytics
 
Statistical inference concept, procedure of hypothesis testing
Statistical inference   concept, procedure of hypothesis testingStatistical inference   concept, procedure of hypothesis testing
Statistical inference concept, procedure of hypothesis testing
AmitaChaudhary19
 
Data Analysis and Statistics
Data Analysis and StatisticsData Analysis and Statistics
Data Analysis and Statistics
T.S. Lim
 
General Statistics boa
General Statistics boaGeneral Statistics boa
General Statistics boa
raileeanne
 
Correlation Coefficient
Correlation CoefficientCorrelation Coefficient
Correlation Coefficient
SaadSaif6
 
Introduction to Rstudio
Introduction to RstudioIntroduction to Rstudio
Introduction to Rstudio
Olga Scrivner
 

Viewers also liked (20)

Asking the Right Questions of Your Data
Asking the Right Questions of Your DataAsking the Right Questions of Your Data
Asking the Right Questions of Your Data
DataWorks Summit
 
Triangulasi
TriangulasiTriangulasi
Triangulasi
Norhanimah Mahadi
 
Bab iii tamsir
Bab iii tamsirBab iii tamsir
Bab iii tamsir
tanux5792
 
Chapter 2 250110 083240
Chapter 2 250110 083240Chapter 2 250110 083240
Chapter 2 250110 083240
guest25d353
 
Analisis diskriptif
Analisis diskriptifAnalisis diskriptif
Analisis diskriptif
materi-x2
 
Medical Statistics Part-I:Descriptive statistics
Medical Statistics Part-I:Descriptive statisticsMedical Statistics Part-I:Descriptive statistics
Medical Statistics Part-I:Descriptive statistics
https://aiimsbhubaneswar.nic.in/
 
Peta buih
Peta buihPeta buih
Peta buih
Norhanimah Mahadi
 
penelitan deskriptif
penelitan deskriptifpenelitan deskriptif
penelitan deskriptif
Umi Muc
 
Kesahan & kebolehpercayaan pembentukan instrumen, kesahan dan kebolehpercayaa...
Kesahan & kebolehpercayaan pembentukan instrumen, kesahan dan kebolehpercayaa...Kesahan & kebolehpercayaan pembentukan instrumen, kesahan dan kebolehpercayaa...
Kesahan & kebolehpercayaan pembentukan instrumen, kesahan dan kebolehpercayaa...
Muhamad Farhan
 
Apa itu kajian kualitatif
Apa itu kajian kualitatifApa itu kajian kualitatif
Apa itu kajian kualitatif
Rashidah Awang
 
Visualization-Driven Data Aggregation
Visualization-Driven Data AggregationVisualization-Driven Data Aggregation
Visualization-Driven Data Aggregation
Zbigniew Jerzak
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Aileen Balbido
 
Kajian kuantitatif
Kajian kuantitatifKajian kuantitatif
Kajian kuantitatif
Zen Shah
 
Basic Descriptive Statistics
Basic Descriptive StatisticsBasic Descriptive Statistics
Basic Descriptive Statistics
sikojp
 
Measurement of variables IN RESEARCH
Measurement of variables IN RESEARCHMeasurement of variables IN RESEARCH
Measurement of variables IN RESEARCH
Vinold John
 
Kajian tindakan kaedah pengumpulan data kajian tindakan
Kajian tindakan  kaedah pengumpulan data kajian tindakanKajian tindakan  kaedah pengumpulan data kajian tindakan
Kajian tindakan kaedah pengumpulan data kajian tindakan
nym_namrod
 
Nota Penyelidikan Kajian
Nota Penyelidikan KajianNota Penyelidikan Kajian
Nota Penyelidikan Kajian
Ribut Taufan
 
Research Variables types and identification
Research Variables types and identificationResearch Variables types and identification
Research Variables types and identification
aneez103
 
Descriptive Method
Descriptive MethodDescriptive Method
Descriptive Method
Jerome Angelitud Porto
 
Asking the Right Questions of Your Data
Asking the Right Questions of Your DataAsking the Right Questions of Your Data
Asking the Right Questions of Your Data
DataWorks Summit
 
Bab iii tamsir
Bab iii tamsirBab iii tamsir
Bab iii tamsir
tanux5792
 
Chapter 2 250110 083240
Chapter 2 250110 083240Chapter 2 250110 083240
Chapter 2 250110 083240
guest25d353
 
Analisis diskriptif
Analisis diskriptifAnalisis diskriptif
Analisis diskriptif
materi-x2
 
penelitan deskriptif
penelitan deskriptifpenelitan deskriptif
penelitan deskriptif
Umi Muc
 
Kesahan & kebolehpercayaan pembentukan instrumen, kesahan dan kebolehpercayaa...
Kesahan & kebolehpercayaan pembentukan instrumen, kesahan dan kebolehpercayaa...Kesahan & kebolehpercayaan pembentukan instrumen, kesahan dan kebolehpercayaa...
Kesahan & kebolehpercayaan pembentukan instrumen, kesahan dan kebolehpercayaa...
Muhamad Farhan
 
Apa itu kajian kualitatif
Apa itu kajian kualitatifApa itu kajian kualitatif
Apa itu kajian kualitatif
Rashidah Awang
 
Visualization-Driven Data Aggregation
Visualization-Driven Data AggregationVisualization-Driven Data Aggregation
Visualization-Driven Data Aggregation
Zbigniew Jerzak
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Aileen Balbido
 
Kajian kuantitatif
Kajian kuantitatifKajian kuantitatif
Kajian kuantitatif
Zen Shah
 
Basic Descriptive Statistics
Basic Descriptive StatisticsBasic Descriptive Statistics
Basic Descriptive Statistics
sikojp
 
Measurement of variables IN RESEARCH
Measurement of variables IN RESEARCHMeasurement of variables IN RESEARCH
Measurement of variables IN RESEARCH
Vinold John
 
Kajian tindakan kaedah pengumpulan data kajian tindakan
Kajian tindakan  kaedah pengumpulan data kajian tindakanKajian tindakan  kaedah pengumpulan data kajian tindakan
Kajian tindakan kaedah pengumpulan data kajian tindakan
nym_namrod
 
Nota Penyelidikan Kajian
Nota Penyelidikan KajianNota Penyelidikan Kajian
Nota Penyelidikan Kajian
Ribut Taufan
 
Research Variables types and identification
Research Variables types and identificationResearch Variables types and identification
Research Variables types and identification
aneez103
 
Ad

Similar to Descriptive Statistics and Data Visualization (20)

Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734
AbhishekDas15
 
Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734
AbhishekDas15
 
Data presenatation
Data presenatationData presenatation
Data presenatation
singhdharmendra
 
Wynberg girls high-Jade Gibson-maths-data analysis statistics
Wynberg girls high-Jade Gibson-maths-data analysis statisticsWynberg girls high-Jade Gibson-maths-data analysis statistics
Wynberg girls high-Jade Gibson-maths-data analysis statistics
Wynberg Girls High
 
Exploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Exploratory Data Analysis for Biotechnology and Pharmaceutical SciencesExploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Exploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Parag Shah
 
Statistics review
Statistics reviewStatistics review
Statistics review
jpcagphil
 
Class1.ppt
Class1.pptClass1.ppt
Class1.ppt
Gautam G
 
Class1.ppt
Class1.pptClass1.ppt
Class1.ppt
PerumalPitchandi
 
Class1.ppt
Class1.pptClass1.ppt
Class1.ppt
Sandeepkumar628916
 
Introduction to Statistics - Basics of Data - Class 1
Introduction to Statistics - Basics of Data - Class 1Introduction to Statistics - Basics of Data - Class 1
Introduction to Statistics - Basics of Data - Class 1
RajnishSingh367990
 
Class1.ppt
Class1.pptClass1.ppt
Class1.ppt
hanreaz219
 
STATISTICS BASICS INCLUDING DESCRIPTIVE STATISTICS
STATISTICS BASICS INCLUDING DESCRIPTIVE STATISTICSSTATISTICS BASICS INCLUDING DESCRIPTIVE STATISTICS
STATISTICS BASICS INCLUDING DESCRIPTIVE STATISTICS
nagamani651296
 
Introduction to statistics covering the basics
Introduction to statistics covering the basicsIntroduction to statistics covering the basics
Introduction to statistics covering the basics
OptiAgileBusinessSer
 
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Universidad Particular de Loja
 
EDUCATIONAL STATISTICS_Unit_I.ppt
EDUCATIONAL STATISTICS_Unit_I.pptEDUCATIONAL STATISTICS_Unit_I.ppt
EDUCATIONAL STATISTICS_Unit_I.ppt
Sasi Kumar
 
DescribingandPresentingData.ppt
DescribingandPresentingData.pptDescribingandPresentingData.ppt
DescribingandPresentingData.ppt
UpasanaSagarPrajapat
 
Describing data collected and Presenting Data.ppt
Describing data collected and Presenting Data.pptDescribing data collected and Presenting Data.ppt
Describing data collected and Presenting Data.ppt
Ameha3
 
Class1.ppt Class StructureBasics of Statistics
Class1.ppt Class StructureBasics of StatisticsClass1.ppt Class StructureBasics of Statistics
Class1.ppt Class StructureBasics of Statistics
deepanoel
 
Stat11t chapter2
Stat11t chapter2Stat11t chapter2
Stat11t chapter2
raylenepotter
 
Statistics
StatisticsStatistics
Statistics
Deepanshu Sharma
 
Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734
AbhishekDas15
 
Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734
AbhishekDas15
 
Wynberg girls high-Jade Gibson-maths-data analysis statistics
Wynberg girls high-Jade Gibson-maths-data analysis statisticsWynberg girls high-Jade Gibson-maths-data analysis statistics
Wynberg girls high-Jade Gibson-maths-data analysis statistics
Wynberg Girls High
 
Exploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Exploratory Data Analysis for Biotechnology and Pharmaceutical SciencesExploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Exploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Parag Shah
 
Statistics review
Statistics reviewStatistics review
Statistics review
jpcagphil
 
Class1.ppt
Class1.pptClass1.ppt
Class1.ppt
Gautam G
 
Introduction to Statistics - Basics of Data - Class 1
Introduction to Statistics - Basics of Data - Class 1Introduction to Statistics - Basics of Data - Class 1
Introduction to Statistics - Basics of Data - Class 1
RajnishSingh367990
 
STATISTICS BASICS INCLUDING DESCRIPTIVE STATISTICS
STATISTICS BASICS INCLUDING DESCRIPTIVE STATISTICSSTATISTICS BASICS INCLUDING DESCRIPTIVE STATISTICS
STATISTICS BASICS INCLUDING DESCRIPTIVE STATISTICS
nagamani651296
 
Introduction to statistics covering the basics
Introduction to statistics covering the basicsIntroduction to statistics covering the basics
Introduction to statistics covering the basics
OptiAgileBusinessSer
 
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Universidad Particular de Loja
 
EDUCATIONAL STATISTICS_Unit_I.ppt
EDUCATIONAL STATISTICS_Unit_I.pptEDUCATIONAL STATISTICS_Unit_I.ppt
EDUCATIONAL STATISTICS_Unit_I.ppt
Sasi Kumar
 
Describing data collected and Presenting Data.ppt
Describing data collected and Presenting Data.pptDescribing data collected and Presenting Data.ppt
Describing data collected and Presenting Data.ppt
Ameha3
 
Class1.ppt Class StructureBasics of Statistics
Class1.ppt Class StructureBasics of StatisticsClass1.ppt Class StructureBasics of Statistics
Class1.ppt Class StructureBasics of Statistics
deepanoel
 
Ad

More from Douglas Joubert (20)

Developing a library-based data visualization service
Developing a library-based data visualization serviceDeveloping a library-based data visualization service
Developing a library-based data visualization service
Douglas Joubert
 
Developing and Implementing a Technology Hub
Developing and Implementing a Technology HubDeveloping and Implementing a Technology Hub
Developing and Implementing a Technology Hub
Douglas Joubert
 
Developing and implementing a technology hub
Developing and implementing a technology hubDeveloping and implementing a technology hub
Developing and implementing a technology hub
Douglas Joubert
 
Developing a library based data visualization service
Developing a library based data visualization serviceDeveloping a library based data visualization service
Developing a library based data visualization service
Douglas Joubert
 
Analytical Methods for Systematic Review Support
Analytical Methods for Systematic Review SupportAnalytical Methods for Systematic Review Support
Analytical Methods for Systematic Review Support
Douglas Joubert
 
Using Social Technologies for Public Health, 2014
Using Social Technologies for Public Health, 2014Using Social Technologies for Public Health, 2014
Using Social Technologies for Public Health, 2014
Douglas Joubert
 
Keeping up with Public Health Series: A Pilot Project for Public Health Resea...
Keeping up with Public Health Series: A Pilot Project for Public Health Resea...Keeping up with Public Health Series: A Pilot Project for Public Health Resea...
Keeping up with Public Health Series: A Pilot Project for Public Health Resea...
Douglas Joubert
 
2013 Johns Hopkins School of Public Health Lecture
2013 Johns Hopkins School of Public Health Lecture2013 Johns Hopkins School of Public Health Lecture
2013 Johns Hopkins School of Public Health Lecture
Douglas Joubert
 
Developing Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging TechnologiesDeveloping Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging Technologies
Douglas Joubert
 
Developing Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging TechnologiesDeveloping Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging Technologies
Douglas Joubert
 
Social Media Brown-bag
Social Media Brown-bagSocial Media Brown-bag
Social Media Brown-bag
Douglas Joubert
 
Developing a program to use iPads in your library
Developing a program to use iPads in your libraryDeveloping a program to use iPads in your library
Developing a program to use iPads in your library
Douglas Joubert
 
Brave New World: Developing Staff Competencies Around Mobile
Brave New World: Developing Staff Competencies Around MobileBrave New World: Developing Staff Competencies Around Mobile
Brave New World: Developing Staff Competencies Around Mobile
Douglas Joubert
 
Using Social Technologies for Public Health
Using Social Technologies for Public HealthUsing Social Technologies for Public Health
Using Social Technologies for Public Health
Douglas Joubert
 
Research in the Library: An Evidence-based Approach for Making Informed Decis...
Research in the Library: An Evidence-based Approach for Making Informed Decis...Research in the Library: An Evidence-based Approach for Making Informed Decis...
Research in the Library: An Evidence-based Approach for Making Informed Decis...
Douglas Joubert
 
2010 NIH Handheld Users Meeting
2010 NIH Handheld Users Meeting2010 NIH Handheld Users Meeting
2010 NIH Handheld Users Meeting
Douglas Joubert
 
Characterization of genes and proteins of cross-species biological pathways
Characterization of genes and proteins of cross-species biological pathwaysCharacterization of genes and proteins of cross-species biological pathways
Characterization of genes and proteins of cross-species biological pathways
Douglas Joubert
 
Clicking Past Google
Clicking Past GoogleClicking Past Google
Clicking Past Google
Douglas Joubert
 
2006 Catholic University Presentation
2006 Catholic University Presentation2006 Catholic University Presentation
2006 Catholic University Presentation
Douglas Joubert
 
Explorations in bioinformatics
Explorations in bioinformaticsExplorations in bioinformatics
Explorations in bioinformatics
Douglas Joubert
 
Developing a library-based data visualization service
Developing a library-based data visualization serviceDeveloping a library-based data visualization service
Developing a library-based data visualization service
Douglas Joubert
 
Developing and Implementing a Technology Hub
Developing and Implementing a Technology HubDeveloping and Implementing a Technology Hub
Developing and Implementing a Technology Hub
Douglas Joubert
 
Developing and implementing a technology hub
Developing and implementing a technology hubDeveloping and implementing a technology hub
Developing and implementing a technology hub
Douglas Joubert
 
Developing a library based data visualization service
Developing a library based data visualization serviceDeveloping a library based data visualization service
Developing a library based data visualization service
Douglas Joubert
 
Analytical Methods for Systematic Review Support
Analytical Methods for Systematic Review SupportAnalytical Methods for Systematic Review Support
Analytical Methods for Systematic Review Support
Douglas Joubert
 
Using Social Technologies for Public Health, 2014
Using Social Technologies for Public Health, 2014Using Social Technologies for Public Health, 2014
Using Social Technologies for Public Health, 2014
Douglas Joubert
 
Keeping up with Public Health Series: A Pilot Project for Public Health Resea...
Keeping up with Public Health Series: A Pilot Project for Public Health Resea...Keeping up with Public Health Series: A Pilot Project for Public Health Resea...
Keeping up with Public Health Series: A Pilot Project for Public Health Resea...
Douglas Joubert
 
2013 Johns Hopkins School of Public Health Lecture
2013 Johns Hopkins School of Public Health Lecture2013 Johns Hopkins School of Public Health Lecture
2013 Johns Hopkins School of Public Health Lecture
Douglas Joubert
 
Developing Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging TechnologiesDeveloping Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging Technologies
Douglas Joubert
 
Developing Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging TechnologiesDeveloping Staff Competencies in Emerging Technologies
Developing Staff Competencies in Emerging Technologies
Douglas Joubert
 
Developing a program to use iPads in your library
Developing a program to use iPads in your libraryDeveloping a program to use iPads in your library
Developing a program to use iPads in your library
Douglas Joubert
 
Brave New World: Developing Staff Competencies Around Mobile
Brave New World: Developing Staff Competencies Around MobileBrave New World: Developing Staff Competencies Around Mobile
Brave New World: Developing Staff Competencies Around Mobile
Douglas Joubert
 
Using Social Technologies for Public Health
Using Social Technologies for Public HealthUsing Social Technologies for Public Health
Using Social Technologies for Public Health
Douglas Joubert
 
Research in the Library: An Evidence-based Approach for Making Informed Decis...
Research in the Library: An Evidence-based Approach for Making Informed Decis...Research in the Library: An Evidence-based Approach for Making Informed Decis...
Research in the Library: An Evidence-based Approach for Making Informed Decis...
Douglas Joubert
 
2010 NIH Handheld Users Meeting
2010 NIH Handheld Users Meeting2010 NIH Handheld Users Meeting
2010 NIH Handheld Users Meeting
Douglas Joubert
 
Characterization of genes and proteins of cross-species biological pathways
Characterization of genes and proteins of cross-species biological pathwaysCharacterization of genes and proteins of cross-species biological pathways
Characterization of genes and proteins of cross-species biological pathways
Douglas Joubert
 
2006 Catholic University Presentation
2006 Catholic University Presentation2006 Catholic University Presentation
2006 Catholic University Presentation
Douglas Joubert
 
Explorations in bioinformatics
Explorations in bioinformaticsExplorations in bioinformatics
Explorations in bioinformatics
Douglas Joubert
 

Recently uploaded (20)

Lecture 4 INSECT CUTICLE and moulting.pptx
Lecture 4 INSECT CUTICLE and moulting.pptxLecture 4 INSECT CUTICLE and moulting.pptx
Lecture 4 INSECT CUTICLE and moulting.pptx
Arshad Shaikh
 
Computer crime and Legal issues Computer crime and Legal issues
Computer crime and Legal issues Computer crime and Legal issuesComputer crime and Legal issues Computer crime and Legal issues
Computer crime and Legal issues Computer crime and Legal issues
Abhijit Bodhe
 
dynastic art of the Pallava dynasty south India
dynastic art of the Pallava dynasty south Indiadynastic art of the Pallava dynasty south India
dynastic art of the Pallava dynasty south India
PrachiSontakke5
 
How to Configure Public Holidays & Mandatory Days in Odoo 18
How to Configure Public Holidays & Mandatory Days in Odoo 18How to Configure Public Holidays & Mandatory Days in Odoo 18
How to Configure Public Holidays & Mandatory Days in Odoo 18
Celine George
 
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulsepulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
sushreesangita003
 
YSPH VMOC Special Report - Measles Outbreak Southwest US 5-3-2025.pptx
YSPH VMOC Special Report - Measles Outbreak  Southwest US 5-3-2025.pptxYSPH VMOC Special Report - Measles Outbreak  Southwest US 5-3-2025.pptx
YSPH VMOC Special Report - Measles Outbreak Southwest US 5-3-2025.pptx
Yale School of Public Health - The Virtual Medical Operations Center (VMOC)
 
Myopathies (muscle disorders) for undergraduate
Myopathies (muscle disorders) for undergraduateMyopathies (muscle disorders) for undergraduate
Myopathies (muscle disorders) for undergraduate
Mohamed Rizk Khodair
 
Form View Attributes in Odoo 18 - Odoo Slides
Form View Attributes in Odoo 18 - Odoo SlidesForm View Attributes in Odoo 18 - Odoo Slides
Form View Attributes in Odoo 18 - Odoo Slides
Celine George
 
Rococo versus Neoclassicism. The artistic styles of the 18th century
Rococo versus Neoclassicism. The artistic styles of the 18th centuryRococo versus Neoclassicism. The artistic styles of the 18th century
Rococo versus Neoclassicism. The artistic styles of the 18th century
Gema
 
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptxSCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
Ronisha Das
 
The History of Kashmir Karkota Dynasty NEP.pptx
The History of Kashmir Karkota Dynasty NEP.pptxThe History of Kashmir Karkota Dynasty NEP.pptx
The History of Kashmir Karkota Dynasty NEP.pptx
Arya Mahila P. G. College, Banaras Hindu University, Varanasi, India.
 
Herbs Used in Cosmetic Formulations .pptx
Herbs Used in Cosmetic Formulations .pptxHerbs Used in Cosmetic Formulations .pptx
Herbs Used in Cosmetic Formulations .pptx
RAJU THENGE
 
Rock Art As a Source of Ancient Indian History
Rock Art As a Source of Ancient Indian HistoryRock Art As a Source of Ancient Indian History
Rock Art As a Source of Ancient Indian History
Virag Sontakke
 
Grade 3 - English - Printable Worksheet (PDF Format)
Grade 3 - English - Printable Worksheet  (PDF Format)Grade 3 - English - Printable Worksheet  (PDF Format)
Grade 3 - English - Printable Worksheet (PDF Format)
Sritoma Majumder
 
03#UNTAGGED. Generosity in architecture.
03#UNTAGGED. Generosity in architecture.03#UNTAGGED. Generosity in architecture.
03#UNTAGGED. Generosity in architecture.
MCH
 
CNS infections (encephalitis, meningitis & Brain abscess
CNS infections (encephalitis, meningitis & Brain abscessCNS infections (encephalitis, meningitis & Brain abscess
CNS infections (encephalitis, meningitis & Brain abscess
Mohamed Rizk Khodair
 
What is the Philosophy of Statistics? (and how I was drawn to it)
What is the Philosophy of Statistics? (and how I was drawn to it)What is the Philosophy of Statistics? (and how I was drawn to it)
What is the Philosophy of Statistics? (and how I was drawn to it)
jemille6
 
Exercise Physiology MCQS By DR. NASIR MUSTAFA
Exercise Physiology MCQS By DR. NASIR MUSTAFAExercise Physiology MCQS By DR. NASIR MUSTAFA
Exercise Physiology MCQS By DR. NASIR MUSTAFA
Dr. Nasir Mustafa
 
Grade 2 - Mathematics - Printable Worksheet
Grade 2 - Mathematics - Printable WorksheetGrade 2 - Mathematics - Printable Worksheet
Grade 2 - Mathematics - Printable Worksheet
Sritoma Majumder
 
Junction Field Effect Transistors (JFET)
Junction Field Effect Transistors (JFET)Junction Field Effect Transistors (JFET)
Junction Field Effect Transistors (JFET)
GS Virdi
 
Lecture 4 INSECT CUTICLE and moulting.pptx
Lecture 4 INSECT CUTICLE and moulting.pptxLecture 4 INSECT CUTICLE and moulting.pptx
Lecture 4 INSECT CUTICLE and moulting.pptx
Arshad Shaikh
 
Computer crime and Legal issues Computer crime and Legal issues
Computer crime and Legal issues Computer crime and Legal issuesComputer crime and Legal issues Computer crime and Legal issues
Computer crime and Legal issues Computer crime and Legal issues
Abhijit Bodhe
 
dynastic art of the Pallava dynasty south India
dynastic art of the Pallava dynasty south Indiadynastic art of the Pallava dynasty south India
dynastic art of the Pallava dynasty south India
PrachiSontakke5
 
How to Configure Public Holidays & Mandatory Days in Odoo 18
How to Configure Public Holidays & Mandatory Days in Odoo 18How to Configure Public Holidays & Mandatory Days in Odoo 18
How to Configure Public Holidays & Mandatory Days in Odoo 18
Celine George
 
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulsepulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
sushreesangita003
 
Myopathies (muscle disorders) for undergraduate
Myopathies (muscle disorders) for undergraduateMyopathies (muscle disorders) for undergraduate
Myopathies (muscle disorders) for undergraduate
Mohamed Rizk Khodair
 
Form View Attributes in Odoo 18 - Odoo Slides
Form View Attributes in Odoo 18 - Odoo SlidesForm View Attributes in Odoo 18 - Odoo Slides
Form View Attributes in Odoo 18 - Odoo Slides
Celine George
 
Rococo versus Neoclassicism. The artistic styles of the 18th century
Rococo versus Neoclassicism. The artistic styles of the 18th centuryRococo versus Neoclassicism. The artistic styles of the 18th century
Rococo versus Neoclassicism. The artistic styles of the 18th century
Gema
 
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptxSCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
Ronisha Das
 
Herbs Used in Cosmetic Formulations .pptx
Herbs Used in Cosmetic Formulations .pptxHerbs Used in Cosmetic Formulations .pptx
Herbs Used in Cosmetic Formulations .pptx
RAJU THENGE
 
Rock Art As a Source of Ancient Indian History
Rock Art As a Source of Ancient Indian HistoryRock Art As a Source of Ancient Indian History
Rock Art As a Source of Ancient Indian History
Virag Sontakke
 
Grade 3 - English - Printable Worksheet (PDF Format)
Grade 3 - English - Printable Worksheet  (PDF Format)Grade 3 - English - Printable Worksheet  (PDF Format)
Grade 3 - English - Printable Worksheet (PDF Format)
Sritoma Majumder
 
03#UNTAGGED. Generosity in architecture.
03#UNTAGGED. Generosity in architecture.03#UNTAGGED. Generosity in architecture.
03#UNTAGGED. Generosity in architecture.
MCH
 
CNS infections (encephalitis, meningitis & Brain abscess
CNS infections (encephalitis, meningitis & Brain abscessCNS infections (encephalitis, meningitis & Brain abscess
CNS infections (encephalitis, meningitis & Brain abscess
Mohamed Rizk Khodair
 
What is the Philosophy of Statistics? (and how I was drawn to it)
What is the Philosophy of Statistics? (and how I was drawn to it)What is the Philosophy of Statistics? (and how I was drawn to it)
What is the Philosophy of Statistics? (and how I was drawn to it)
jemille6
 
Exercise Physiology MCQS By DR. NASIR MUSTAFA
Exercise Physiology MCQS By DR. NASIR MUSTAFAExercise Physiology MCQS By DR. NASIR MUSTAFA
Exercise Physiology MCQS By DR. NASIR MUSTAFA
Dr. Nasir Mustafa
 
Grade 2 - Mathematics - Printable Worksheet
Grade 2 - Mathematics - Printable WorksheetGrade 2 - Mathematics - Printable Worksheet
Grade 2 - Mathematics - Printable Worksheet
Sritoma Majumder
 
Junction Field Effect Transistors (JFET)
Junction Field Effect Transistors (JFET)Junction Field Effect Transistors (JFET)
Junction Field Effect Transistors (JFET)
GS Virdi
 

Descriptive Statistics and Data Visualization

  • 1. Diversity in Datasets: (d) e constructing Descriptive Statistics and Data Visualization Douglas James Joubert National Institutes of Health Library
  • 2. Outline Types of Scale Levels of Measurement Descriptive vs. Inferential Statistics Univariate Analysis Graphical Methods for Displaying Data
  • 3. Before you Survey Consult with a Statistician Vital to your success Great way to collaborate
  • 4. Analysis Always Follows Design Johnson (2005) Question Hypothesis Experimental Design Samples Data Analysis
  • 5. Descriptive Statistics Location Spread (Dispersion) Shape of the Distribution Mean Mode Median SD Variance COV Skewness (+ or -) Kurtosis
  • 6. Levels of Measurement The questions you ask are just as important as what is being measured Consult, confer, and pick apart your hypothesis Results are only as good as your poorest measurement Your measurement will never provide the absolute truth Try to control as much as possible to reduce error Random error – due to chance – either direction Systematic error – due to bias – one direction
  • 7. Reducing Measurement Error Triangulate Different measures for same construct X2 X1
  • 8. Types of Scale Nominal or Categorical Mutually exclusive group: gender, sick vs. healthy, remote user vs. library user Used for identification purposes only Cannot be ranked from smallest to largest Ordinal Mutually exclusive group that is also ordered in a meaningful manner Distance between categories is unknown—you cannot say that a person with a job satisfaction of 2 is twice as satisfied as a person rated as a 1
  • 9. Types of Scale Interval Ordered groups with equal intervals between any two pairs of adjacent classes No absolute zero and you cannot compute ratios, for example, temperature Ratio Interval scale with a true absolute zero, for example, weight You can tell how much larger or smaller one value is compared with another
  • 10. Hierarchy of Measurement Ratio Interval Ordinal Nominal Trochim (2001) Absolute Zero Distance is meaningful Characteristics can be ordered Classification is arbitrary
  • 11. Descriptive vs. Inferential Statistics Descriptive (Summary) statistics describe or characterize data in such a way that none of the original information is lost or distorted 1 Inferential statistics allow one to draw conclusions about a population based on data obtained from a sample Munro (2002) S1 S2 S3 S4 S5 S6 ? ? ? ? ? ? Sample Population
  • 12. Univariate Descriptive Analysis Allows one to examine each variable separately to check for data inconsistencies, variability of variables Also allows one to check statistical assumptions about the shape of the distribution before moving on to more complex analysis Univariate descriptive statistics can also be used to determine central tendency, variability, skewness, and kurtosis
  • 13. Graphical Methods for Displaying Data Frequency Distributions Histograms Plots Pareto Charts Boxplots Error Bar Charts
  • 14. Frequency distributions Frequency distributions are a nice tool for categorizing data into meaningful groups Organizing data in tabular form using classes or frequencies Two main types: Categorical: qualitative data such as gender, treatment group or not, religious affiliation Ungrouped or grouped quantitative data
  • 15. Categorical Frequency distributions A O B A AB AB A A B B O O O A B AB 16 Total 3 AB 4 O 4 B 5 A Frequency f Class
  • 16. Ungrouped Frequency distributions 161 155 103 103 Birth weight data in (oz) 101 100 98 98 89 94 94 93 91 88 88 67 64 64 58 32
  • 17. Ungrouped Frequency distributions … 1 93 1 91 2 88 1 67 2 64 1 58 1 32 Count (Frequency f) Birth weight
  • 18. Grouped Frequency Distribution Grouped frequency distribution is obtained by constructing classes (intervals) for the data If the difference between minimum and maximum values exceed 15 then you need to divide the data into classes Should have a minimum of 5 classes and a maximum of 20 Histogram is a graphical representation of a frequency distribution
  • 19. Grouped Frequency Distribution Typically grouped frequency distributions will contain: The frequency of the value within each category Relative frequency: The percentage of values within each category based on the total number of cases Valid percent is the percentage of cases in each category based on non-missing scores Cumulative frequency: sum of the frequencies for all values at or below the given value Cumulative relative frequency: sum of the relative frequencies for all values at or below the given value
  • 20. Grouped Frequency Distribution of CA patients *=(E2/ $E$ 8)*100, in Excel to force absolute reference 1.00 .1463 .1498 .2439 .2055 .2473 0.0696 rf* 287 245 202 132 73 2 cf 287 Total .9997 42 More .8534 43 40 – 50 .7036 70 30 – 40 .4597 59 20 – 30 .2542 71 10 – 20 .0696 2 0 – 10 crf Frequency Age
  • 21. Table Tips Use tables to highlight major facts Keep it simple – tables are usually intended to demystify your data, not make it more difficult to understand If you are using a software program to create class intervals make sure the default works with you data Think of your audience – how can I convey my message without losing important data
  • 22. Table Tips The clustering that best describes the data should be the ultimate guide Too few or too many class intervals will obscure important information about your data Tables used to analyzed data are rarely published
  • 23. Charts Effective way to give the reader a snapshot of the differences and patterns in a set of data Primary disadvantage to charts is that you lose the details Things to consider when constructing charts Does my data represent a single moment in time (cross sectional) or does my data occur over time (time series) Do I have a qualitative or quantitative variables? If my variable is quantitative, is the variable discrete or continuous? Munro (2002)
  • 24. Bar Charts For nominal or ordinal data use simple bar charts Simple bar charts you will have spaces between categories Cluster bar charts can be used to represent univariate distributions Cluster bar charts can also be stacked
  • 25. Simple Bar Chart Nominal data
  • 26. Stacked Bar Chart You are really just stacking two or more columns into a single new column Compares the percentage that each group contributes to the total across categories Want to have 100% stacked columns so you can compare the percentages in each group
  • 28. Histograms Best for interval and ratio data Represent percentages rather than counts Each histogram has total area of 100% Since this is a range of values no gaps between bars From a descriptive standpoint allows one to look at the distribution of variables Consider grouping the data if range > 15 Height of the vertical axis is important
  • 30. Histogram Std Err Bars Normal Dist Fit
  • 31. Histogram: SEM and Normal Distributions Standard error of the mean is the estimate of how much we would expect the mean to vary in a population, given repeated samples Fit distribution (Normal) estimates the parameters of the normal distribution based on the analysis sample
  • 32. Pareto Charts Pareto chart is a special type of histogram that is arranged from largest to smallest Allows one to determine which values are least important and which values are more important Pareto charts combines a bar chart displaying percentages of categories in the data with a line plot showing cumulative percentages of the categories
  • 34. 2-Way Comparative Pareto Chart SAS (1990)
  • 35. Overlay Chart Similar to a scatterplot but…your are only looking at one variable SAS (1989–2004)
  • 36. Plots Scatterplots look at the relationship between two or more variables Great way to identify outliers Typically the Y-axis is the DV and X-axis the IV Using a control variable allows one to identify different groups For example, the relationship between bp and weight, and controlling for smoking vs. non-smoking
  • 37. Plots Scatterplots look at the relationship between two or more variables Great way to identify outliers Typically the Y-axis is the DV and X-axis the IV Using a control variable allows one to identify different groups For example, the relationship between bp and weight, and controlling for smoking vs. non-smoking Why? Because we are controlling for some factor
  • 38. Simple Scatterplot SAS (1989–2004)
  • 39. Simple Scatterplot In correlation, this is the least-square line (scary math, but very important) SAS (1989–2004)
  • 40. Box-and-Whisker Plots A graphical method based on percentiles Useful for visualizing the distribution of a variable Simultaneously displays the median, the IQR, and the smallest and largest values for a group More compact than a histogram but less revealing Good tool for identifying outliers and extreme values Two common types: Outlier Box Plot and a Quantile Box Plot
  • 41. Outlier Box Plot Possible Outliers IQR Largest value not an outlier Smallest value not an outlier 75th 25th 50 th (median)
  • 43. Contact Information Douglas J. Joubert, MLIS Biomedical Informationist National Institutes of Health Library Bldg. 10, Room 1L09A Bethesda, MD 20906-1150 Phone: 301.594.6282 Fax: 301.402.0254 E-mail: joubertd@ors.od.nih.gov E-mail: joubertd@helix.nih.gov http://nihlibrary.nih.gov/
  • 44. References Johnson, Laura Lee Ph.D (2004). Principles and Practices of Clinical Research (Lecture), NIH. SAS (1990). Common causes of failure during the fabrication of integrated circuits. Data from "Selected SAS/QC Software Examples, Release 6.06, SAS Users Group International Conference, April 2, 1990 pg 383. Munro, B. H. (2001). Statistical methods for health care research (4th ed.). Philadelphia: Lippincott Williams & Wilkins. SAS Institute Inc. (1989-2004). SAS Help Files. Cary: North Carolina.