SlideShare a Scribd company logo
Introduction
Summary Statistics
Definition
Summary statistics are used to summarize a
set of observations in order to
communicate as much as information
about the data as possible. It is part of
descriptive statistics and are used to
basically summarize or describe a set of
observations.
Rupak Roy
Example
The weight of the population are
45 kg
57kg
72 kg
52 kg
Now what we want here is the summary of
weight of the population , we can say it is the
average weight of the population is 56.5 kg and
now we can describe the population in the
simplest way as possible.
Rupak Roy
Types
Summary statistics
Measures of Central
Tendency
1 . Mean
2 . Median
3 . Mode
5 . Geometric Mean
Measures of
Dispersion
1. Standard
Deviation
2. Variance
3. Interquartile
Range
Others
1. Co efficient
2. Skewness
3. Kurtosis
4. Probability
Distributions.
5. Distribution plot
Rupak Roy
Definition
 Measures of central tendency : is the value that describes
which group of data clusters around a central value. In
simple words , it is a way to describe the center of a data
set. Again what is center of data ? A single number that
summarizes the entire dataset using techniques such as
mean/average or median of the dataset.
 Measures of Dispersion: “dispersion (also
called variability, scatter, or spread) is the extent to which
a distribution of data is stretched or squeezed.”
Here in the graph we can see the
distribution of data (assume population)
is more stretched at the right side
ranging from 50 to 80
Measures of Central Tendency
1. Mean : is the average of observations. Most effective
when data is not heavily skewed.
2. Median: represents the middle value of the dataset.
Useful for skewed data.
We will talk about skewed data in the upcoming
slides.
3. Mode: means max no of times the data has occurred.
4. Geometric mean: nth root of a product of n numbers.
It is used when we want to get the average rate of the
event and the event rate is determined by multiplication.
For example growth of a bank account per year in a
ABC bank is calculated by geometric mean since the
growth event rate is determined by multiplying the
amount of a bank account by the percentage of growth.
then we use geometric mean.
Rupak Roy
 Formula for calculating Geometric Mean
GM =
example: Geometric Mean of 23,56,66 ?
3 23 * 56*66
3 85008 = 43.9696761which means 3times of 43.9696761
is 85008
Note:
if one of the observation in the event is zero , Geometric
Mean becomes Zero and also it doesn’t works with
negative numbers like -1 , -4 , -5 and so on.
Rupak Roy
Calculation of Mode ; <- Delta
For ungrouped data = Max no of items
Example : 23,45,76,33,54,33,76,33 Therefore Mode = 33
For grouped data = = {(L + Delta 1) / Delta 1+Detal2 } * i
Where Delta 1 = f1 +f0
and Delta 2 = f1- f2
Nowadays, we don’t have to worry about the calculation, as in
any statistical software's like R, excel it will automatically calculate
the intense calculation for large amount of data but
for more in-depth information you can visit this website.
https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
Measures of dispersion
Standard Deviation is basically a measure of how near or far the
observations are from the mean.
Variance: the fact or quality of being different , divergent or
inconsistent. A value of zero means that there is no variability , all the
values in the data set are the same.
Interquartile Range: is a measure of variability ,
by dividing a data set into parts that is quartiles .
Say
Q1 is the middle value in the first half of the data set.
Q2 is the median value .
Q3 is the middle value in the second half of the
rank-ordered data set.
There interquartile range = Q3 – Q1
Skewness – refers to the lack of symmetry or imbalance in data
distribution.
In a symmetric distribution the data is
normally distributed where mean,
median, mode is at the same point.
However in real life data is never perfectly
distributed, hence we call it skewed data.
If the Left side has longer tail then the mass
distribution of data is concentrated on the right
side which is known as negatively skewed.
If the Right side has longer tail then the
mass distribution of data is
concentrated on the left side is
known as positive skewed.
Here is the summary of all the skewness as shown in the figure below.
Example (skewed data)
Temp(*c)
10
40
35
33
35
Mean = 153/5 = 30.6, if we apply mean is 30.6
which is incorrect since we can see maximum
number of values are above 35.
So we have to use median For Ungrouped
data ((n+1)/2)th
That will be ((5+1)/2)th = 6/2 = 3
i.e. 3th term ie 35.
For grouped data:
where L, lower class boundary of the group containing the group.
B, Cumulative frequency of the groups
G , Frequency of the median group
W , width/Range of the group
Again, we don’t have to worry about the calculation, as in any statistical software's like R
, excel it will automatically calculate the intense calculation for large amount of data
but for more in-depth information you can visit this website.
https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
Kurtosis : is a measure of whether data are peaked or flat relative to
normal distribution
(+) Leptokurtic
(-) PlatyKurtic
(0) Meskurtic
(+) Leptokurtic
This means the distribution is more clustered near the mean and has a
relativity less standard deviation
(-) PlatyKurtic
Where the distribution is less clustered around the mean and a standard
deviation more then Leptokurtic
(0) Meskurtic is typically measured with respect to the normal
distribution. Meskurtic has tails similar to normal distribution i.e neither
high nor low, rather it is consider to be a baseline for the other two’s.
 Now how to check the data is skewed or not
in Excel:
=skew(select the range of values/numbers)
=skew(10.24,9.48……….-0.42,-0.95)
= - 0.27 means Negatively skewed.
And to check the Kurtosis in Excel
=kurt(select the values/numbers)
=kurt(10.24,9.48……….-0.42,-0.95)
= -1.6 means it is PlatyKurtic
Recap
What we have learned ?
Measures of central tendency,
Measures of dispersion,
Measure of risk,
Next we will see how to compute this theory in
practical and analyze any data using our
everyday simple tools like Excel.
Rupak Roy
To be continued ………
Ad

More Related Content

What's hot (20)

Incidence or incidence rate (Epidemiology short lecture)
Incidence or incidence rate (Epidemiology short lecture)Incidence or incidence rate (Epidemiology short lecture)
Incidence or incidence rate (Epidemiology short lecture)
Muhammad Akbar Rashid Qadri
 
Basic Statistics & Data Analysis
Basic Statistics & Data AnalysisBasic Statistics & Data Analysis
Basic Statistics & Data Analysis
Ajendra Sharma
 
Standard normal distribution
Standard normal distributionStandard normal distribution
Standard normal distribution
Nadeem Uddin
 
ODDS RATIO AND RELATIVE RISK EVALUATION
ODDS RATIO AND RELATIVE RISK EVALUATIONODDS RATIO AND RELATIVE RISK EVALUATION
ODDS RATIO AND RELATIVE RISK EVALUATION
Kanhu Charan
 
Ch4 Confidence Interval
Ch4 Confidence IntervalCh4 Confidence Interval
Ch4 Confidence Interval
Farhan Alfin
 
Biostatistics
BiostatisticsBiostatistics
Biostatistics
anju mathew
 
Survival analysis & Kaplan Meire
Survival analysis & Kaplan MeireSurvival analysis & Kaplan Meire
Survival analysis & Kaplan Meire
Dr Athar Khan
 
Standardization of rates by Dr. Basil Tumaini
Standardization of rates by Dr. Basil TumainiStandardization of rates by Dr. Basil Tumaini
Standardization of rates by Dr. Basil Tumaini
Basil Tumaini
 
Sample Size Determination
Sample Size Determination Sample Size Determination
Sample Size Determination
chanu Bhattacharya
 
Poisson regression models for count data
Poisson regression models for count dataPoisson regression models for count data
Poisson regression models for count data
University of Southampton
 
Quartile in Statistics
Quartile in StatisticsQuartile in Statistics
Quartile in Statistics
HennaAnsari
 
BIOSTATISTICS
BIOSTATISTICSBIOSTATISTICS
BIOSTATISTICS
Dr. Murtaza Kamal MRCPCH,MD,DNB,DrNB Ped Cardiology
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distribution
Nilanjan Bhaumik
 
INTRODUCTION TO BIO STATISTICS
INTRODUCTION TO BIO STATISTICS INTRODUCTION TO BIO STATISTICS
INTRODUCTION TO BIO STATISTICS
Meklelle university
 
Probability distribution
Probability distributionProbability distribution
Probability distribution
Rohit kumar
 
Biostatistics
BiostatisticsBiostatistics
Biostatistics
khushbu mishra
 
SURVIVAL ANALYSIS.ppt
SURVIVAL ANALYSIS.pptSURVIVAL ANALYSIS.ppt
SURVIVAL ANALYSIS.ppt
mbang ernest
 
Descriptive and Analytical Epidemiology
Descriptive and Analytical Epidemiology Descriptive and Analytical Epidemiology
Descriptive and Analytical Epidemiology
coolboy101pk
 
Introduction to biostatistics
Introduction to biostatisticsIntroduction to biostatistics
Introduction to biostatistics
shivamdixit57
 
Variables (Statistics )
Variables (Statistics ) Variables (Statistics )
Variables (Statistics )
Md. Asif Hassan
 
Incidence or incidence rate (Epidemiology short lecture)
Incidence or incidence rate (Epidemiology short lecture)Incidence or incidence rate (Epidemiology short lecture)
Incidence or incidence rate (Epidemiology short lecture)
Muhammad Akbar Rashid Qadri
 
Basic Statistics & Data Analysis
Basic Statistics & Data AnalysisBasic Statistics & Data Analysis
Basic Statistics & Data Analysis
Ajendra Sharma
 
Standard normal distribution
Standard normal distributionStandard normal distribution
Standard normal distribution
Nadeem Uddin
 
ODDS RATIO AND RELATIVE RISK EVALUATION
ODDS RATIO AND RELATIVE RISK EVALUATIONODDS RATIO AND RELATIVE RISK EVALUATION
ODDS RATIO AND RELATIVE RISK EVALUATION
Kanhu Charan
 
Ch4 Confidence Interval
Ch4 Confidence IntervalCh4 Confidence Interval
Ch4 Confidence Interval
Farhan Alfin
 
Survival analysis & Kaplan Meire
Survival analysis & Kaplan MeireSurvival analysis & Kaplan Meire
Survival analysis & Kaplan Meire
Dr Athar Khan
 
Standardization of rates by Dr. Basil Tumaini
Standardization of rates by Dr. Basil TumainiStandardization of rates by Dr. Basil Tumaini
Standardization of rates by Dr. Basil Tumaini
Basil Tumaini
 
Quartile in Statistics
Quartile in StatisticsQuartile in Statistics
Quartile in Statistics
HennaAnsari
 
Probability distribution
Probability distributionProbability distribution
Probability distribution
Rohit kumar
 
SURVIVAL ANALYSIS.ppt
SURVIVAL ANALYSIS.pptSURVIVAL ANALYSIS.ppt
SURVIVAL ANALYSIS.ppt
mbang ernest
 
Descriptive and Analytical Epidemiology
Descriptive and Analytical Epidemiology Descriptive and Analytical Epidemiology
Descriptive and Analytical Epidemiology
coolboy101pk
 
Introduction to biostatistics
Introduction to biostatisticsIntroduction to biostatistics
Introduction to biostatistics
shivamdixit57
 
Variables (Statistics )
Variables (Statistics ) Variables (Statistics )
Variables (Statistics )
Md. Asif Hassan
 

Similar to Summary statistics (20)

Types of Statistics
Types of Statistics Types of Statistics
Types of Statistics
Rupak Roy
 
1 descriptive statistics
1 descriptive statistics1 descriptive statistics
1 descriptive statistics
Sanu Kumar
 
Basic statistics
Basic statisticsBasic statistics
Basic statistics
Seth Anandaram Jaipuria College
 
Descriptive Statistics.pptx
Descriptive Statistics.pptxDescriptive Statistics.pptx
Descriptive Statistics.pptx
Shashank Mishra
 
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
EqraBaig
 
UNIT 3-1.pptx of biostatistics nursing 6th sem
UNIT 3-1.pptx of biostatistics nursing 6th semUNIT 3-1.pptx of biostatistics nursing 6th sem
UNIT 3-1.pptx of biostatistics nursing 6th sem
hashirmalik9002
 
Statistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptxStatistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptx
YollyCalamba
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
Mmedsc Hahm
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of data
drasifk
 
Lect 3 background mathematics
Lect 3 background mathematicsLect 3 background mathematics
Lect 3 background mathematics
hktripathy
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
Prithwis Mukerjee
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
Prithwis Mukerjee
 
Basic Statistical Concepts in Machine Learning.pptx
Basic Statistical Concepts in Machine Learning.pptxBasic Statistical Concepts in Machine Learning.pptx
Basic Statistical Concepts in Machine Learning.pptx
bajajrishabh96tech
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
Vanmala Buchke
 
3. measures of central tendency
3. measures of central tendency3. measures of central tendency
3. measures of central tendency
renz50
 
Lect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data MiningLect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data Mining
hktripathy
 
Central tendancy 4
Central tendancy 4Central tendancy 4
Central tendancy 4
Sundar B N
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptx
Anusuya123
 
SUMMARY MEASURES.pdf
SUMMARY MEASURES.pdfSUMMARY MEASURES.pdf
SUMMARY MEASURES.pdf
GillaMarieLeopardas1
 
Basic statisctis -Anandh Shankar
Basic statisctis -Anandh ShankarBasic statisctis -Anandh Shankar
Basic statisctis -Anandh Shankar
Anandh Shankar Sundararajan
 
Types of Statistics
Types of Statistics Types of Statistics
Types of Statistics
Rupak Roy
 
1 descriptive statistics
1 descriptive statistics1 descriptive statistics
1 descriptive statistics
Sanu Kumar
 
Descriptive Statistics.pptx
Descriptive Statistics.pptxDescriptive Statistics.pptx
Descriptive Statistics.pptx
Shashank Mishra
 
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
EqraBaig
 
UNIT 3-1.pptx of biostatistics nursing 6th sem
UNIT 3-1.pptx of biostatistics nursing 6th semUNIT 3-1.pptx of biostatistics nursing 6th sem
UNIT 3-1.pptx of biostatistics nursing 6th sem
hashirmalik9002
 
Statistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptxStatistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptx
YollyCalamba
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
Mmedsc Hahm
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of data
drasifk
 
Lect 3 background mathematics
Lect 3 background mathematicsLect 3 background mathematics
Lect 3 background mathematics
hktripathy
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
Prithwis Mukerjee
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
Prithwis Mukerjee
 
Basic Statistical Concepts in Machine Learning.pptx
Basic Statistical Concepts in Machine Learning.pptxBasic Statistical Concepts in Machine Learning.pptx
Basic Statistical Concepts in Machine Learning.pptx
bajajrishabh96tech
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
Vanmala Buchke
 
3. measures of central tendency
3. measures of central tendency3. measures of central tendency
3. measures of central tendency
renz50
 
Lect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data MiningLect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data Mining
hktripathy
 
Central tendancy 4
Central tendancy 4Central tendancy 4
Central tendancy 4
Sundar B N
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptx
Anusuya123
 
Ad

More from Rupak Roy (20)

Hierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLPHierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLP
Rupak Roy
 
Clustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLPClustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLP
Rupak Roy
 
Network Analysis - NLP
Network Analysis  - NLPNetwork Analysis  - NLP
Network Analysis - NLP
Rupak Roy
 
Topic Modeling - NLP
Topic Modeling - NLPTopic Modeling - NLP
Topic Modeling - NLP
Rupak Roy
 
Sentiment Analysis Practical Steps
Sentiment Analysis Practical StepsSentiment Analysis Practical Steps
Sentiment Analysis Practical Steps
Rupak Roy
 
NLP - Sentiment Analysis
NLP - Sentiment AnalysisNLP - Sentiment Analysis
NLP - Sentiment Analysis
Rupak Roy
 
Text Mining using Regular Expressions
Text Mining using Regular ExpressionsText Mining using Regular Expressions
Text Mining using Regular Expressions
Rupak Roy
 
Introduction to Text Mining
Introduction to Text Mining Introduction to Text Mining
Introduction to Text Mining
Rupak Roy
 
Apache Hbase Architecture
Apache Hbase ArchitectureApache Hbase Architecture
Apache Hbase Architecture
Rupak Roy
 
Introduction to Hbase
Introduction to Hbase Introduction to Hbase
Introduction to Hbase
Rupak Roy
 
Apache Hive Table Partition and HQL
Apache Hive Table Partition and HQLApache Hive Table Partition and HQL
Apache Hive Table Partition and HQL
Rupak Roy
 
Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export
Rupak Roy
 
Introductive to Hive
Introductive to Hive Introductive to Hive
Introductive to Hive
Rupak Roy
 
Scoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMSScoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMS
Rupak Roy
 
Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode
Rupak Roy
 
Introduction to scoop and its functions
Introduction to scoop and its functionsIntroduction to scoop and its functions
Introduction to scoop and its functions
Rupak Roy
 
Introduction to Flume
Introduction to FlumeIntroduction to Flume
Introduction to Flume
Rupak Roy
 
Apache Pig Relational Operators - II
Apache Pig Relational Operators - II Apache Pig Relational Operators - II
Apache Pig Relational Operators - II
Rupak Roy
 
Passing Parameters using File and Command Line
Passing Parameters using File and Command LinePassing Parameters using File and Command Line
Passing Parameters using File and Command Line
Rupak Roy
 
Apache PIG Relational Operations
Apache PIG Relational Operations Apache PIG Relational Operations
Apache PIG Relational Operations
Rupak Roy
 
Hierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLPHierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLP
Rupak Roy
 
Clustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLPClustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLP
Rupak Roy
 
Network Analysis - NLP
Network Analysis  - NLPNetwork Analysis  - NLP
Network Analysis - NLP
Rupak Roy
 
Topic Modeling - NLP
Topic Modeling - NLPTopic Modeling - NLP
Topic Modeling - NLP
Rupak Roy
 
Sentiment Analysis Practical Steps
Sentiment Analysis Practical StepsSentiment Analysis Practical Steps
Sentiment Analysis Practical Steps
Rupak Roy
 
NLP - Sentiment Analysis
NLP - Sentiment AnalysisNLP - Sentiment Analysis
NLP - Sentiment Analysis
Rupak Roy
 
Text Mining using Regular Expressions
Text Mining using Regular ExpressionsText Mining using Regular Expressions
Text Mining using Regular Expressions
Rupak Roy
 
Introduction to Text Mining
Introduction to Text Mining Introduction to Text Mining
Introduction to Text Mining
Rupak Roy
 
Apache Hbase Architecture
Apache Hbase ArchitectureApache Hbase Architecture
Apache Hbase Architecture
Rupak Roy
 
Introduction to Hbase
Introduction to Hbase Introduction to Hbase
Introduction to Hbase
Rupak Roy
 
Apache Hive Table Partition and HQL
Apache Hive Table Partition and HQLApache Hive Table Partition and HQL
Apache Hive Table Partition and HQL
Rupak Roy
 
Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export
Rupak Roy
 
Introductive to Hive
Introductive to Hive Introductive to Hive
Introductive to Hive
Rupak Roy
 
Scoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMSScoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMS
Rupak Roy
 
Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode
Rupak Roy
 
Introduction to scoop and its functions
Introduction to scoop and its functionsIntroduction to scoop and its functions
Introduction to scoop and its functions
Rupak Roy
 
Introduction to Flume
Introduction to FlumeIntroduction to Flume
Introduction to Flume
Rupak Roy
 
Apache Pig Relational Operators - II
Apache Pig Relational Operators - II Apache Pig Relational Operators - II
Apache Pig Relational Operators - II
Rupak Roy
 
Passing Parameters using File and Command Line
Passing Parameters using File and Command LinePassing Parameters using File and Command Line
Passing Parameters using File and Command Line
Rupak Roy
 
Apache PIG Relational Operations
Apache PIG Relational Operations Apache PIG Relational Operations
Apache PIG Relational Operations
Rupak Roy
 
Ad

Recently uploaded (20)

PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
Dr. Nasir Mustafa
 
Biophysics Chapter 3 Methods of Studying Macromolecules.pdf
Biophysics Chapter 3 Methods of Studying Macromolecules.pdfBiophysics Chapter 3 Methods of Studying Macromolecules.pdf
Biophysics Chapter 3 Methods of Studying Macromolecules.pdf
PKLI-Institute of Nursing and Allied Health Sciences Lahore , Pakistan.
 
Grade 2 - Mathematics - Printable Worksheet
Grade 2 - Mathematics - Printable WorksheetGrade 2 - Mathematics - Printable Worksheet
Grade 2 - Mathematics - Printable Worksheet
Sritoma Majumder
 
Drive Supporter Growth from Awareness to Advocacy with TechSoup Marketing Ser...
Drive Supporter Growth from Awareness to Advocacy with TechSoup Marketing Ser...Drive Supporter Growth from Awareness to Advocacy with TechSoup Marketing Ser...
Drive Supporter Growth from Awareness to Advocacy with TechSoup Marketing Ser...
TechSoup
 
How to Create A Todo List In Todo of Odoo 18
How to Create A Todo List In Todo of Odoo 18How to Create A Todo List In Todo of Odoo 18
How to Create A Todo List In Todo of Odoo 18
Celine George
 
Lecture 1 Introduction history and institutes of entomology_1.pptx
Lecture 1 Introduction history and institutes of entomology_1.pptxLecture 1 Introduction history and institutes of entomology_1.pptx
Lecture 1 Introduction history and institutes of entomology_1.pptx
Arshad Shaikh
 
Cultivation Practice of Garlic in Nepal.pptx
Cultivation Practice of Garlic in Nepal.pptxCultivation Practice of Garlic in Nepal.pptx
Cultivation Practice of Garlic in Nepal.pptx
UmeshTimilsina1
 
Exercise Physiology MCQS By DR. NASIR MUSTAFA
Exercise Physiology MCQS By DR. NASIR MUSTAFAExercise Physiology MCQS By DR. NASIR MUSTAFA
Exercise Physiology MCQS By DR. NASIR MUSTAFA
Dr. Nasir Mustafa
 
Herbs Used in Cosmetic Formulations .pptx
Herbs Used in Cosmetic Formulations .pptxHerbs Used in Cosmetic Formulations .pptx
Herbs Used in Cosmetic Formulations .pptx
RAJU THENGE
 
BỘ ĐỀ TUYỂN SINH VÀO LỚP 10 TIẾNG ANH - 25 ĐỀ THI BÁM SÁT CẤU TRÚC MỚI NHẤT, ...
BỘ ĐỀ TUYỂN SINH VÀO LỚP 10 TIẾNG ANH - 25 ĐỀ THI BÁM SÁT CẤU TRÚC MỚI NHẤT, ...BỘ ĐỀ TUYỂN SINH VÀO LỚP 10 TIẾNG ANH - 25 ĐỀ THI BÁM SÁT CẤU TRÚC MỚI NHẤT, ...
BỘ ĐỀ TUYỂN SINH VÀO LỚP 10 TIẾNG ANH - 25 ĐỀ THI BÁM SÁT CẤU TRÚC MỚI NHẤT, ...
Nguyen Thanh Tu Collection
 
03#UNTAGGED. Generosity in architecture.
03#UNTAGGED. Generosity in architecture.03#UNTAGGED. Generosity in architecture.
03#UNTAGGED. Generosity in architecture.
MCH
 
apa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdfapa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdf
Ishika Ghosh
 
The History of Kashmir Karkota Dynasty NEP.pptx
The History of Kashmir Karkota Dynasty NEP.pptxThe History of Kashmir Karkota Dynasty NEP.pptx
The History of Kashmir Karkota Dynasty NEP.pptx
Arya Mahila P. G. College, Banaras Hindu University, Varanasi, India.
 
All About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdfAll About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdf
TechSoup
 
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
Celine George
 
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptxSCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
Ronisha Das
 
dynastic art of the Pallava dynasty south India
dynastic art of the Pallava dynasty south Indiadynastic art of the Pallava dynasty south India
dynastic art of the Pallava dynasty south India
PrachiSontakke5
 
Ajanta Paintings: Study as a Source of History
Ajanta Paintings: Study as a Source of HistoryAjanta Paintings: Study as a Source of History
Ajanta Paintings: Study as a Source of History
Virag Sontakke
 
Tax evasion, Tax planning & Tax avoidance.pptx
Tax evasion, Tax  planning &  Tax avoidance.pptxTax evasion, Tax  planning &  Tax avoidance.pptx
Tax evasion, Tax planning & Tax avoidance.pptx
manishbaidya2017
 
Link your Lead Opportunities into Spreadsheet using odoo CRM
Link your Lead Opportunities into Spreadsheet using odoo CRMLink your Lead Opportunities into Spreadsheet using odoo CRM
Link your Lead Opportunities into Spreadsheet using odoo CRM
Celine George
 
PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
Dr. Nasir Mustafa
 
Grade 2 - Mathematics - Printable Worksheet
Grade 2 - Mathematics - Printable WorksheetGrade 2 - Mathematics - Printable Worksheet
Grade 2 - Mathematics - Printable Worksheet
Sritoma Majumder
 
Drive Supporter Growth from Awareness to Advocacy with TechSoup Marketing Ser...
Drive Supporter Growth from Awareness to Advocacy with TechSoup Marketing Ser...Drive Supporter Growth from Awareness to Advocacy with TechSoup Marketing Ser...
Drive Supporter Growth from Awareness to Advocacy with TechSoup Marketing Ser...
TechSoup
 
How to Create A Todo List In Todo of Odoo 18
How to Create A Todo List In Todo of Odoo 18How to Create A Todo List In Todo of Odoo 18
How to Create A Todo List In Todo of Odoo 18
Celine George
 
Lecture 1 Introduction history and institutes of entomology_1.pptx
Lecture 1 Introduction history and institutes of entomology_1.pptxLecture 1 Introduction history and institutes of entomology_1.pptx
Lecture 1 Introduction history and institutes of entomology_1.pptx
Arshad Shaikh
 
Cultivation Practice of Garlic in Nepal.pptx
Cultivation Practice of Garlic in Nepal.pptxCultivation Practice of Garlic in Nepal.pptx
Cultivation Practice of Garlic in Nepal.pptx
UmeshTimilsina1
 
Exercise Physiology MCQS By DR. NASIR MUSTAFA
Exercise Physiology MCQS By DR. NASIR MUSTAFAExercise Physiology MCQS By DR. NASIR MUSTAFA
Exercise Physiology MCQS By DR. NASIR MUSTAFA
Dr. Nasir Mustafa
 
Herbs Used in Cosmetic Formulations .pptx
Herbs Used in Cosmetic Formulations .pptxHerbs Used in Cosmetic Formulations .pptx
Herbs Used in Cosmetic Formulations .pptx
RAJU THENGE
 
BỘ ĐỀ TUYỂN SINH VÀO LỚP 10 TIẾNG ANH - 25 ĐỀ THI BÁM SÁT CẤU TRÚC MỚI NHẤT, ...
BỘ ĐỀ TUYỂN SINH VÀO LỚP 10 TIẾNG ANH - 25 ĐỀ THI BÁM SÁT CẤU TRÚC MỚI NHẤT, ...BỘ ĐỀ TUYỂN SINH VÀO LỚP 10 TIẾNG ANH - 25 ĐỀ THI BÁM SÁT CẤU TRÚC MỚI NHẤT, ...
BỘ ĐỀ TUYỂN SINH VÀO LỚP 10 TIẾNG ANH - 25 ĐỀ THI BÁM SÁT CẤU TRÚC MỚI NHẤT, ...
Nguyen Thanh Tu Collection
 
03#UNTAGGED. Generosity in architecture.
03#UNTAGGED. Generosity in architecture.03#UNTAGGED. Generosity in architecture.
03#UNTAGGED. Generosity in architecture.
MCH
 
apa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdfapa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdf
Ishika Ghosh
 
All About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdfAll About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdf
TechSoup
 
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
Celine George
 
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptxSCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
Ronisha Das
 
dynastic art of the Pallava dynasty south India
dynastic art of the Pallava dynasty south Indiadynastic art of the Pallava dynasty south India
dynastic art of the Pallava dynasty south India
PrachiSontakke5
 
Ajanta Paintings: Study as a Source of History
Ajanta Paintings: Study as a Source of HistoryAjanta Paintings: Study as a Source of History
Ajanta Paintings: Study as a Source of History
Virag Sontakke
 
Tax evasion, Tax planning & Tax avoidance.pptx
Tax evasion, Tax  planning &  Tax avoidance.pptxTax evasion, Tax  planning &  Tax avoidance.pptx
Tax evasion, Tax planning & Tax avoidance.pptx
manishbaidya2017
 
Link your Lead Opportunities into Spreadsheet using odoo CRM
Link your Lead Opportunities into Spreadsheet using odoo CRMLink your Lead Opportunities into Spreadsheet using odoo CRM
Link your Lead Opportunities into Spreadsheet using odoo CRM
Celine George
 

Summary statistics

  • 2. Definition Summary statistics are used to summarize a set of observations in order to communicate as much as information about the data as possible. It is part of descriptive statistics and are used to basically summarize or describe a set of observations. Rupak Roy
  • 3. Example The weight of the population are 45 kg 57kg 72 kg 52 kg Now what we want here is the summary of weight of the population , we can say it is the average weight of the population is 56.5 kg and now we can describe the population in the simplest way as possible. Rupak Roy
  • 4. Types Summary statistics Measures of Central Tendency 1 . Mean 2 . Median 3 . Mode 5 . Geometric Mean Measures of Dispersion 1. Standard Deviation 2. Variance 3. Interquartile Range Others 1. Co efficient 2. Skewness 3. Kurtosis 4. Probability Distributions. 5. Distribution plot Rupak Roy
  • 5. Definition  Measures of central tendency : is the value that describes which group of data clusters around a central value. In simple words , it is a way to describe the center of a data set. Again what is center of data ? A single number that summarizes the entire dataset using techniques such as mean/average or median of the dataset.  Measures of Dispersion: “dispersion (also called variability, scatter, or spread) is the extent to which a distribution of data is stretched or squeezed.” Here in the graph we can see the distribution of data (assume population) is more stretched at the right side ranging from 50 to 80
  • 6. Measures of Central Tendency 1. Mean : is the average of observations. Most effective when data is not heavily skewed. 2. Median: represents the middle value of the dataset. Useful for skewed data. We will talk about skewed data in the upcoming slides. 3. Mode: means max no of times the data has occurred. 4. Geometric mean: nth root of a product of n numbers. It is used when we want to get the average rate of the event and the event rate is determined by multiplication. For example growth of a bank account per year in a ABC bank is calculated by geometric mean since the growth event rate is determined by multiplying the amount of a bank account by the percentage of growth. then we use geometric mean. Rupak Roy
  • 7.  Formula for calculating Geometric Mean GM = example: Geometric Mean of 23,56,66 ? 3 23 * 56*66 3 85008 = 43.9696761which means 3times of 43.9696761 is 85008 Note: if one of the observation in the event is zero , Geometric Mean becomes Zero and also it doesn’t works with negative numbers like -1 , -4 , -5 and so on. Rupak Roy
  • 8. Calculation of Mode ; <- Delta For ungrouped data = Max no of items Example : 23,45,76,33,54,33,76,33 Therefore Mode = 33 For grouped data = = {(L + Delta 1) / Delta 1+Detal2 } * i Where Delta 1 = f1 +f0 and Delta 2 = f1- f2 Nowadays, we don’t have to worry about the calculation, as in any statistical software's like R, excel it will automatically calculate the intense calculation for large amount of data but for more in-depth information you can visit this website. https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
  • 9. Measures of dispersion Standard Deviation is basically a measure of how near or far the observations are from the mean. Variance: the fact or quality of being different , divergent or inconsistent. A value of zero means that there is no variability , all the values in the data set are the same. Interquartile Range: is a measure of variability , by dividing a data set into parts that is quartiles . Say Q1 is the middle value in the first half of the data set. Q2 is the median value . Q3 is the middle value in the second half of the rank-ordered data set. There interquartile range = Q3 – Q1
  • 10. Skewness – refers to the lack of symmetry or imbalance in data distribution. In a symmetric distribution the data is normally distributed where mean, median, mode is at the same point. However in real life data is never perfectly distributed, hence we call it skewed data. If the Left side has longer tail then the mass distribution of data is concentrated on the right side which is known as negatively skewed.
  • 11. If the Right side has longer tail then the mass distribution of data is concentrated on the left side is known as positive skewed. Here is the summary of all the skewness as shown in the figure below.
  • 12. Example (skewed data) Temp(*c) 10 40 35 33 35 Mean = 153/5 = 30.6, if we apply mean is 30.6 which is incorrect since we can see maximum number of values are above 35. So we have to use median For Ungrouped data ((n+1)/2)th That will be ((5+1)/2)th = 6/2 = 3 i.e. 3th term ie 35. For grouped data: where L, lower class boundary of the group containing the group. B, Cumulative frequency of the groups G , Frequency of the median group W , width/Range of the group Again, we don’t have to worry about the calculation, as in any statistical software's like R , excel it will automatically calculate the intense calculation for large amount of data but for more in-depth information you can visit this website. https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
  • 13. Kurtosis : is a measure of whether data are peaked or flat relative to normal distribution (+) Leptokurtic (-) PlatyKurtic (0) Meskurtic (+) Leptokurtic This means the distribution is more clustered near the mean and has a relativity less standard deviation (-) PlatyKurtic Where the distribution is less clustered around the mean and a standard deviation more then Leptokurtic (0) Meskurtic is typically measured with respect to the normal distribution. Meskurtic has tails similar to normal distribution i.e neither high nor low, rather it is consider to be a baseline for the other two’s.
  • 14.  Now how to check the data is skewed or not in Excel: =skew(select the range of values/numbers) =skew(10.24,9.48……….-0.42,-0.95) = - 0.27 means Negatively skewed. And to check the Kurtosis in Excel =kurt(select the values/numbers) =kurt(10.24,9.48……….-0.42,-0.95) = -1.6 means it is PlatyKurtic
  • 15. Recap What we have learned ? Measures of central tendency, Measures of dispersion, Measure of risk, Next we will see how to compute this theory in practical and analyze any data using our everyday simple tools like Excel. Rupak Roy
  • 16. To be continued ………