SlideShare a Scribd company logo
Descriptive Statistics
Numerical Measures
Dr. Subhradev Sen
Alliance School of Business
ALLIANCE UNIVERSITY
Lecture Overview
• Measures of Location/Central Tendency
• Measures of Variability/Dispersion
• Measures of distribution shape, relative
location
• Weighted mean & Grouped Data
• Outliers & Box Plot
• Correlation Coefficient
MEASURES OF LOCATION
 Mean
 Median
 Mode
 Percentiles
 Quartiles
MEASURES OF LOCATION
Mean:
MEASURES OF LOCATION
Mean:
MEASURES OF LOCATION
Median:
The median is another measure of central
location. The median is the value in the middle
when the data are arranged in ascending order
(smallest value to largest value).
MEASURES OF LOCATION
MEASURES OF LOCATION
MEASURES OF LOCATION
Whenever a data set contains extreme values,
the median is often the preferred measure of
central location.
MEASURES OF LOCATION
Mode:
The mode is the value that occurs with greatest
frequency.
MEASURES OF LOCATION
Mode:
MEASURES OF LOCATION
Percentiles:
A percentile provides information about how
the data are spread over the interval from the
smallest value to the largest value.
MEASURES OF LOCATION
Percentiles:
For data that do not contain numerous repeated
values, the pth percentile divides the data into
two parts. Approximately p percent of the
observations have values less than the pth
percentile; approximately (100-p) percent of the
observations have values greater than the pth
percentile.
MEASURES OF LOCATION
Percentiles (Formal Definition):
The pth percentile is a value such that at least p
percent of the observations are less than or
equal to this value and at least (100-p) percent
of the observations are greater than or equal to
this value.
MEASURES OF LOCATION
Percentiles:
Colleges and universities frequently report
admission test scores in terms of percentiles.
MEASURES OF LOCATION
Percentiles:
Suppose an applicant obtains a raw score of 54 on
the verbal portion of an admission test. How this
student performed in relation to other students
taking the same test may not be readily apparent.
However, if the raw score of 54 corresponds to the
70th percentile, we know that approximately 70%
of the students scored lower than this individual
and approximately 30% of the students scored
higher than this individual.
MEASURES OF LOCATION
Percentiles Calculation:
MEASURES OF LOCATION
Percentiles Calculation Illustration:
MEASURES OF LOCATION
Percentiles Calculation Illustration: Interest is to
find the 85th percentile.
MEASURES OF LOCATION
Percentiles:
Note that the
50th percentile is the median.
MEASURES OF LOCATION
Quartiles:
It is often desirable to divide data into four
parts, with each part containing approximately
one-fourth, or 25% of the observations.
MEASURES OF LOCATION
Quartiles:
MEASURES OF LOCATION
Quartiles:
MEASURES OF LOCATION
Quartiles: (Illustration)
MEASURES OF LOCATION
Quartiles: (Illustration)
MEASURES OF LOCATION
Grouped Data: AVERAGE or MEAN
In case data are given as frequency distribution
or as grouped data. The mean is obtained by
Where
MEASURES OF LOCATION
Grouped Data: AVERAGE or MEAN
Example:
Value Frequency
1 5
2 9
3 12
4 17
5 14
6 10
7 6
MEASURES OF LOCATION
Grouped Data: AVERAGE or MEAN
Example:
Mean=299/73=4.09
Value(x) Frequency(f) xf
1 5 5
2 9 18
3 12 36
4 17 68
5 14 70
6 10 60
7 6 42
TOTAL 73=N 299
MEASURES OF LOCATION
Grouped Data: AVERAGE or MEAN
In case of grouped or continuous frequency
distribution, “x” is taken as the mid value of the
corresponding class
MEASURES OF LOCATION
Grouped Data: AVERAGE or MEAN
Example: Marks No. of Students
0-10 12
10-20 18
20-30 27
30-40 20
40-50 17
50-60 6
MEASURES OF LOCATION
Grouped Data: AVERAGE or MEAN
Example:
Mean=2800/100=28
Marks
No. of
Students
(f)
Mid-Point
(x) x.f
0-10 12 5 60
10-20 18 15 270
20-30 27 25 675
30-40 20 35 700
40-50 17 45 765
50-60 6 55 330
TOTAL 100 2800
MEASURES OF LOCATION
Grouped Data: MEDIAN
Example:
Value Frequency
1 8
2 10
3 11
4 16
5 20
6 25
7 15
8 9
9 6
MEASURES OF LOCATION
Grouped Data: MEDIAN
Example:
1. Find N/2, where N is the total frequency.
2. See the cumulative frequency just greater than
N/2.
3. The corresponding value of “x” is the median
MEASURES OF LOCATION
Grouped Data: MEDIAN
N/2=120/2=60
Median=5
Value Frequency
Cumulative
frequency(<)
1 8 8
2 10 18
3 11 29
4 16 45
5 20 65
6 25 90
7 15 105
8 9 114
9 6 120
MEASURES OF LOCATION
Grouped Data: MEDIAN
Wage No. of Employees
2000-3000 3
3000-4000 5
4000-5000 20
5000-6000 10
6000-7000 5
MEASURES OF LOCATION
Grouped Data: MEDIAN
L: Lower limit of the median class.
f: Frequency of the median class.
h: Width of the median class.
C: Cumulative Frequency of the class just
preceding the median class.
MEASURES OF LOCATION
Grouped Data: MEDIAN
N/2=43/2=21.5
Median = 4000+(1000/20)(21.5-8)
= 4000+675 = 4675
Wage
No. of
Employees cf
2000-3000 3 3
3000-4000 5 8
4000-5000 20 28
5000-6000 10 38
6000-7000 5 43
MEASURES OF LOCATION
Grouped Data: MODE
Example:
Marks No. of Students
0-10 5
10-20 8
20-30 7
30-40 12
40-50 28
50-60 20
60-70 10
70-80 10
MEASURES OF LOCATION
Grouped Data: MODE
MEASURES OF LOCATION
Grouped Data: MODE
Marks
No. of
Students
0-10 5
10-20 8
20-30 7
30-40 12
40-50 28
50-60 20
60-70 10
70-80 10
MEASURES OF VARIABILITY
 Range
 Inter Quartile Range (IQR)
 Variance
 Standard Deviation
 Coefficient of Variation (CV)
MEASURES OF VARIABILITY
Range:
Range is
Largest Value - Smallest Value
MEASURES OF VARIABILITY
Range: (Illustration)
The largest starting salary is 3925 and the
smallest is 3310. The range is 3925-3310=615.
MEASURES OF VARIABILITY
Inter Quartile Range (IQR):
MEASURES OF VARIABILITY
Inter Quartile Range (IQR): (Illustration)
MEASURES OF VARIABILITY
Variance:
For Raw Data:
For Frequency Data:
Sample variance is denoted by 𝒔𝟐
(Not by σ2
),
σ2
is used for population variance indication.
MEASURES OF VARIABILITY
Variance:
MEASURES OF VARIABILITY
Standard Deviation:
The standard deviation (sd) is defined to be the
positive square root of the variance. It is
denoted by σ.
Sample sd is denoted by s
For the previous example,
MEASURES OF VARIABILITY
Note:
Note that when we deal with sample, standard
deviation notation is “s”.
MEASURES OF VARIABILITY
Coefficient of Variation (CV):
In some situations we may be interested in a
descriptive statistic that indicates how large the
standard deviation is relative to the mean.
This measure is called the coefficient of
variation and is usually expressed as a
percentage.
MEASURES OF VARIABILITY
Coefficient of Variation (CV):
CV is expressed in percentage and is utilized for
comparing performances of individual on a similar
variable of measurement
MEASURES OF VARIABILITY
Moments:
 Raw Moments (Moments about origin “0”)
 Central Moments (Moments about sample
mean)
MEASURES OF VARIABILITY
Moments: Raw Moments (Moments about
origin “0”)
MEASURES OF VARIABILITY
Moments: Central Moments (Moments about
mean)
MEASURES OF DISTRIBUTION SHAPE
Measures of Skewness:
MEASURES OF DISTRIBUTION SHAPE
Other Measures of Skewness:
MEASURES OF DISTRIBUTION SHAPE
Measure of Kurtosis
Measure of Kurtosis
Pooled Mean & Pooled Variance
Suppose the data is given in different groups in
the following fashion.
Groups Sample Size Sample
Mean
Sample
Variance
Group-1 𝐧𝟏 𝐱𝟏 𝐬𝟏
𝟐
Group-2 𝐧𝟐 𝐱𝟐 𝐬𝟐
𝟐
Group-3 𝐧𝟑 𝐱𝟑 𝐬𝟑
𝟐
Pooled Mean & pooled Variance
Then to find
overall mean taking all the groups together we
use the concept of pooled mean or Overall
Mean.
overall variance taking all the groups together
we use the concept of pooled variance or
Composite variance.
Pooled Mean & Pooled Variance
Pooled Mean:
Pooled Variance (or Composite Variance):
where
MEASURES OF RELATIVE LOCATION
Z-Score:
Measures of relative location help us determine
how far a particular value is from the mean.
MEASURES OF RELATIVE LOCATION
Z-Score:
Z-score is called a standardized value
MEASURES OF RELATIVE LOCATION
Z-Score:
The z-score, 𝑧𝑖 , can be interpreted as the
number of standard deviations 𝑥𝑖 is from the
mean .
For example, 𝑧1= 1.2 would indicate that 𝑥1 is
1.2 standard deviations greater than the sample
mean.
Similarly, 𝑧2= - 0.5 would indicate that 𝑥2 is .5,
or 1/2, standard deviation less than the sample
mean.
MEASURES OF RELATIVE LOCATION
MEASURES OF RELATIVE LOCATION
Data Exploration
Five Number Summary:
In a five-number summary, the following five
numbers are used to summarize the data:
Data Exploration
Five Number Summary:
For Graduate Salary Data:
Data Exploration
Box Plot:
A box plot is a graphical summary of data that is
based on a five-number summary.
Data Exploration
Box Plot:
Data Exploration
Outliers:
Sometimes a data set will have one or more
observations with unusually large or unusually
small values. These extreme values are called
outliers.
Data Exploration
Outliers:
Experienced statisticians take steps to identify
outliers and then review each one carefully.
Data Exploration
Outliers:
 An outlier may be a data value that has
been incorrectly recorded. If so, it can be
corrected before further analysis.
 An outlier may also be from an observation
that was incorrectly included in the data
set; if so, it can be removed.
Data Exploration
Outliers:
 Finally, an outlier may be an unusual data
value that has been recorded correctly and
belongs in the data set. In such cases it
should remain.
Correlation
Sometime we may be interested in finding out
relationship measure between two variables
considered in our data.
Correlation
Covariance between X and Y
Correlation
Covariance:
Example: Sample data for Stereo & Sound
Equipment.
Correlation
Covariance:
Scatter diagram
Correlation
Covariance:
Correlation
Covariance:
Correlation
Interpretation of Covariance:
Correlation
Interpretation of Covariance:
Correlation
Interpretation of Covariance:
Pearson’s Correlation Coefficient
Karl Pearson’s Correlation Coefficient:
A unit free measure which measures the linear
relationship between two variables.
Karl Pearson’s Correlation Coefficient:
Where,
Pearson’s Correlation Coefficient
Pearson’s Correlation Coefficient
Pearson’s Correlation Coefficient:
Pearson’s Correlation Coefficient
Karl Pearson’s Correlation Coefficient:
Pearson’s Correlation Coefficient
Pearson’s Correlation Coefficient:
We conclude that a strong positive linear
relationship occurs between the number of
commercials and sales.
More specifically, an increase in the number of
commercials is associated with an increase in
sales.
Pearson’s Correlation Coefficient
Pearson’s Correlation Coefficient: Note
1. rxy ranges from -1 to +1
2. It measures linear relationship between variables
3. Need quantitative variables to calculate the
coefficient.
4. Correlation Coefficient is unit free.
Pearson’s Correlation Coefficient
Home Work: (1)

More Related Content

What's hot (20)

PPT
Unit 7.plants
Alexandre Bárez
 
PPT
lfstat3e_ppt_01_rev.ppt
AnkurSingh340457
 
PDF
Normal and standard normal distribution
Avjinder (Avi) Kaler
 
PPTX
Descriptive statistics
Murugesan Kandan
 
PPTX
Frequency Distributions for Organizing and Summarizing
Long Beach City College
 
PPTX
A power point presentation on statistics
Kriace Ward
 
PPTX
Basic Descriptive statistics
Ajendra Sharma
 
PPT
probability
Unsa Shakir
 
PPTX
Measure of Central Tendency (Mean, Median, Mode and Quantiles)
Salman Khan
 
PDF
Linear models for data science
Brad Klingenberg
 
PPT
General Statistics boa
raileeanne
 
PPTX
ANOVA-One Way Classification
Sharlaine Ruth
 
PPTX
Kolmogorov smirnov test
Ramesh Giri
 
PPTX
Correlation and regression
Abdelaziz Tayoun
 
PPTX
Statistics
itutor
 
PPTX
Intro to statistics
Ratheeshkrishnakripa
 
PDF
Model selection
Animesh Kumar
 
PPT
Stat 1 variables & sampling
Forensic Pathology
 
PPTX
Basics of Educational Statistics (Descriptive statistics)
HennaAnsari
 
PPTX
Exploratory data analysis
Vishwas N
 
Unit 7.plants
Alexandre Bárez
 
lfstat3e_ppt_01_rev.ppt
AnkurSingh340457
 
Normal and standard normal distribution
Avjinder (Avi) Kaler
 
Descriptive statistics
Murugesan Kandan
 
Frequency Distributions for Organizing and Summarizing
Long Beach City College
 
A power point presentation on statistics
Kriace Ward
 
Basic Descriptive statistics
Ajendra Sharma
 
probability
Unsa Shakir
 
Measure of Central Tendency (Mean, Median, Mode and Quantiles)
Salman Khan
 
Linear models for data science
Brad Klingenberg
 
General Statistics boa
raileeanne
 
ANOVA-One Way Classification
Sharlaine Ruth
 
Kolmogorov smirnov test
Ramesh Giri
 
Correlation and regression
Abdelaziz Tayoun
 
Statistics
itutor
 
Intro to statistics
Ratheeshkrishnakripa
 
Model selection
Animesh Kumar
 
Stat 1 variables & sampling
Forensic Pathology
 
Basics of Educational Statistics (Descriptive statistics)
HennaAnsari
 
Exploratory data analysis
Vishwas N
 

Similar to 2-Descriptive statistics.pptx (20)

PPTX
Measures of Central Tendency, Variability and Shapes
ScholarsPoint1
 
PDF
Descriptive Statistics
Anthony J. Evans
 
PPTX
determinatiion of
University of Balochistan
 
PPTX
Descriptive Statistics.pptx
Shashank Mishra
 
PPTX
Statistics four
Mohamed Hefny
 
PPT
Statistics
Deepanshu Sharma
 
PPT
Statistical Method for engineers and science
usaproductservices
 
PPT
Class1.ppt Class StructureBasics of Statistics
deepanoel
 
PPTX
Measures of Central Tendency With Variance and Ranges.pptx
NecroManXer
 
PPT
Describing quantitative data with numbers
Ulster BOCES
 
PPTX
Chapter 4
Lem Lem
 
PPTX
measures of central tendency.pptx
Manish Agarwal
 
PPT
STATISTICS BASICS INCLUDING DESCRIPTIVE STATISTICS
nagamani651296
 
PPT
Class1.ppt
Gautam G
 
PPT
Introduction to Statistics - Basics of Data - Class 1
RajnishSingh367990
 
PPT
Introduction to statistics covering the basics
OptiAgileBusinessSer
 
PPT
Class1.ppt
PerumalPitchandi
 
PPT
Class1.ppt
Sandeepkumar628916
 
PPT
Class1.ppt
hanreaz219
 
PPT
Business Statistics Chapter Three Power points
TeferiGeta2
 
Measures of Central Tendency, Variability and Shapes
ScholarsPoint1
 
Descriptive Statistics
Anthony J. Evans
 
determinatiion of
University of Balochistan
 
Descriptive Statistics.pptx
Shashank Mishra
 
Statistics four
Mohamed Hefny
 
Statistics
Deepanshu Sharma
 
Statistical Method for engineers and science
usaproductservices
 
Class1.ppt Class StructureBasics of Statistics
deepanoel
 
Measures of Central Tendency With Variance and Ranges.pptx
NecroManXer
 
Describing quantitative data with numbers
Ulster BOCES
 
Chapter 4
Lem Lem
 
measures of central tendency.pptx
Manish Agarwal
 
STATISTICS BASICS INCLUDING DESCRIPTIVE STATISTICS
nagamani651296
 
Class1.ppt
Gautam G
 
Introduction to Statistics - Basics of Data - Class 1
RajnishSingh367990
 
Introduction to statistics covering the basics
OptiAgileBusinessSer
 
Class1.ppt
PerumalPitchandi
 
Class1.ppt
Sandeepkumar628916
 
Class1.ppt
hanreaz219
 
Business Statistics Chapter Three Power points
TeferiGeta2
 
Ad

More from SandipanMaji3 (20)

PPT
Process Selection and Facility layout.ppt
SandipanMaji3
 
PPTX
E-Commerce Management Topic 5 - Vinay Kalyan Parakala Alliance University.pptx
SandipanMaji3
 
PDF
p&g analysis_.pdf
SandipanMaji3
 
PDF
THE GOAL ( PDFDrive ).pdf
SandipanMaji3
 
PPTX
Module 13 - Exploratory Factor Analysis.pptx
SandipanMaji3
 
PPTX
03. AD-AS.pptx
SandipanMaji3
 
PPTX
Book My Show.pptx
SandipanMaji3
 
PPTX
ob.pptx
SandipanMaji3
 
PPTX
Ts.pptx
SandipanMaji3
 
PPTX
Ch4.pptx
SandipanMaji3
 
PPTX
Unit 1.pptx
SandipanMaji3
 
PPTX
SUPPLY CHAIN DRIVERS AND METRICS.pptx
SandipanMaji3
 
PPTX
AVIATION SECTOR.pptx
SandipanMaji3
 
PPT
Chap006 (1) (1).ppt
SandipanMaji3
 
PPTX
KingandLawleychapter1.pptx
SandipanMaji3
 
PPTX
Dickinson_Sample_Slides.pptx
SandipanMaji3
 
PPT
chapter 7 classroom ppt (1).ppt
SandipanMaji3
 
PPTX
Module-2.pptx
SandipanMaji3
 
PPTX
Book My Show.pptx
SandipanMaji3
 
PPTX
Module-1.pptx
SandipanMaji3
 
Process Selection and Facility layout.ppt
SandipanMaji3
 
E-Commerce Management Topic 5 - Vinay Kalyan Parakala Alliance University.pptx
SandipanMaji3
 
p&g analysis_.pdf
SandipanMaji3
 
THE GOAL ( PDFDrive ).pdf
SandipanMaji3
 
Module 13 - Exploratory Factor Analysis.pptx
SandipanMaji3
 
03. AD-AS.pptx
SandipanMaji3
 
Book My Show.pptx
SandipanMaji3
 
ob.pptx
SandipanMaji3
 
Ts.pptx
SandipanMaji3
 
Ch4.pptx
SandipanMaji3
 
Unit 1.pptx
SandipanMaji3
 
SUPPLY CHAIN DRIVERS AND METRICS.pptx
SandipanMaji3
 
AVIATION SECTOR.pptx
SandipanMaji3
 
Chap006 (1) (1).ppt
SandipanMaji3
 
KingandLawleychapter1.pptx
SandipanMaji3
 
Dickinson_Sample_Slides.pptx
SandipanMaji3
 
chapter 7 classroom ppt (1).ppt
SandipanMaji3
 
Module-2.pptx
SandipanMaji3
 
Book My Show.pptx
SandipanMaji3
 
Module-1.pptx
SandipanMaji3
 
Ad

Recently uploaded (20)

PDF
taking-action-on-nature-webinar-slides.pdf
ybhyrfp5t2
 
PDF
547229304-CBS-2021businesscasebook2o.pdf
CngNguynngHng
 
PPTX
Cruise API Provider | Amadeus Cruise API | Cruise Software
anusharajraj21
 
PDF
Concept Topology in Architectural Build Addendum.pdf
Brij Consulting, LLC
 
PDF
Rise of the Build Division, Conquering the Curve.pdf
Brij Consulting, LLC
 
DOCX
Oleg Belay: biography, strategy, and business philosophy of the leader of TRI...
trinfico
 
PPTX
Marketing Presentations - How to do them
Jaime Ernesto Portillo De la O
 
PPTX
How Essar Transforms the Planet while Investing in People Over Profit
essarcase
 
PPTX
_IIML_Optimizing Energy Efficiency in Industrial Operations with GenAI_Team I...
rafinrowshan
 
PDF
Accenture. (2024). Risk Study, 2024 Edition - Hyper-disruption demands consta...
yofep71646
 
PPTX
Customer screenshots from Quark Publishing Platform
Gareth Oakes
 
PPTX
Revolutionizing Shopping: Voice Commerce in Retail and eCommerce
RUPAL AGARWAL
 
PDF
Stuart Frost - An Accomplished CEO
Stuart Frost
 
PPTX
PwC Final PPT.pptx pitch Deck presentation
rafinrowshan
 
PDF
Corporate Governance for South African Mining Companies (a practitioner's view).
James AH Campbell
 
PPTX
epi editorial commitee meeting presentation
MIPLM
 
PDF
Factors Influencing Demand For Plumbers In Toronto GTA:
Homestars
 
PDF
🌿 Mohit Bansal – Building a Greener Future from Chandigarh.pdf
Mohit Bansal Chandigarh
 
PPTX
Unlocking the Power of Process Mining: Driving Efficiency Through Data
RUPAL AGARWAL
 
PDF
NewBase 14 July 2025 Energy News issue - 1802 by Khaled Al Awadi_compressed ...
Khaled Al Awadi
 
taking-action-on-nature-webinar-slides.pdf
ybhyrfp5t2
 
547229304-CBS-2021businesscasebook2o.pdf
CngNguynngHng
 
Cruise API Provider | Amadeus Cruise API | Cruise Software
anusharajraj21
 
Concept Topology in Architectural Build Addendum.pdf
Brij Consulting, LLC
 
Rise of the Build Division, Conquering the Curve.pdf
Brij Consulting, LLC
 
Oleg Belay: biography, strategy, and business philosophy of the leader of TRI...
trinfico
 
Marketing Presentations - How to do them
Jaime Ernesto Portillo De la O
 
How Essar Transforms the Planet while Investing in People Over Profit
essarcase
 
_IIML_Optimizing Energy Efficiency in Industrial Operations with GenAI_Team I...
rafinrowshan
 
Accenture. (2024). Risk Study, 2024 Edition - Hyper-disruption demands consta...
yofep71646
 
Customer screenshots from Quark Publishing Platform
Gareth Oakes
 
Revolutionizing Shopping: Voice Commerce in Retail and eCommerce
RUPAL AGARWAL
 
Stuart Frost - An Accomplished CEO
Stuart Frost
 
PwC Final PPT.pptx pitch Deck presentation
rafinrowshan
 
Corporate Governance for South African Mining Companies (a practitioner's view).
James AH Campbell
 
epi editorial commitee meeting presentation
MIPLM
 
Factors Influencing Demand For Plumbers In Toronto GTA:
Homestars
 
🌿 Mohit Bansal – Building a Greener Future from Chandigarh.pdf
Mohit Bansal Chandigarh
 
Unlocking the Power of Process Mining: Driving Efficiency Through Data
RUPAL AGARWAL
 
NewBase 14 July 2025 Energy News issue - 1802 by Khaled Al Awadi_compressed ...
Khaled Al Awadi
 

2-Descriptive statistics.pptx