STATISTICS IN RESEARCH (PART – 1 DESCRIPTIVE)
Role of statistics in research
 Designing research
 Analyzing data
 Drawing conclusions about research
Two major areas of statistics
 Descriptive statistics
 It is concerned with developing certain indices from the raw
data.
 It summarizes collected/classified data.
 Inferential statistics
 It generalizes from small groups
(i.e., samples) to the population.
 It is also known as sampling statistics.
 It is concerned with two major problems:
 Estimation of population parameters.
 Testing of statistical hypotheses.
Descriptive statistics
 Measure of central tendency (Statistical
Average)
 Measure of dispersion
 Measure of asymmetry (Skewness)
 Measure of relationship
 Other measures
Measure of Central tendency
 It is also known as the statistical average.
 Mean, median and mode are the most popular
averages.
 The geometric and harmonic means are also
sometimes used.
Mean
 It is also known as the arithmetic average.
 The mean is found by adding all the values in
the set, then dividing the sum by the number of
values.
Example
 Find the mean of the following numbers: 8, 9, 10,
10, 10, 11, 11, 11, 12, 13
The mean is the usual average:
 n = 10
 (8 + 9 + 10 + 10 + 10 + 11 + 11 + 11 + 12 + 13)
÷ 10 = 105 ÷ 10 = 10.5
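The calculation above can be checked with a few lines of Python. This is only a minimal sketch; the variable names are illustrative and not part of the slides.

```python
from statistics import fmean

values = [8, 9, 10, 10, 10, 11, 11, 11, 12, 13]

# Arithmetic mean: sum of the values divided by how many there are.
manual_mean = sum(values) / len(values)

# The standard library gives the same result.
library_mean = fmean(values)

print(manual_mean)   # 10.5
print(library_mean)  # 10.5
```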
Weighted Mean
 The weighted average formula is used to
calculate the average value of a particular set
of numbers with different levels of weight.
Example
 The final mark combines internal marks,
mid-semester marks and end-semester marks.
Each one carries the following weightage (%):
Internal Mid-Sem End-Sem
30 20 50
Shakthi got 80 marks in internal, 75 marks in
mid-sem and 89 marks in end-sem. What
is his final mark?
Ans = (30×80 + 20×75 + 50×89) ÷ 100 = 8350 ÷ 100 = 83.5
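A short sketch of the weighted mean for this example follows. The dictionary keys are hypothetical labels chosen for illustration; the weights and marks are the ones given on the slide.

```python
# Weightage (%) of each component and the marks obtained.
weights = {"internal": 30, "mid_sem": 20, "end_sem": 50}
marks = {"internal": 80, "mid_sem": 75, "end_sem": 89}

# Weighted mean: sum of (weight x value) divided by the sum of the weights.
weighted_sum = sum(weights[k] * marks[k] for k in weights)
final_mark = weighted_sum / sum(weights.values())

print(final_mark)  # 83.5
```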
Frequency distribution Mean
Isabella went up and down the street to find out how many parking
spaces each house had. Here are her results:
Parking Spaces   Frequency
1                15
2                27
3                8
4                5
What is the mean number of parking spaces?
Mean = (15×1 + 27×2 + 8×3 + 5×4) / (15 + 27 + 8 + 5)
= (15 + 54 + 24 + 20) / 55 = 113 / 55
Mean ≈ 2.05
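The same frequency-table mean can be sketched in Python. The row for 4 parking spaces (frequency 5) is taken from the 5×4 term in the slide's formula; the variable names are illustrative.

```python
# Parking spaces per house -> number of houses with that many spaces.
frequency = {1: 15, 2: 27, 3: 8, 4: 5}

# Each value counts as many times as its frequency.
total_spaces = sum(spaces * count for spaces, count in frequency.items())
total_houses = sum(frequency.values())
mean_spaces = total_spaces / total_houses

print(total_spaces, total_houses)  # 113 55
print(round(mean_spaces, 2))       # 2.05
```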
Median
 The median is the value of the middle item of a
series when it is arranged in ascending or
descending order of magnitude.
 It divides the series into two halves; in one half
all items are less than the median, whereas in the
other half all items have values higher than the
median.
Example
 21, 18, 24, 19, 27
 Arrange the numbers in ascending order
 18, 19, 21, 24, 27 (21 is the median)
 If there are two middle numbers, take their mean:
 18, 19, 21, 25, 27, 28
 (21 + 25)/2 = 23
Mode
 The number that appears most frequently in a set
of numbers.
 It is useful in studies of popularity (popular shoe
size, popular cap, most demanded product, etc.).
 Arrange the numbers in order from least to
greatest.
 21, 18, 24, 19, 18
 Find the number that is repeated the most.
 18, 18, 19, 21, 24 (18 is the mode).
Geometric Mean
 It is defined as the nth root of the product of the values of the n items in
a given series.
 The most frequent application of this average is in the
determination of the average percent of change.
Example
 What is the geometric mean of 4, 6 and 9?
Ans: (4 × 6 × 9)^(1/3) = 216^(1/3) = 6
 Your investment earns 20% during the first year, but
then realizes a loss of 10% in year 2, and another
10% in year 3.
 Calculate a growth factor for each year: (1 + 0.2) for year 1,
(1 − 0.1) for year 2 and (1 − 0.1) for year 3.
 Multiply the 3 growth factors and take the 3rd root:
(1.2 × 0.9 × 0.9)^(1/3) = 0.972^(1/3) ≈ 0.990578.
 Thus, the average rate of change = 0.990578 − 1 = −0.009422.
 So your investment is losing roughly 0.9 percent every year.
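Both geometric-mean examples can be reproduced with a small helper. This is a sketch only; `geometric_mean_of` is a hypothetical name, and `statistics.geometric_mean` could be used instead.

```python
import math

def geometric_mean_of(values):
    """n-th root of the product of n positive values."""
    return math.prod(values) ** (1 / len(values))

gm_simple = geometric_mean_of([4, 6, 9])

# Growth factors for +20%, -10%, -10%.
growth_factors = [1 + 0.20, 1 - 0.10, 1 - 0.10]
avg_growth = geometric_mean_of(growth_factors)
avg_rate = avg_growth - 1  # average rate of change per year

print(round(gm_simple, 6))
print(round(avg_growth, 6), round(avg_rate, 6))
```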
Harmonic Mean
 It is defined as the reciprocal of the average of
the reciprocals of the values of the items of a series.
 It has limited application, mainly in cases where time
and rate are involved.
 It gives the largest weight to the smallest value and
the smallest weight to the largest value.
Example
What is the harmonic mean of 4, 5 and 10?
Ans: 3 / (1/4 + 1/5 + 1/10) = 3 / 0.55 ≈ 5.45
Example
 Suppose that your automobile trip has four 10 km
segments. You drive your
car:
 100 km/hr for the first 10 km
 110 km/hr for the second 10 km
 90 km/hr for the third 10 km
 120 km/hr for the fourth 10 km.
 What is your average speed for the whole trip?
 Ans: 4 / (1/100 + 1/110 + 1/90 + 1/120) ≈ 103.8 km/hr.
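The two harmonic-mean answers can be checked with the standard library, as in the sketch below. The second check recomputes the average speed as total distance over total time, which is why the harmonic mean is the right average for equal-length segments.

```python
from statistics import harmonic_mean

hm_simple = harmonic_mean([4, 5, 10])

# Equal 10 km segments driven at different speeds (km/hr).
speeds_kmh = [100, 110, 90, 120]
average_speed = harmonic_mean(speeds_kmh)

# Cross-check: total distance divided by total time.
segment_km = 10
total_time_hr = sum(segment_km / s for s in speeds_kmh)
check_speed = (segment_km * len(speeds_kmh)) / total_time_hr

print(round(hm_simple, 2))      # 5.45
print(round(average_speed, 1))  # 103.8
```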
Measure of Dispersion
 An average alone cannot reveal the entire story of a study. It
fails to show how the data are scattered
around the average.
 Measures of dispersion measure how spread out
a set of data is.
 Important measures of dispersion are
 Range
 Mean Deviation
 Standard deviation
Range
 It is the difference between the extreme values.
 Range = Highest value – Lowest value
 What is the range for the following data?
 2, 3, 1, 1, 0, 5, 3, 1, 2, 7, 4, 0, 2, 1, 2, 1, 6, 3, 2, 0,
0, 7, 4, 2, 1, 1, 2, 1, 3, 5, 12
 Range = 12 – 0 = 12
 Limitation: Its value is never stable, being based
on only two values of the variable.
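A minimal sketch of the range calculation for the data above:

```python
data = [2, 3, 1, 1, 0, 5, 3, 1, 2, 7, 4, 0, 2, 1, 2, 1, 6, 3, 2, 0,
        0, 7, 4, 2, 1, 1, 2, 1, 3, 5, 12]

# Range = highest value - lowest value.
highest, lowest = max(data), min(data)
data_range = highest - lowest

print(highest, lowest, data_range)  # 12 0 12
```

Dropping the single value 12 would change the range to 7, which illustrates the limitation noted above: the range depends on only two observations.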
Mean Deviation
 It is the average of the absolute differences of the values of items
from some average of the series.
 Find the mean deviation of 3, 6, 6, 7, 8, 11, 15, 16
 Find the mean: 72 ÷ 8 = 9
 Find the absolute distance of each value
from the mean: 6, 3, 3, 2, 1, 2, 6, 7
 Find the mean of those
distances: 30 ÷ 8
Ans = 3.75
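The three steps (mean, absolute distances, mean of the distances) can be sketched as follows. The deviation is taken about the arithmetic mean, as in the example.

```python
values = [3, 6, 6, 7, 8, 11, 15, 16]

# Step 1: the mean of the series.
mean_value = sum(values) / len(values)

# Step 2: absolute distance of each value from the mean.
distances = [abs(v - mean_value) for v in values]

# Step 3: the mean of those distances.
mean_deviation = sum(distances) / len(distances)

print(mean_value)      # 9.0
print(mean_deviation)  # 3.75
```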
Coefficient of mean deviation
 When the mean deviation is divided by the
average used in finding out the mean deviation
itself, the resulting quantity is described as the
coefficient of mean deviation.
 The coefficient of mean deviation is a relative
measure of dispersion and is comparable to
similar measures of other series.
Standard deviation
 It is the most widely used measure of dispersion of
a series and is commonly denoted by the
symbol ‘𝜎’ (pronounced as sigma).
Example
 Find the standard deviation of 53, 61, 49, 67,
55, 63.
Steps:
 Find the mean (58)
 Find each deviation from the mean, square it and sum the squares
(230)
 Divide the sum by the number of values, n = 6 (38.333; this is the variance)
 Take the square root (𝜎 ≈ 6.19)
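The steps can be sketched in Python as below, including the final square root that turns the variance into the standard deviation. The sketch divides by n (the population formula), matching the slide; dividing by n − 1 would give the sample standard deviation instead.

```python
import math

values = [53, 61, 49, 67, 55, 63]
n = len(values)

# Step 1: the mean.
mean_value = sum(values) / n

# Step 2: sum of squared deviations from the mean.
sum_sq = sum((v - mean_value) ** 2 for v in values)

# Step 3: divide by n to get the variance, then take the square root.
variance = sum_sq / n
sigma = math.sqrt(variance)

print(mean_value, sum_sq)  # 58.0 230.0
print(round(variance, 3), round(sigma, 3))
```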
Other terms related to 𝜎
 Variance: the square of the standard deviation.
 Coefficient of standard deviation: the
standard deviation divided by the mean of the data.
 Coefficient of variation: the coefficient
of standard deviation multiplied by 100.
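As an illustration, the three related quantities can be computed for the data set 53, 61, 49, 67, 55, 63 used in the standard deviation example. This is a sketch using the population formulas from the standard library.

```python
from statistics import fmean, pstdev, pvariance

values = [53, 61, 49, 67, 55, 63]

sigma = pstdev(values)        # population standard deviation
variance = pvariance(values)  # equals sigma squared

# Relative measures of dispersion.
coeff_sd = sigma / fmean(values)  # coefficient of standard deviation
coeff_variation = coeff_sd * 100  # coefficient of variation, in percent

print(round(variance, 3))
print(round(coeff_sd, 4), round(coeff_variation, 2))
```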
Distribution
 The pattern of outcomes of a variable; it tells
us what values the variable takes and how
often it takes these values.
 The distribution of data can be
found with the help of a histogram.
 A histogram is a bar chart of the counts in each class.
Steps in making Distribution
 Choose the classes by dividing the range of the data
into classes of equal width (each individual fits into
exactly one class).
 Count the individuals in each class (this is the
height of the bar).
 Draw the histogram:
 The horizontal axis is marked off into equal class
widths.
 The vertical axis contains the scale of counts.
Histogram
 The numbers of days of Maria’s last 15
vacations are listed below. Use the data to
make a frequency table with intervals.
4, 8, 6, 7, 5, 4, 10, 6, 7, 14, 12, 8, 10, 15, 12.
 Step 1: Identify the least and greatest values (4 and 15).
 Step 2: Divide the data into equal intervals.
 Step 3: List the intervals in the first column of
the table. Count the number of data values in
each interval and list the count in the last
column. Give the table a title.
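The three steps can be sketched in Python as follows. The class width of 3 days (intervals 4-6, 7-9, 10-12, 13-15) is an assumption made for illustration; the slides do not state which intervals were used.

```python
days = [4, 8, 6, 7, 5, 4, 10, 6, 7, 14, 12, 8, 10, 15, 12]

# Step 1: least and greatest values.
low, high = min(days), max(days)

# Step 2: equal intervals (assumed class width of 3 days).
width = 3

# Step 3: count the data values that fall in each interval.
frequency = {}
for start in range(low, high + 1, width):
    end = start + width - 1
    frequency[(start, end)] = sum(start <= d <= end for d in days)

# Text histogram: one '#' per vacation in the interval.
print("Length of Maria's vacations (days)")
for (start, end), count in frequency.items():
    print(f"{start:>2}-{end:<2} {'#' * count} ({count})")
```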
Normal Distribution
 The data are distributed
perfectly symmetrically.
 The curve is perfectly bell-shaped, in which case
mean X̄ = median M = mode Z.
Skewness
 If the curve is distorted (whether on the right
side or on the left side), we have an asymmetrical
distribution, which indicates that there is
skewness.
 If the curve is distorted on the right side, we
have positive skewness.
 If the curve is distorted on the left side, we
have negative skewness.
Measure of Skewness
 Skewness = X̄ − Z or 3(X̄ − M)
 Coefficient of Skewness = (X̄ − Z) / 𝜎
 If the skewness value is positive, negative or zero,
then the data are positively skewed, negatively
skewed or symmetrical, respectively.
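A minimal sketch of these measures, reusing the data 21, 18, 24, 19, 18 from the mode example (an illustrative choice, since the slides give no skewness data). Here X̄ is the mean, M the median, Z the mode, and 𝜎 is computed with the population formula.

```python
import math
from statistics import mean, median, mode

values = [21, 18, 24, 19, 18]

x_bar = mean(values)  # mean
m = median(values)    # median
z = mode(values)      # mode
sigma = math.sqrt(sum((v - x_bar) ** 2 for v in values) / len(values))

skew_mode = x_bar - z            # mean - mode
skew_median = 3 * (x_bar - m)    # 3 (mean - median)
coefficient = (x_bar - z) / sigma

print(skew_mode, skew_median)  # 2 3
print(round(coefficient, 3))   # positive, so the data are positively skewed
```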
Kurtosis
 Kurtosis is the measure of flat-toppedness or
peakedness of a curve.
 The normal curve is mesokurtic.
 A curve with positive kurtosis is leptokurtic (more peaked).
 A curve with negative kurtosis is platykurtic (flatter).
Measure of Relationship
 Univariate Analysis: The analysis is carried out
with the description of a single variable.
 Bivariate Analysis: The analysis of two
variables simultaneously.
 Multivariate Analysis: The analysis of multiple
variables simultaneously.
Measure of Relationship
 Correlation
 The word correlation is made of co- (meaning "together")
and relation.
 It answers the question: does an association or correlation exist between
the two (or more) variables?
 Regression
 It answers the question:
 Is there any cause and effect relationship between the two variables
in the case of a bivariate population, or between one variable on one
side and two or more variables on the other side in the case of a
multivariate population? If yes, of what degree and in which direction?
Karl Pearson’s coefficient of correlation
 It is the simplest and most widely used
method of measuring correlation. It is denoted by r.
 It is also known as the product moment
correlation coefficient.
 The value of r lies between −1 and +1.
r value and interpretation
We can also say that for a unit change in independent variable, if there happens
to be a constant change in the dependent variable in the same direction, then
correlation will be termed as perfect positive.
Examples
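Karl Pearson Correlation coefficient formula
r = Σ(xᵢ − x̄)(yᵢ − ȳ) / (n · 𝜎x · 𝜎y)
If we substitute the values of 𝜎x and 𝜎y and simplify, we get the
equation below:
r = Σ(xᵢ − x̄)(yᵢ − ȳ) / √[ Σ(xᵢ − x̄)² · Σ(yᵢ − ȳ)² ]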
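The product-moment coefficient can be sketched in Python as follows. The helper name `pearson_r` and both data sets are hypothetical and serve only to illustrate perfect positive and perfect negative correlation; they are not the figures from the slides.

```python
import math

def pearson_r(x, y):
    """Sum of cross-deviations divided by the root of the product of the squared-deviation sums."""
    x_bar = sum(x) / len(x)
    y_bar = sum(y) / len(y)
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    s_yy = sum((yi - y_bar) ** 2 for yi in y)
    return s_xy / math.sqrt(s_xx * s_yy)

# Hypothetical data: y rises by a constant 2 for every unit change in x.
x = [1, 2, 3, 4, 5]
y_up = [3, 5, 7, 9, 11]
y_down = [11, 9, 7, 5, 3]

r_positive = pearson_r(x, y_up)    # perfect positive correlation
r_negative = pearson_r(x, y_down)  # perfect negative correlation

print(r_positive, r_negative)
```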
Example
 A researcher wants to know the relationship between
advertisement expenditure and total sales. He took sample
data from 7 companies for one year.
Solution
Regression
 It is the study of the relationship between
variables.
 It is also used for prediction of the dependent
variable.
 Regression types
 Simple Regression: single explanatory/independent
variable
Regression Analysis
 Linear Regression: Straight-line relationship.
 Non-linear: Implies curved relationships.
 Regression amounts to finding the equation
of the line or curve that best fits the data.
Linear: y = mx + b
Linear Regression Equation
The unknown parameters a and b can be calculated by the
formulas below. Here b (also written m) is the slope of the line and a (also written c) is the intercept.
b = Σ[(xᵢ − x̄)(yᵢ − ȳ)] / Σ[(xᵢ − x̄)²]
a = ȳ − b · x̄
These formulas give the best-fit line by the least squares method.
Example for regression
 In the table below, the X column shows scores
on the aptitude test. Similarly, the Y column
shows statistics grades.
Student   X    Y
1         95   85
2         85   95
3         80   70
4         70   65
5         60   70
Regression
S.no   xi    yi    (xi − x̄)   (yi − ȳ)   (xi − x̄)²   (yi − ȳ)²   (xi − x̄)(yi − ȳ)
1      95    85    17         8          289         64          136
2      85    95    7          18         49          324         126
3      80    70    2          -7         4           49          -14
4      70    65    -8         -12        64          144         96
5      60    70    -18        -7         324         49          126
Sum    390   385                         730         630         470
Mean   78    77
b = 470/730 ≈ 0.644
a = ȳ − b · x̄ = 77 − 0.644 × 78 = 26.768
ŷ = 26.768 + 0.644x
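The least-squares calculation for this example can be sketched as below. Note that the code keeps b unrounded, so the intercept comes out at about 26.78 rather than the 26.768 obtained with b rounded to 0.644. The variable names are illustrative.

```python
x = [95, 85, 80, 70, 60]  # aptitude test scores
y = [85, 95, 70, 65, 70]  # statistics grades

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Sums of cross-deviations and squared deviations.
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
s_xx = sum((xi - x_bar) ** 2 for xi in x)

b = s_xy / s_xx        # slope
a = y_bar - b * x_bar  # intercept

print(s_xy, s_xx)                # 470.0 730.0
print(round(b, 3), round(a, 2))  # 0.644 26.78
```

With these coefficients, a predicted grade for any aptitude score is `a + b * score`.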
Standard error
 It provides an overall measure of how well the model
fits the data.
 It represents the average distance that the
observed values fall from the regression line.
 Smaller values are better because they indicate that
the observations are closer to the fitted line.
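Standard error
Each fitted value Ŷ is obtained by substituting X into ŷ = 26.768 + 0.644x.
X     Y     Ŷ        (Y − Ŷ)   (Y − Ŷ)²
95    85    87.948   -2.948    8.690704
85    95    81.508   13.492    182.034064
80    70    78.288   -8.288    68.690944
70    65    71.848   -6.848    46.895104
60    70    65.408   4.592     21.086464
Sum                            327.39728
Standard error = √(327.39728 / 5) ≈ 8.092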
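A sketch of the standard error calculation for the five (X, Y) pairs of the regression example follows. Each fitted value is computed from X, and the residual is the observed Y minus that fitted value. Dividing the squared residuals by n follows the slide's arithmetic; dividing by n − 2 is a common alternative and is shown as an assumption for comparison.

```python
import math

x = [95, 85, 80, 70, 60]
y = [85, 95, 70, 65, 70]
n = len(x)

x_bar = sum(x) / n
y_bar = sum(y) / n
b = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
     / sum((xi - x_bar) ** 2 for xi in x))
a = y_bar - b * x_bar

# Fitted values come from X; residuals are observed Y minus fitted Y.
fitted = [a + b * xi for xi in x]
residuals = [yi - fi for yi, fi in zip(y, fitted)]
sse = sum(r ** 2 for r in residuals)  # sum of squared residuals

std_error_n = math.sqrt(sse / n)         # divisor n, as on the slide
std_error_n2 = math.sqrt(sse / (n - 2))  # divisor n - 2 (assumed alternative)

print(round(sse, 3))
print(round(std_error_n, 3), round(std_error_n2, 3))
```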
Multiple regression
Multicollinearity: the situation in which the independent variables in a multiple regression are highly
correlated with one another.
Partial Correlation
 In simple correlation, we measure the strength of the linear
relationship between two variables, without taking into
consideration the fact that both these variables may be
influenced by a third variable.
 For example, when we study the correlation between price and demand,
we completely ignore the effect of other factors like money supply,
imports and exports, etc., which definitely have a bearing on the price.
 The correlation coefficient between two variables X1 and X2,
studied partially after eliminating the influence of the third
variable X3 from both of them, is the partial correlation
coefficient.
Other Measures
 Index number
 Indicator of average percentage change in a series of figures
where one figure (called the base) is assigned an arbitrary
value of 100, and other figures are adjusted in proportion to
the base.
 Time series analysis
 Unlike the analyses of random samples of observations that
are discussed in the context of most other statistics, the
analysis of time series is based on the assumption that
successive values in the data file represent consecutive
measurements taken at equally spaced time intervals.
Statistics in research

  • 1. STATISTICS IN RESEARCH (PART – 1 DESCRIPTIVE)
  • 2. Role of statistics in research
      - Designing research
      - Analyzing data
      - Drawing conclusions about the research
  • 3. Two major areas of statistics
      - Descriptive statistics
          - It is concerned with developing certain indices from the raw data.
          - It summarizes collected/classified data.
      - Inferential statistics
          - It generalizes from small groups (i.e., samples) to the population.
          - It is also known as sampling statistics.
          - It is concerned with two major problems:
              - Estimation of population parameters.
              - Testing of statistical hypotheses.
  • 4. Descriptive statistics
      - Measures of central tendency (statistical averages)
      - Measures of dispersion
      - Measures of asymmetry (skewness)
      - Measures of relationship
      - Other measures
  • 5. Measure of Central tendency
      - It is also known as a statistical average.
      - Mean, median and mode are the most popular averages.
      - Geometric and harmonic means are also sometimes used.
  • 6. Mean
      - It is also known as the arithmetic average.
      - The mean is found by adding all the values in the set, then dividing the sum by the number of values.
  • 7. Example
      - Find the mean of the following numbers: 8, 9, 10, 10, 10, 11, 11, 11, 12, 13
      - The mean is the usual average, with n = 10:
      - (8 + 9 + 10 + 10 + 10 + 11 + 11 + 11 + 12 + 13) ÷ 10 = 105 ÷ 10 = 10.5
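
The same calculation as a minimal Python sketch; the variable names are illustrative and not part of the slides.

```python
# Arithmetic mean: the sum of the values divided by how many values there are.
values = [8, 9, 10, 10, 10, 11, 11, 11, 12, 13]

n = len(values)
arithmetic_mean = sum(values) / n

print(n, sum(values), arithmetic_mean)  # 10 105 10.5
```
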
  • 8. Weighted Mean
      - The weighted average formula gives the average of a set of numbers that carry different weights: each value is multiplied by its weight, and the sum of these products is divided by the sum of the weights.
  • 9. Example
      - The final mark combines internal, mid-semester and end-semester marks, which carry the following weightage:

            Internal   Mid-Sem   End-Sem
               30         20        50

      - Shakthi scored 80 marks in the internal, 75 in the mid-semester and 89 in the end-semester exam. What is his final mark?
      - Ans = (30×80 + 20×75 + 50×89) ÷ (30 + 20 + 50) = 8350 ÷ 100 = 83.5
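
A short Python sketch of the weighted mean, checked against Shakthi's marks. The helper name `weighted_mean` is an assumption made for illustration.

```python
def weighted_mean(values, weights):
    """Sum of value * weight, divided by the sum of the weights."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


marks = [80, 75, 89]    # internal, mid-semester, end-semester
weights = [30, 20, 50]  # weightage of each component

final_mark = weighted_mean(marks, weights)
print(final_mark)  # 83.5
```
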
  • 10. Frequency distribution Mean
      - Isabella went up and down the street to find out how many parking spaces each house had. Here are her results:

            Parking spaces   Frequency
                  1              15
                  2              27
                  3               8
                  4               5

      - What is the mean number of parking spaces?
      - Mean = (15×1 + 27×2 + 8×3 + 5×4) ÷ (15 + 27 + 8 + 5) = (15 + 54 + 24 + 20) ÷ 55 = 113 ÷ 55 ≈ 2.05
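
The frequency-weighted mean can be sketched in Python as follows. The dictionary layout is an assumption for illustration; the 4-space row with frequency 5 is taken from the 5×4 term in the slide's own calculation.

```python
# Mean of a frequency distribution.
# Keys: parking spaces per house; values: number of houses with that many spaces.
frequency = {1: 15, 2: 27, 3: 8, 4: 5}

total_spaces = sum(spaces * count for spaces, count in frequency.items())
total_houses = sum(frequency.values())
mean_spaces = total_spaces / total_houses

print(total_spaces, total_houses, round(mean_spaces, 2))  # 113 55 2.05
```
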
  • 11. Median
      - The median is the value of the middle item of a series when it is arranged in ascending or descending order of magnitude.
      - It divides the series into two halves: in one half no item exceeds the median, and in the other half no item is below it.
  • 12. Example
      - 21, 18, 24, 19, 27
      - Arrange the numbers in ascending order: 18, 19, 21, 24, 27 (21 is the median)
      - If there are two middle numbers, take their mean: for 18, 19, 21, 25, 27, 28 the median is (21 + 25)/2 = 23
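
A minimal Python sketch of the two cases (odd and even number of items). The function name `median` is chosen here for illustration.

```python
def median(values):
    """Middle value of the sorted data; mean of the two middle values if n is even."""
    if not values:
        raise ValueError("median of an empty series is undefined")
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median([21, 18, 24, 19, 27]))      # 21
print(median([18, 19, 21, 25, 27, 28]))  # 23.0
```
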
  • 13. Mode
      - The number that appears most frequently in a set of numbers.
      - It is useful in studies of popularity (most popular shoe size, most popular cap, most demanded product, etc.).
      - Example: 21, 18, 24, 19, 18
      - Arrange the numbers in order from least to greatest: 18, 18, 19, 21, 24
      - Find the number that is repeated the most: 18 is the mode.
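
One way to sketch this in Python is with `collections.Counter`. The helper `modes` is hypothetical; it returns a list because a series can have more than one most-frequent value.

```python
from collections import Counter


def modes(values):
    """Return every value that occurs with the highest frequency."""
    counts = Counter(values)
    highest = max(counts.values())
    return sorted(v for v, c in counts.items() if c == highest)


print(modes([21, 18, 24, 19, 18]))  # [18]
```
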
  • 14. Geometric Mean
      - It is defined as the nth root of the product of the values of the n items in a given series.
      - Its most frequent application is in determining the average percent of change.
  • 15. Example
      - What is the geometric mean of 4, 6 and 9? Ans: (4 × 6 × 9)^(1/3) = 216^(1/3) = 6
      - Your investment earns 20% during the first year, but then loses 10% in year 2 and another 10% in year 3.
      - Calculate a growth factor for each year: (1 + 0.2) for year 1, (1 − 0.1) for year 2 and (1 − 0.1) for year 3.
      - Multiply the 3 growth factors and take the 3rd root: (1.2 × 0.9 × 0.9)^(1/3) = 0.972^(1/3) ≈ 0.990578
      - Thus, the average rate of change = 0.990578 − 1 = −0.009422.
      - So your investment loses roughly 0.9 percent every year.
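
Both examples can be reproduced with a small Python sketch. The function name is illustrative, and `math.prod` assumes Python 3.8 or later.

```python
import math


def geometric_mean(values):
    """nth root of the product of n positive values."""
    return math.prod(values) ** (1 / len(values))


print(round(geometric_mean([4, 6, 9]), 6))  # 6.0

# Growth factors for +20%, -10% and -10%.
growth_factors = [1 + 0.20, 1 - 0.10, 1 - 0.10]
average_rate = geometric_mean(growth_factors) - 1
print(round(average_rate, 6))  # -0.009422
```
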
  • 16. Harmonic Mean
      - It is defined as the reciprocal of the average of the reciprocals of the values of the items of a series.
      - It has limited application, mainly where time and rate are involved.
      - It gives the largest weight to the smallest value and the smallest weight to the largest value.
  • 17. Harmonic Mean
      - What is the harmonic mean of 4, 5 and 10? Ans: 3 ÷ (1/4 + 1/5 + 1/10) = 3 ÷ 0.55 ≈ 5.45
  • 18. Example
      - Suppose your automobile trip has four 10 km segments. You drive your car:
          - 100 km/hr for the first 10 km
          - 110 km/hr for the second 10 km
          - 90 km/hr for the third 10 km
          - 120 km/hr for the fourth 10 km
      - Because every segment covers the same distance, the average speed for the whole trip is the harmonic mean of the four speeds.
      - Ans: 103.8 km/hr.
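
A Python sketch of both harmonic-mean examples. It also checks the trip answer independently as total distance divided by total time; the names used are illustrative.

```python
def harmonic_mean(values):
    """Reciprocal of the mean of the reciprocals (values must be non-zero)."""
    return len(values) / sum(1 / v for v in values)


print(round(harmonic_mean([4, 5, 10]), 2))  # 5.45

speeds = [100, 110, 90, 120]  # km/hr, one speed per 10 km segment
segment_km = 10
total_time = sum(segment_km / s for s in speeds)       # hours
average_speed = segment_km * len(speeds) / total_time  # total distance / total time

print(round(harmonic_mean(speeds), 1), round(average_speed, 1))  # 103.8 103.8
```
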
  • 19. Measure of Dispersion
      - An average cannot reveal the entire story of a study: it fails to show how the data are distributed around the average.
      - Measures of dispersion measure how spread out a set of data is.
      - Important measures of dispersion are:
          - Range
          - Mean deviation
          - Standard deviation
  • 20. Range
      - It is the difference between the extreme values.
      - Range = Highest value − Lowest value
      - What is the range for the following data?
      - 2, 3, 1, 1, 0, 5, 3, 1, 2, 7, 4, 0, 2, 1, 2, 1, 6, 3, 2, 0, 0, 7, 4, 2, 1, 1, 2, 1, 3, 5, 12
      - Range = 12 − 0 = 12
      - Limitation: its value is never stable, being based on only two values of the variable.
  • 21. Mean Deviation
      - It is the average of the absolute differences of the values of items from some average of the series.
      - Find the mean deviation of 3, 6, 6, 7, 8, 11, 15, 16:
          - Find the mean: 72 ÷ 8 = 9
          - Find the absolute distance of each value from the mean: 6, 3, 3, 2, 1, 2, 6, 7
          - Find the mean of those distances: 30 ÷ 8 = 3.75
      - Ans = 3.75
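
The range and mean deviation examples, sketched in Python. The helper name `mean_deviation` is an assumption, and it measures deviations from the arithmetic mean, as in the worked example.

```python
data = [2, 3, 1, 1, 0, 5, 3, 1, 2, 7, 4, 0, 2, 1, 2, 1,
        6, 3, 2, 0, 0, 7, 4, 2, 1, 1, 2, 1, 3, 5, 12]
value_range = max(data) - min(data)  # highest value minus lowest value


def mean_deviation(values):
    """Average absolute distance of the values from their arithmetic mean."""
    center = sum(values) / len(values)
    return sum(abs(v - center) for v in values) / len(values)


md = mean_deviation([3, 6, 6, 7, 8, 11, 15, 16])
print(value_range, md)  # 12 3.75
```
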
  • 22. Coefficient of mean deviation
      - When the mean deviation is divided by the average used in finding it, the resulting quantity is called the coefficient of mean deviation.
      - It is a relative measure of dispersion and can be compared with the same measure for other series.
  • 23. Standard deviation
      - It is the most widely used measure of dispersion of a series and is commonly denoted by the symbol 'σ' (pronounced sigma).
  • 24. Standard deviation
      - Find the standard deviation of 53, 61, 49, 67, 55, 63. Steps:
          - Find the mean (58)
          - Find each deviation from the mean, square it, and sum the squares (230)
          - Divide the sum by the number of values (230 ÷ 6 ≈ 38.333); this is the variance
          - Take the square root (√38.333 ≈ 6.19); this is the standard deviation
  • 25. Other terms related to σ
      - Variance: the square of the standard deviation.
      - Coefficient of standard deviation: the standard deviation divided by the mean of the data.
      - Coefficient of variation: the coefficient of standard deviation multiplied by 100.
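
The steps and related terms can be traced in a short Python sketch. It divides by n, as the slide does; a sample standard deviation would divide by n − 1 instead. Variable names are illustrative.

```python
import math

data = [53, 61, 49, 67, 55, 63]
n = len(data)

mean = sum(data) / n                                # 58.0
sum_squares = sum((x - mean) ** 2 for x in data)    # 230.0
variance = sum_squares / n                          # divides by n, as in the slide
std_dev = math.sqrt(variance)

coeff_std_dev = std_dev / mean                      # coefficient of standard deviation
coeff_variation = coeff_std_dev * 100               # coefficient of variation

print(round(variance, 3), round(std_dev, 3), round(coeff_variation, 2))
# 38.333 6.191 10.67
```
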
  • 26. Distribution
      - The pattern of outcomes of a variable; it tells us what values the variable takes and how often it takes these values.
      - The distribution of data can be found with the help of a histogram.
      - A histogram is a bar chart of class frequencies.
  • 27. Steps in making Distribution
      - Choose the classes by dividing the range of the data into classes of equal width (each individual fits into exactly one class).
      - Count the individuals in each class (this is the height of the bar).
      - Draw the histogram: the horizontal axis is marked off into equal class widths; the vertical axis contains the scale of counts.
  • 28. Histogram
      - The numbers of days of Maria's last 15 vacations are listed below. Use the data to make a frequency table with intervals.
      - 4, 8, 6, 7, 5, 4, 10, 6, 7, 14, 12, 8, 10, 15, 12
      - Step 1: Identify the least and greatest values (4 and 15).
      - Step 2: Divide the data into equal intervals.
  • 29. Histogram
      - Step 3: List the intervals in the first column of the table. Count the number of data values in each interval and list the count in the last column. Give the table a title.
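
The three steps can be sketched in Python as below. The transcript does not show which intervals the slide used, so the class width of 3 days (4-6, 7-9, 10-12, 13-15) and the table title are assumptions made for illustration.

```python
days = [4, 8, 6, 7, 5, 4, 10, 6, 7, 14, 12, 8, 10, 15, 12]

# Step 1: least and greatest values.
low, high = min(days), max(days)

# Step 2: equal-width intervals (the width is an assumed choice).
width = 3

# Step 3: count the data values falling in each interval.
table = []
start = low
while start <= high:
    end = start + width - 1
    count = sum(start <= d <= end for d in days)
    table.append((start, end, count))
    start += width

print("Length of Maria's vacations (days)")  # hypothetical title
for start, end, count in table:
    print(f"{start:>2}-{end:<2}  {count}  {'#' * count}")
```

The `#` bars give a rough text histogram: each bar's length is the class count.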
  • 30. Normal Distribution
      - The distribution of the data is perfectly symmetrical.
      - It is a perfectly bell-shaped curve, in which case mean $\bar{X}$ = median $M$ = mode $Z$.
  • 31. Skewness
      - If the curve is distorted (whether on the right side or on the left side), we have an asymmetrical distribution, which indicates that there is skewness.
      - If the curve is distorted on the right side, we have positive skewness.
      - If the curve is distorted on the left side, we have negative skewness.
  • 33. Measure of Skewness
      - Skewness = $\bar{X} - Z$ or $3(\bar{X} - M)$
      - Coefficient of skewness = $(\bar{X} - Z) / \sigma$
      - If the skewness value is positive, negative or zero, the data are positively skewed, negatively skewed or symmetrical, respectively.
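
A hedged Python sketch of these measures. The slides give no skewness data, so the series from the mean deviation example (3, 6, 6, 7, 8, 11, 15, 16) is reused here as an assumption; σ is computed by dividing by n.

```python
import math
from collections import Counter

data = [3, 6, 6, 7, 8, 11, 15, 16]  # assumed data, reused for illustration
n = len(data)

mean = sum(data) / n                               # 9.0
ordered = sorted(data)
median = (ordered[n // 2 - 1] + ordered[n // 2]) / 2  # n is even here: 7.5
mode = Counter(data).most_common(1)[0][0]          # 6 (the only repeated value)
sigma = math.sqrt(sum((x - mean) ** 2 for x in data) / n)

skewness_mode = mean - mode            # X-bar minus Z
skewness_median = 3 * (mean - median)  # 3 * (X-bar minus M)
coefficient = skewness_mode / sigma    # (X-bar minus Z) / sigma

print(skewness_mode, skewness_median, round(coefficient, 3))  # 3.0 4.5 0.697
```

All three values are positive, so this series is positively skewed.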
  • 34. Kurtosis
      - Kurtosis is the measure of the flat-toppedness or peakedness of a curve.
      - A normal curve is mesokurtic.
      - Positive kurtosis is leptokurtic.
      - Negative kurtosis is platykurtic.
  • 35. Measure of Relationship
      - Univariate analysis: the analysis describes a single variable.
      - Bivariate analysis: the analysis of two variables simultaneously.
      - Multivariate analysis: the analysis of multiple variables simultaneously.
  • 36. Measure of Relationship
      - Correlation
          - The word correlation is made of co- (meaning "together") and relation.
          - It answers the question: does there exist an association or correlation between the two (or more) variables?
      - Regression
          - It answers the question: is there any cause-and-effect relationship between the two variables in the case of a bivariate population, or between one variable on one side and two or more variables on the other side in the case of a multivariate population? If yes, of what degree and in which direction?
  • 37. Karl Pearson's coefficient of correlation
      - It is the simple correlation coefficient and the most widely used method. It is denoted by r.
      - It is also known as the product-moment correlation coefficient.
      - The value of r lies between −1 and +1.
  • 38. r value and interpretation
      - For a unit change in the independent variable, if there is a constant change in the dependent variable in the same direction, the correlation is termed perfect positive (r = +1).
  • 40. Karl Pearson Correlation coefficient formula
      - Substituting the expressions for the standard deviations (σ) into the formula for r and simplifying gives an equivalent computational form.
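
The formulas themselves are not reproduced in this transcript. The standard product-moment definition, which is presumably what the slide showed, and the form obtained after substituting the standard deviations are given below; the notation ($x_i$, $y_i$, $\bar{x}$, $\bar{y}$, $\sigma_x$, $\sigma_y$, $n$) is an assumption chosen to match the regression slides.

```latex
r = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{n\,\sigma_x\,\sigma_y},
\qquad
\sigma_x = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n}},
\quad
\sigma_y = \sqrt{\frac{\sum_{i=1}^{n} (y_i - \bar{y})^2}{n}}

r = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}
         {\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2 \;\sum_{i=1}^{n} (y_i - \bar{y})^2}}
```

The factor of n cancels in the second form, so r can be computed directly from the sums of squared deviations and cross-products.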
  • 41. Example
      - A researcher wants to know the relation between advertisement expenditure and total sales. He took sample data from 7 companies for one year.
  • 44. Regression
      - It is the study of the relationship between variables.
      - It is also used to predict the dependent variable.
      - Regression types
          - Simple regression: a single explanatory (independent) variable
  • 45. Regression Analysis
      - Linear regression: a straight-line relationship, e.g. y = mx + c.
      - Non-linear regression: a curved relationship.
      - Regression amounts to finding the equation of the line or curve that best describes the relationship.
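  • 46. Linear Regression Equation
      - For the line $\hat{y} = a + bx$, the unknown parameters a and b can be calculated with the formulas below, where b (= m) is the slope of the line and a (= c) is the intercept.
      - $b = \dfrac{\sum_i (x_i - \bar{x})(y_i - \bar{y})}{\sum_i (x_i - \bar{x})^2}$
      - $a = \bar{y} - b\,\bar{x}$
      - These formulas give the best-fit line by the least-squares method.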
  • 47. Example for regression
      - In the table below, the X column shows scores on the aptitude test and the Y column shows statistics grades.

            Student    X     Y
               1      95    85
               2      85    95
               3      80    70
               4      70    65
               5      60    70

  • 48. Regression

            S.no    xi    yi   (xi − x̄)  (yi − ȳ)  (xi − x̄)²  (yi − ȳ)²  (xi − x̄)(yi − ȳ)
              1     95    85      17         8        289         64          136
              2     85    95       7        18         49        324          126
              3     80    70       2        −7          4         49          −14
              4     70    65      −8       −12         64        144           96
              5     60    70     −18        −7        324         49          126
            Sum    390   385                          730        630          470
            Mean    78    77

      - b = 470/730 = 0.644
      - a = 77 − 0.644 × 78 = 26.768
      - ŷ = 26.768 + 0.644x
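
The table can be reproduced with a short Python sketch of the least-squares formulas. Variable names are illustrative. Note that the sketch keeps b unrounded, which gives a ≈ 26.781; the slide's 26.768 comes from rounding b to 0.644 before computing a. The correlation coefficient r is computed from the same sums as an extra check and is not stated on the slide.

```python
import math

x = [95, 85, 80, 70, 60]  # aptitude test scores
y = [85, 95, 70, 65, 70]  # statistics grades
n = len(x)

x_bar = sum(x) / n  # 78.0
y_bar = sum(y) / n  # 77.0

s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))  # 470.0
s_xx = sum((xi - x_bar) ** 2 for xi in x)                        # 730.0
s_yy = sum((yi - y_bar) ** 2 for yi in y)                        # 630.0

b = s_xy / s_xx          # slope
a = y_bar - b * x_bar    # intercept
r = s_xy / math.sqrt(s_xx * s_yy)  # Pearson correlation from the same sums

print(round(b, 3), round(a, 3), round(r, 3))  # 0.644 26.781 0.693
```
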
  • 49. Standard error
      - It provides an overall measure of how well the model fits the data.
      - It represents the average distance that the observed values fall from the regression line.
      - Smaller values are better because they indicate that the observations are closer to the fitted line.
  • 50. Standard error
      - The predicted values Ŷ come from ŷ = 26.768 + 0.644x, evaluated at each student's aptitude score X.

             X     Y       Ŷ      (Y − Ŷ)   (Y − Ŷ)²
            95    85    87.948    −2.948      8.691
            85    95    81.508    13.492    182.034
            80    70    78.288    −8.288     68.691
            70    65    71.848    −6.848     46.895
            60    70    65.408     4.592     21.086
            Sum                             327.397

      - Standard error = √(327.397 ÷ 5) ≈ 8.092. Dividing by n − 2 = 3 instead, as is usual for the standard error of estimate, gives ≈ 10.447.
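
A Python sketch of the residual calculation, under two stated assumptions: the line is refitted from the data with unrounded coefficients, and both denominators (n and n − 2) are shown because the transcript does not say which one is intended. The helper name `fit_line` is hypothetical.

```python
import math

x = [95, 85, 80, 70, 60]  # aptitude test scores
y = [85, 95, 70, 65, 70]  # statistics grades
n = len(x)


def fit_line(xs, ys):
    """Least-squares intercept a and slope b for y-hat = a + b * x."""
    x_bar = sum(xs) / len(xs)
    y_bar = sum(ys) / len(ys)
    b = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(xs, ys))
         / sum((xi - x_bar) ** 2 for xi in xs))
    return y_bar - b * x_bar, b


a, b = fit_line(x, y)
residuals = [yi - (a + b * xi) for xi, yi in zip(x, y)]  # observed minus predicted
sse = sum(e ** 2 for e in residuals)                     # sum of squared residuals

se_n = math.sqrt(sse / n)               # root mean squared residual
se_n_minus_2 = math.sqrt(sse / (n - 2))  # usual standard error of estimate

print(round(sse, 1), round(se_n, 2), round(se_n_minus_2, 2))  # 327.4 8.09 10.45
```
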
  • 51. Multiple regression
      - Multicollinearity: the independent variables in a multiple regression are highly correlated with one another.
  • 52. Partial Correlation
      - In simple correlation, we measure the strength of the linear relationship between two variables without taking into consideration the fact that both variables may be influenced by a third variable.
      - For example, when we study the correlation between price and demand, we completely ignore the effect of other factors such as money supply, imports and exports, which definitely have a bearing on the price.
      - The correlation coefficient between two variables X1 and X2, studied after eliminating the influence of a third variable X3 from both of them, is the partial correlation coefficient.
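
The slide gives no formula, so the sketch below uses the standard first-order partial correlation formula, r12.3 = (r12 − r13·r23) / √((1 − r13²)(1 − r23²)). The function name and the three input correlations are hypothetical values chosen only for illustration.

```python
import math


def partial_correlation(r12, r13, r23):
    """Correlation of X1 and X2 after removing the linear influence of X3."""
    return (r12 - r13 * r23) / math.sqrt((1 - r13 ** 2) * (1 - r23 ** 2))


# Hypothetical simple correlations, for illustration only.
r12, r13, r23 = 0.8, 0.6, 0.5
print(round(partial_correlation(r12, r13, r23), 3))  # 0.722
```

Here the partial correlation (0.722) is lower than the simple correlation r12 (0.8), because part of the association between X1 and X2 is attributed to X3.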
  • 53. Other Measures
      - Index number
          - An indicator of the average percentage change in a series of figures, where one figure (called the base) is assigned an arbitrary value of 100 and the other figures are adjusted in proportion to the base.
      - Time series analysis
          - Unlike the analyses of random samples of observations discussed in the context of most other statistics, the analysis of time series is based on the assumption that successive values in the data file represent consecutive measurements taken at equally spaced time intervals.