SlideShare a Scribd company logo
Chapter-1 Introduction to Statistics.pdf
Chapter –1
Introduction to
Statistics
 Statistics is concerned with scientific methods for
collecting, organizing, summarizing, presenting and
analyzing data as well as deriving the valid
conclusion and making reasonable decision on the
basis of this analysis.
 The word ‘ Statistics’ is used to refer to
- Numerical facts, such as the number of people
living in particular area.
-The study of ways of collecting, analyzing and
interpreting the facts.
Statistics can be classified into two parts
I. Theoretical Statistics or Mathematical Statistics
II. Applied Statistics
Theoretical or Mathematical Statistics
It is application of mathematics to Statistics. It
involves collection of facts and analysis of facts using
mathematical techniques. It is divided into two parts;
a) Descriptive Statistics
b) Inferential Statistics
 Statistics which deals with collection of data,
presentation of data using table, diagram, graph
etc and summarize data using measures of central
tendency and measures of variability (spread).
 Hence summarized results is obtained from
descriptive statistics which can describe the data
but can not be used to generalized.
Statistics which deals with sample selection from
population and statistical techniques used to draw
the valid conclusion about population on the basis of
statistical measures obtained from sample.
With the help of inferential statistics, one can
generalized the results for the whole population by
computing statistical measures from the sample.
Hence, inferential statistics possess the
generalizability capacity and it plays a vital role in
statistics.
 Statistics which deals with the application of
statistical methods to specific problem is called
applied statistics.
 The statistical tools and methods are used in
order to solve many more practical problems in
diversified area like computer science,
information technology, physics, biology,
astronomy, sociology, psychology, business,
economics and so on.
Despite its power, essential usefulness and
universal applicability statistics has its own
limitation. Some of these limitations are as
follows:
I. Statistics deal with groups and aggregates
only.
II. Statistics does not deal with qualitative
characteristics directly.
III. If sufficient care is not exercised in collecting,
analyzing and interpreting the data, statistical
results might be misleading.
IV. Only a person who has an expert knowledge of
statistics can handle statistical data efficiently.
V. Some errors are possible in statistical decisions.
In particular, inferential statistics involves
certain errors. We do not know whether an error
has been committed or not.
Chapter-1 Introduction to Statistics.pdf
 Measurement is a process of assigning numbers or
symbols to any facts or objects or products or items
according to some rule.
 Scale is simply a range of levels or numbers used for
measuring something.
 Different measurements scales are used on the basis
of nature of the data. There are generally four type
of measurement scales, which are as follows;
I. Nominal Scale
II. Ordinal Scale
III. Interval Scale
IV. Ratio Scale
Nominal Scale
 It is the lowest level of measurement scale. It is simply a
system of assigning numbers or the symbols to objects or
events to distinguish one from another.
 The symbols or the numbers have no numerical meaning
so that the arithmetic operations can not be used for these
numbers.
 Categorical data are generally measured on nominal scale.
 For example ; gender, religion, occupation are measured in
nominal scale. If we use 1 for male and 2 for female for
measuring gender, then 1 and 2 have no numeric meaning.
Ordinal Scale
 When the quantification of observation is done by
ranking based on the certain criterion or priorities or
importance, then they are said to be measured on an
ordinal scale.
 It represents the qualitative values in ascending or
descending order.
 For example, symptoms of depression from
psychiatric assessment such as none is coded by ‘0’,
mild ‘1’, moderate’2’, severe’3’. These numbers are
known as ranks.
Interval Scale
Some measurement scales possess a constant interval
size, they are called interval scale.
 It assumes data have equal intervals.
 This scale does not have absolute zero but only
arbitrary zero.
 Scale of temperature is an example of ordinal scale.
Ratio Scale
Ratio scale is the ideal scale and extended form of the
interval scale. It is the most powerful scale of
measurement.
 It possesses the characteristics of nominal, ordinal
and interval scale.
 Ratio scale has an absolute zero or true zero point
that indicates the completely absence of that property
of an object what is being measured.
 For example, length, weight, age, income, sales etc
are measured in ratio scale.
 In statistics, variable may be defined as an attributes that
describe person, place, thing or idea under study. And is
denoted by capital letters like X, Y, Z etc.
 It is called a variable because the value may vary
between data units in a population, and may change in
value over time.
 For example, age, height, weight of persons life time of
any electronic appliance, time to download any image
file through internet etc.
 Generally, variables is classified into two types;
I. Qualitative variables
II. Quantitative Variable
Qualitative Variable
The variable which varies in kind rather than in
magnitude is called qualitative variable.
 It can be divided into different categories. It is
also called categorical variable.
 Qualitative variable are presented in nominal and
ordinal scales.
 For example, hair color, eye color, gender,
smoking habit, etc.
Quantitative Variable
The variable which varies in magnitude and can be
expressed numerically is called quantitative data.
 Quantitative variables are presents in interval and
ratio scale.
 Quantitative variable can be further divided into
two types;
I. Discrete Variable
II. Continuous Variable
I. Discrete Variable : A variable is said to be
discrete if it takes only whole number. For
example no. of girls in a class, family size, etc.
II. Continuous Variable : A variable is said to be
continuous if it takes all possible real values(
whole number as well as fractional values) within
a certain range. For example, height, weight, age,
temperature etc.
In any statistical investigation first approach is to
collect data. It is a set of values obtained on one or
more characters under study. It is the collected
information which is ready to use for statistical
analysis.
There are mainly two types of data on the basis of
collection procedures;
I. Primary data
II. Secondary data
Primary Data :
The data which are originally collected by
investigator or researcher for the first time with the
purpose of statistical inquiry is called primary data.
It is collected by government, an individual,
institution and research bodies. It needs more fund,
time and manpower.
Following are the method of collecting primary
data;
Direct personal interview method.
Indirect personal interview method.
Information through correspondence.
Mailed questionnaire method.
Schedule through enumerators.
Secondary Data :
The data that has been already collected for a
particular purpose and used for next purpose is called
secondary data.
When investigators find impracticable to collect
firsthand information on related issues, secondary
data is used. It saves time, money and manpower.
Source of Secondary data :
I. Published Source
II. Unpublished Source
Published Sources :
Different published source of secondary data are national organizations
and international agencies.
– International agencies such as WHO, World bank, International Labor
Organization.
– Governmental organizations such as Central Bureau of Statistics,
Ministry of Commerce and Industry, Nepal Rastra Bank, Ministry of
Finance.
– Semi-governmental organizations such as Nepal Food Corporation,
Nepal Electricity Authority.
– Private organizations such as Nepal Chamber of Commerce,
Federation of Nepal Chamber of Commerce and Industry,
Publications.
Unpublished Sources :
– Records maintained by government offices.
– Records maintained by research institutions,
research scholars etc.
– Records updated by the departments, institutions for
their internal purpose.
Cross- Sectional data refers to data collected by
observing many subjects at the one point or period
of time.
It is a snapshot of observation at a particular point.
For example; Population of women in census year
2068.
The data which can be recorded over different periods
of time is called time series data. In this case same
measurements are recorded on regular basis.
For example; population of Nepal in census year
2048, 2058, 2068.
The data of each unit is recorded for each follow up
time till the occurrence of an event or till the unit fails
is called failure time data.
It is also called time to event data.
Time to event data is mostly found in life time
analysis of different parts of computers, different
software, different industrial products, life time of
infrastructure, clinical studies, etc.
It is a dataset in which the behaviors of entities are
observed across time.
These entities could be individuals, states,
companies, institutions, Countries, etc.
Panel data is also known as longitudinal or cross-
sectional data.
Data of individual is recorded repeatedly over
number of years.
Income of persons X and Y in years 2014, 2015&
2016 according to age and qualification.
A population can be defined as an aggregate
observation of subjects grouped together by a
common feature.
 Population is the entire pool from which a
statistical sample is drawn.
 Census survey is conducted to enumerate all the
population units.
 Based on the number of individuals belonging to the
group, population can be divided into two types;
I. Finite population
II. Infinite Population
Based on the type of individuals in population,
population can be divided into two types
I. Homogeneous Population
Population consisting of individuals of same type
is called homogeneous population.
II. Heterogeneous Population
Population consisting of individuals of different
type is called heterogeneous population.

More Related Content

What's hot (20)

PDF
diagrammatic and graphical representation of data
Varun Prem Varu
 
PPTX
Measurement Scales in Research
Dr. Sarita Anand
 
PPTX
Diagrams
Fousiya O P
 
PPTX
Correlation (theory)
Pandidurai P
 
PPTX
Tabulation
Dodiya Nikunj
 
PPTX
Business statistics
muthukrishnaveni anand
 
PPTX
Tabulation
Pranav Krishna
 
PPT
Meaning and uses of statistics
RekhaChoudhary24
 
PPT
Measures of central tendency
Alex Chris
 
PPTX
Advantages and Limitations for Diagrams and Graphs
Hardik Bhaavani
 
PPTX
Type of data
Amit Sharma
 
PPTX
Segment reporting ppt
Rani Padmini
 
PPTX
Methods of data presention
Dr Vaibhav Gupta
 
PPTX
Understanding the graphical representation of data in research
DrShalooSaini
 
PPTX
Measurement in research
Bikram Pradhan
 
PPTX
Tabulation of data
RekhaChoudhary24
 
PPT
SAMPLING AND SAMPLING ERRORS
rambhu21
 
PDF
Introduction to Business Statistics
SOMASUNDARAM T
 
diagrammatic and graphical representation of data
Varun Prem Varu
 
Measurement Scales in Research
Dr. Sarita Anand
 
Diagrams
Fousiya O P
 
Correlation (theory)
Pandidurai P
 
Tabulation
Dodiya Nikunj
 
Business statistics
muthukrishnaveni anand
 
Tabulation
Pranav Krishna
 
Meaning and uses of statistics
RekhaChoudhary24
 
Measures of central tendency
Alex Chris
 
Advantages and Limitations for Diagrams and Graphs
Hardik Bhaavani
 
Type of data
Amit Sharma
 
Segment reporting ppt
Rani Padmini
 
Methods of data presention
Dr Vaibhav Gupta
 
Understanding the graphical representation of data in research
DrShalooSaini
 
Measurement in research
Bikram Pradhan
 
Tabulation of data
RekhaChoudhary24
 
SAMPLING AND SAMPLING ERRORS
rambhu21
 
Introduction to Business Statistics
SOMASUNDARAM T
 

Similar to Chapter-1 Introduction to Statistics.pdf (20)

PPTX
INTRODUCTION TO BIOSTATISTICS
BismahKhan21
 
PPTX
statistics chp 1&2.pptx statistics in veterinary
ayeleasefa2
 
PPT
Introduction to statistics
Shaamma(Simi_ch) Fiverr
 
PPT
Intro_BiostatPG.ppt
victor431494
 
PPTX
Bio Statistics.pptx by Dr.REVATHI SIVAKUMAR
Dr.REVATHI SIVAKUMAR
 
PPTX
543957106-Introduction-Basic-Concepts-in-Statistics-PPT - Copy.pptx
ssuser46ca42
 
PPTX
Statistics and prob.
Emmanuel Alimpolos
 
PPTX
Statistical techniques for interpreting and reporting quantitative data i
Vijayalakshmi Murugesan
 
PPTX
Sampling-A compact study of different types of sample
Asith Paul.K
 
PPT
Introduction-To-Statistics-18032022-010747pm (1).ppt
Israr36
 
PPTX
BUSINESS STATISTICS AND PROBABILITY Chapter 1 By Arbaminich University
kedirh150
 
PPTX
Statistics.pptx
lavanya209529
 
PDF
CH1.pdf
mekuannintdemeke
 
PPTX
Introduction to statistics.pptx
Unfold1
 
PPTX
01 Introduction (1).pptx
BAVAHRNIAPSUBRAMANIA
 
PPTX
Medical Statistics.pptx
Siddanna B Chougala C
 
PPTX
introduction to statistics
mirabubakar1
 
PPTX
Stat-Lesson.pptx
JennilynFeliciano2
 
PDF
Introduction.pdf
MuhammadFaizan389
 
INTRODUCTION TO BIOSTATISTICS
BismahKhan21
 
statistics chp 1&2.pptx statistics in veterinary
ayeleasefa2
 
Introduction to statistics
Shaamma(Simi_ch) Fiverr
 
Intro_BiostatPG.ppt
victor431494
 
Bio Statistics.pptx by Dr.REVATHI SIVAKUMAR
Dr.REVATHI SIVAKUMAR
 
543957106-Introduction-Basic-Concepts-in-Statistics-PPT - Copy.pptx
ssuser46ca42
 
Statistics and prob.
Emmanuel Alimpolos
 
Statistical techniques for interpreting and reporting quantitative data i
Vijayalakshmi Murugesan
 
Sampling-A compact study of different types of sample
Asith Paul.K
 
Introduction-To-Statistics-18032022-010747pm (1).ppt
Israr36
 
BUSINESS STATISTICS AND PROBABILITY Chapter 1 By Arbaminich University
kedirh150
 
Statistics.pptx
lavanya209529
 
Introduction to statistics.pptx
Unfold1
 
01 Introduction (1).pptx
BAVAHRNIAPSUBRAMANIA
 
Medical Statistics.pptx
Siddanna B Chougala C
 
introduction to statistics
mirabubakar1
 
Stat-Lesson.pptx
JennilynFeliciano2
 
Introduction.pdf
MuhammadFaizan389
 
Ad

Recently uploaded (20)

PPTX
DATA-COLLECTION METHODS, TYPES AND SOURCES
biggdaad011
 
PPTX
Slide studies GC- CRC - PC - HNC baru.pptx
LLen8
 
PPTX
This PowerPoint presentation titled "Data Visualization: Turning Data into In...
HemaDivyaKantamaneni
 
PDF
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
PPTX
apidays Munich 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (Aavista Oy)
apidays
 
PDF
Basotho Satisfaction with Electricity(Statspack)
KatlehoMefane
 
PPTX
fashion industry boom.pptx an economics project
TGMPandeyji
 
PPTX
Introduction to Artificial Intelligence.pptx
StarToon1
 
PDF
How to Avoid 7 Costly Mainframe Migration Mistakes
JP Infra Pvt Ltd
 
PPTX
GLOBAL_Gender-module-5_committing-equity-responsive-budget.pptx
rashmisahu90
 
PPT
dsaaaaaaaaaaaaaaaaaaaaaaaaaaaaaasassas2.ppt
UzairAfzal13
 
PDF
apidays Munich 2025 - Automating Operations Without Reinventing the Wheel, Ma...
apidays
 
DOCX
Online Delivery Restaurant idea and analyst the data
sejalsengar2323
 
PDF
The X-Press God-WPS Office.pdf hdhdhdhdhd
ramifatoh4
 
PPT
Data base management system Transactions.ppt
gandhamcharan2006
 
PPTX
Data Analysis for Business - make informed decisions, optimize performance, a...
Slidescope
 
PPTX
原版定制AIM毕业证(澳大利亚音乐学院毕业证书)成绩单底纹防伪如何办理
Taqyea
 
PPTX
Pre-Interrogation_Assessment_Presentation.pptx
anjukumari94314
 
PPTX
GEN CHEM ACCURACY AND PRECISION eme.pptx
yeagere932
 
PDF
Introduction to Data Science_Washington_
StarToon1
 
DATA-COLLECTION METHODS, TYPES AND SOURCES
biggdaad011
 
Slide studies GC- CRC - PC - HNC baru.pptx
LLen8
 
This PowerPoint presentation titled "Data Visualization: Turning Data into In...
HemaDivyaKantamaneni
 
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
apidays Munich 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (Aavista Oy)
apidays
 
Basotho Satisfaction with Electricity(Statspack)
KatlehoMefane
 
fashion industry boom.pptx an economics project
TGMPandeyji
 
Introduction to Artificial Intelligence.pptx
StarToon1
 
How to Avoid 7 Costly Mainframe Migration Mistakes
JP Infra Pvt Ltd
 
GLOBAL_Gender-module-5_committing-equity-responsive-budget.pptx
rashmisahu90
 
dsaaaaaaaaaaaaaaaaaaaaaaaaaaaaaasassas2.ppt
UzairAfzal13
 
apidays Munich 2025 - Automating Operations Without Reinventing the Wheel, Ma...
apidays
 
Online Delivery Restaurant idea and analyst the data
sejalsengar2323
 
The X-Press God-WPS Office.pdf hdhdhdhdhd
ramifatoh4
 
Data base management system Transactions.ppt
gandhamcharan2006
 
Data Analysis for Business - make informed decisions, optimize performance, a...
Slidescope
 
原版定制AIM毕业证(澳大利亚音乐学院毕业证书)成绩单底纹防伪如何办理
Taqyea
 
Pre-Interrogation_Assessment_Presentation.pptx
anjukumari94314
 
GEN CHEM ACCURACY AND PRECISION eme.pptx
yeagere932
 
Introduction to Data Science_Washington_
StarToon1
 
Ad

Chapter-1 Introduction to Statistics.pdf

  • 3.  Statistics is concerned with scientific methods for collecting, organizing, summarizing, presenting and analyzing data as well as deriving the valid conclusion and making reasonable decision on the basis of this analysis.  The word ‘ Statistics’ is used to refer to - Numerical facts, such as the number of people living in particular area. -The study of ways of collecting, analyzing and interpreting the facts.
  • 4. Statistics can be classified into two parts I. Theoretical Statistics or Mathematical Statistics II. Applied Statistics Theoretical or Mathematical Statistics It is application of mathematics to Statistics. It involves collection of facts and analysis of facts using mathematical techniques. It is divided into two parts; a) Descriptive Statistics b) Inferential Statistics
  • 5.  Statistics which deals with collection of data, presentation of data using table, diagram, graph etc and summarize data using measures of central tendency and measures of variability (spread).  Hence summarized results is obtained from descriptive statistics which can describe the data but can not be used to generalized.
  • 6. Statistics which deals with sample selection from population and statistical techniques used to draw the valid conclusion about population on the basis of statistical measures obtained from sample. With the help of inferential statistics, one can generalized the results for the whole population by computing statistical measures from the sample. Hence, inferential statistics possess the generalizability capacity and it plays a vital role in statistics.
  • 7.  Statistics which deals with the application of statistical methods to specific problem is called applied statistics.  The statistical tools and methods are used in order to solve many more practical problems in diversified area like computer science, information technology, physics, biology, astronomy, sociology, psychology, business, economics and so on.
  • 8. Despite its power, essential usefulness and universal applicability statistics has its own limitation. Some of these limitations are as follows: I. Statistics deal with groups and aggregates only. II. Statistics does not deal with qualitative characteristics directly. III. If sufficient care is not exercised in collecting, analyzing and interpreting the data, statistical results might be misleading.
  • 9. IV. Only a person who has an expert knowledge of statistics can handle statistical data efficiently. V. Some errors are possible in statistical decisions. In particular, inferential statistics involves certain errors. We do not know whether an error has been committed or not.
  • 11.  Measurement is a process of assigning numbers or symbols to any facts or objects or products or items according to some rule.  Scale is simply a range of levels or numbers used for measuring something.  Different measurements scales are used on the basis of nature of the data. There are generally four type of measurement scales, which are as follows; I. Nominal Scale II. Ordinal Scale III. Interval Scale IV. Ratio Scale
  • 12. Nominal Scale  It is the lowest level of measurement scale. It is simply a system of assigning numbers or the symbols to objects or events to distinguish one from another.  The symbols or the numbers have no numerical meaning so that the arithmetic operations can not be used for these numbers.  Categorical data are generally measured on nominal scale.  For example ; gender, religion, occupation are measured in nominal scale. If we use 1 for male and 2 for female for measuring gender, then 1 and 2 have no numeric meaning.
  • 13. Ordinal Scale  When the quantification of observation is done by ranking based on the certain criterion or priorities or importance, then they are said to be measured on an ordinal scale.  It represents the qualitative values in ascending or descending order.  For example, symptoms of depression from psychiatric assessment such as none is coded by ‘0’, mild ‘1’, moderate’2’, severe’3’. These numbers are known as ranks.
  • 14. Interval Scale Some measurement scales possess a constant interval size, they are called interval scale.  It assumes data have equal intervals.  This scale does not have absolute zero but only arbitrary zero.  Scale of temperature is an example of ordinal scale.
  • 15. Ratio Scale Ratio scale is the ideal scale and extended form of the interval scale. It is the most powerful scale of measurement.  It possesses the characteristics of nominal, ordinal and interval scale.  Ratio scale has an absolute zero or true zero point that indicates the completely absence of that property of an object what is being measured.  For example, length, weight, age, income, sales etc are measured in ratio scale.
  • 16.  In statistics, variable may be defined as an attributes that describe person, place, thing or idea under study. And is denoted by capital letters like X, Y, Z etc.  It is called a variable because the value may vary between data units in a population, and may change in value over time.  For example, age, height, weight of persons life time of any electronic appliance, time to download any image file through internet etc.  Generally, variables is classified into two types; I. Qualitative variables II. Quantitative Variable
  • 17. Qualitative Variable The variable which varies in kind rather than in magnitude is called qualitative variable.  It can be divided into different categories. It is also called categorical variable.  Qualitative variable are presented in nominal and ordinal scales.  For example, hair color, eye color, gender, smoking habit, etc.
  • 18. Quantitative Variable The variable which varies in magnitude and can be expressed numerically is called quantitative data.  Quantitative variables are presents in interval and ratio scale.  Quantitative variable can be further divided into two types; I. Discrete Variable II. Continuous Variable
  • 19. I. Discrete Variable : A variable is said to be discrete if it takes only whole number. For example no. of girls in a class, family size, etc. II. Continuous Variable : A variable is said to be continuous if it takes all possible real values( whole number as well as fractional values) within a certain range. For example, height, weight, age, temperature etc.
  • 20. In any statistical investigation first approach is to collect data. It is a set of values obtained on one or more characters under study. It is the collected information which is ready to use for statistical analysis. There are mainly two types of data on the basis of collection procedures; I. Primary data II. Secondary data
  • 21. Primary Data : The data which are originally collected by investigator or researcher for the first time with the purpose of statistical inquiry is called primary data. It is collected by government, an individual, institution and research bodies. It needs more fund, time and manpower.
  • 22. Following are the method of collecting primary data; Direct personal interview method. Indirect personal interview method. Information through correspondence. Mailed questionnaire method. Schedule through enumerators.
  • 23. Secondary Data : The data that has been already collected for a particular purpose and used for next purpose is called secondary data. When investigators find impracticable to collect firsthand information on related issues, secondary data is used. It saves time, money and manpower. Source of Secondary data : I. Published Source II. Unpublished Source
  • 24. Published Sources : Different published source of secondary data are national organizations and international agencies. – International agencies such as WHO, World bank, International Labor Organization. – Governmental organizations such as Central Bureau of Statistics, Ministry of Commerce and Industry, Nepal Rastra Bank, Ministry of Finance. – Semi-governmental organizations such as Nepal Food Corporation, Nepal Electricity Authority. – Private organizations such as Nepal Chamber of Commerce, Federation of Nepal Chamber of Commerce and Industry, Publications.
  • 25. Unpublished Sources : – Records maintained by government offices. – Records maintained by research institutions, research scholars etc. – Records updated by the departments, institutions for their internal purpose.
  • 26. Cross- Sectional data refers to data collected by observing many subjects at the one point or period of time. It is a snapshot of observation at a particular point. For example; Population of women in census year 2068. The data which can be recorded over different periods of time is called time series data. In this case same measurements are recorded on regular basis. For example; population of Nepal in census year 2048, 2058, 2068.
  • 27. The data of each unit is recorded for each follow up time till the occurrence of an event or till the unit fails is called failure time data. It is also called time to event data. Time to event data is mostly found in life time analysis of different parts of computers, different software, different industrial products, life time of infrastructure, clinical studies, etc.
  • 28. It is a dataset in which the behaviors of entities are observed across time. These entities could be individuals, states, companies, institutions, Countries, etc. Panel data is also known as longitudinal or cross- sectional data. Data of individual is recorded repeatedly over number of years. Income of persons X and Y in years 2014, 2015& 2016 according to age and qualification.
  • 29. A population can be defined as an aggregate observation of subjects grouped together by a common feature.  Population is the entire pool from which a statistical sample is drawn.  Census survey is conducted to enumerate all the population units.  Based on the number of individuals belonging to the group, population can be divided into two types; I. Finite population II. Infinite Population
  • 30. Based on the type of individuals in population, population can be divided into two types I. Homogeneous Population Population consisting of individuals of same type is called homogeneous population. II. Heterogeneous Population Population consisting of individuals of different type is called heterogeneous population.