SlideShare a Scribd company logo
Lu2 introduction to statistics
QUANTITATIVE TECHNIQUES
5112
felixmaboko83@gmail.com
Lecturer: Mr Felix Maluleke
Chapter 1:
Introduction to Statistics
What is Statistics?
Statistics is therefore defined as a set of mathematically based and techniques to transform raw
(unprocessed) data into a few summary measures that represent useful and usable information
to support effective decision making.
Information
To make sound business decision, a manager needs high-quality information. Information must
be timely, accurate, relevant, adequate and easily accessible. However, information to support
decision making is seldom readily available in the format, quality and quantity required by the
decision maker. More often than not, it needs to be generated from data.
Data
What is more readily available – from a variety of sources and of varying quality and quantity –
is data. Data consists of individual values that each conveys little useful and usable information
to management. Three examples of data are: the purchase value of a single transaction at a
supermarket (e.g. R214); the time it takes a worker to assemble a single part (e.g. 7.35 minutes).
The Language of Statistics
A random variable
Is any attribute of interest on which data is collected and analysed.
Data
Data is the actual values (numbers) or outcomes recorded on a random variable. Examples:
• the brand of coffee preferred (data: Nescafe, Ricoffy, Frisco).
• the daily occupancy rates of hotels in Cape Town (data: 45%, 72%, 54%)
Terms and Concepts of Statistics
Sampling Unit
A sampling unit is the object being measured, counted or observed with respect to
the random variable under study. Examples: Consumer, an employee, a household,
a company or a product.
More than one random variable can be defined for a given sampling unit. Example:
an employee could be measured in terms of age, qualification and gender.
Population
A population is the collection of all possible data values that exist for the random
variable under study. Example:
• For a study on hotel occupancy levels (the random variable) in Cape Town only.
All hotels in Cape Town would represent the target population.
• To research the age, gender, and savings levels of banking clients (three random
variables being studied), the population would be all savings account holders and
all banks.
Terms and Concepts of Statistics
Population Parameter
A population parameter is a measure that describes a characteristic of a population.
A population average is a parameter, so is a population proportion. It is called a
parameter if it uses all the population data values to compute its value.
Sample
A sample is a sub set of data values drawn from a population. Samples are used
because it is often not possible to record every data values of the population,
mainly because of cost, time and possible item destruction. Example: a sample of 25
hotels in Cape Town is selected to study hotel occupancy levels.
Sample Statistics
A sample statistic is a measure that describes a characteristic of a sample. The
sample average and a sample proportion are two typical sample statistics. Example:
appropriate sample statistics are: the average hotel occupancy level for the sample
of 25 hotels surveyed.
Symbols used to describe statistical concepts
Statistical terms and symbols to distinguish a sample statistics
from a population parameter for a given statistical measure.
Statistical measure Sample statistics Population parameter
Mean 𝑥 µ
Standard deviation S 𝜎
Variance 𝑆2
𝜎2
Size n N
Proportion 𝜌 𝜋
Correlation r 𝜌
Components of Statistics
Descriptive Statistics
Descriptive statistics condense sample data into a few summary descriptive
measures (only describes the behaviour of a random variable in a sample).
Descriptive statistics organise, summarise and extract essential information
contained within the data for communication to management. These allows users to
identify profiles, patterns, relationships, and trends within the data.
Inferential Statistics
Inferential statistics generalises sample findings to the broader population.
Inferential statistics is that area of statistics that allows managers to understand the
bigger population picture of a random variable based on the sample evidence.
Statistical Modelling
Statistical modelling constructs equations between variables that are related to
each other. These equations (called models) are used to estimate or predict values
of one of these variables based on values of related variables.
Data Types and Measurement Scales
Qualitative random variables
Qualitative random variables generate categorical (non-numeric) response data.
The data is represented by categories only. Examples: the gender of a consumer is
either male or female; an employee’s highest qualification is either a matric, a
diploma or a degree.
Quantitative random variables
Quantitative random variables generate numeric response data. These are real
numbers that can be manipulated using arithmetic operations (add, subtract,
multiply and divide). Examples: the age of an employee (e.g. 46 years, 28 years; 32
years); the price of a product in different stores (e.g. R6.75; R7.45; R7.20; R6.99)
Numeric data can be further classified as either discrete or continuous.
Types of Numeric Data
Discrete data
Discrete data is whole number (or integer) data.
Examples: the number of students in a class (e.g. 24;37;41;46), the number of cars
sold by a dealer in a month (e.g. 14;27;21;16), and the number of machine
breakdowns in a shift (e.g. 4;0;6;2).
Continuous data
Continuous data is any number that can occur in an interval data.
Example: the assembly time for a part can be between 27 minutes and 31 minutes
(e.g. assembly time = 28.4 min), a passenger’s hand luggage can have a mass
between 0.5 kg and 10 kg (e.g. 2.4 kg) and the volume of fuel in a car tank can be
between 0 litres and 55 litres (e.g. 42.38 litres).
Measurement of Scale
Nominal data
There must be distinct classes but theses classes have no quantitative properties.
Therefore, no comparison can be made in terms of one category being higher than
the other. Examples of nominal-scaled categorical data are:
• Gender (1 = male; 2 = female)
• City of residence (1 = Pretoria; 2 = Durban; 3 = Cape Town; 4 = Bloemfontein)
• Home language (1 = Xhosa; 2 = Zulu; 3 = English; 4 = Afrikaans; 5 = Sotho)
There are no quantitative properties for this variable or these classes and,
therefore, gender is a normal variable.
Measurement is the process of assigning numbers or labels to objects, person, or
events in accordance with specific rules to represent quantities or qualities of
attributes.
• We do not measure specific objects, persons, etc., we measure attributes or
features that define them.
Different types of Data
Different types of Data
Ordinal data
There are distinct classes but these classes have a natural ordering or ranking. The
difference can be ordered on the basis of magnitude. Examples of ordinal-scaled
categorical data are:
• Size of clothing (1 = small; 2 = medium; 3 = large; 4 = extra large)
• Product usage level (1 = light; 2 = moderate; 3 = heavy;)
• Response to a survey question: ‘Rank your top three TV programmes in order of preference’
(1 = first choice; 2 = second choice; 3 = third choice).
• Final position of horses in a race is an ordinal variable. The horses finish first, second, third,
fourth, and so on.
The different between first and second is not necessarily equivalent to the
difference between second and third, or between third and fourth.
Interval data
It is possible to compare differences in magnitude, but importantly the zero point does not have a
natural meaning. It captures the properties of nominal and ordinal data – used by most psychological
tests. Examples of rating scales responses are shown in the table below. Statements 1,2 and 3 are
illustrations of semantic differential rating scales that use bipolar adjectives (e.g. very slow to
extremely fast service), while statement 4 illustrates a likert rating scale that uses a scale that
ranges from strongly disagree to strongly agree with respect to a statement or an opinion.
1. How would you rate your chances of promotion after the next performance appraisal?
Very poor
1
Poor
2
Unsure
3
Good
4
Very good
5
2. How satisfied are you with your current job description?
Very dissatisfied
1
Dissatisfied
2
Satisfied
3
Very satisfied
4
3. What is your opinion of the latest Idols TV series?
Very boring
1
Dull
2
Ok
3
Exciting
4
Fantastic
5
4. The performance appraisal system is biased in favour of technically oriented employees.
Strongly disagree
1
Disagree
2
Unsure
3
Agree
4
Strongly agree
5
interval data possesses the
two properties of rank-
order (same as ordinal data)
and distance in terms of
how much more or how
much less an object
possesses of a given
characteristic. However, it
has a no zero point.
Therefore it is not
meaningful to compare the
ratio of interval scaled
values with one another.
Example, we can’t conclude
that a rating of 4 is twice
important as a rating of 2.
Different types of Data
Ratio data
• It is the highest level of measurement
• This level has all the three attributes:
o magnitude
o Equal interval
o Absolute zero point
• It represent continuous values
Examples of ratio-scaled data are: employee ages (years), customer income (R), distance
travelled (km), door height (cm), product mass (g), volume of liquid in a container (ml),
machine speed (rpm), tyre pressure (psi), product price (R), length of service (months)
and number of shopping trips per month (0; 1; 2; 4; etc).
Different Sources of Data
Internal data source
Internal data source in a business context, is data sourced from within a company. It is
data generated during the normal course of business activities.
It is relatively inexpensive to gather, readily available from company databases and
potentially of good quality (since it is recorded using internal business systems).
Examples of internal data sources are:
• Sales vouchers, credit notes, accounts receivable, accounts payable and asset registers for
financial data
• Production cost records, stock sheets and downtime records for production data
• Time sheets, wages and salaries schedules and absenteeism records for human resource data
• Product sales records and advertising expenditure budgets for marketing data.
Different sources of Data
External data source
external data source in a business context, is data that exist outside an organisation.
They are mainly business associations, government agencies, universities and various
research institutions.
The cost and reliability of external data is dependent on the source. A wide selection of
external databases exist and, in many cases, can be accessed via the internet, either free
of charge or for a fee.
A few examples of relevant to managers are:
• Statistics South Africa – for macroeconomic data
• South African Chamber of Business – for trade surveys,
• I-Net Bridge and etc.
Different sources of Data
Primary data source
Primary data is data that is recorded for the first time at source and with specific
purpose in mind. Primary data can be either internal (if it is recorded directly from an
internal business process, such as machine speed settings, sales invoices, stock sheets
and employee attendance records) or external (e.g. obtained through surveys such as
human resources surveys, economic surveys and consumer surveys (market research)).
Advantage of primary sourced data
It is its high quality (i.e. relevancy and accuracy). This is due to
Lu2 introduction to statistics
End of Learning Unit
One!!!!

More Related Content

What's hot (20)

PPTX
marketing research & applications on SPSS
ANSHU TIWARI
 
PPTX
Business statistics
Homework Guru
 
PPT
Business Statistics Chapter 1
Lux PP
 
PPTX
data analysis techniques and statistical softwares
Dr.ammara khakwani
 
PPTX
Statistical analysis and interpretation
Dave Marcial
 
PPT
Introduction to statistics 1
Anwar Afridi
 
PPTX
Data Analysis and Statistics
T.S. Lim
 
PDF
Business statistics-i-part1-aarhus-bss
Antonio Rivero Ostoic
 
PPTX
Statistics 1
Saed Jama
 
PPT
Aed1222 lesson 4
nurun2010
 
PDF
Statistics: Chapter One
Saed Jama
 
PPTX
Basics of data_interpretation
Vasista Vinuthan
 
PDF
Descriptive Statistics
CIToolkit
 
PPT
Introduction to spss
Manish Parihar
 
PPTX
Statistics Class 10 CBSE
Smitha Sumod
 
PDF
Panel slides
Titus Mutambu Mweta
 
PDF
Bbs11 ppt ch02
Tuul Tuul
 
PPTX
Das20502 chapter 1 descriptive statistics
Rozainita Rosley
 
PPTX
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Stats Statswork
 
marketing research & applications on SPSS
ANSHU TIWARI
 
Business statistics
Homework Guru
 
Business Statistics Chapter 1
Lux PP
 
data analysis techniques and statistical softwares
Dr.ammara khakwani
 
Statistical analysis and interpretation
Dave Marcial
 
Introduction to statistics 1
Anwar Afridi
 
Data Analysis and Statistics
T.S. Lim
 
Business statistics-i-part1-aarhus-bss
Antonio Rivero Ostoic
 
Statistics 1
Saed Jama
 
Aed1222 lesson 4
nurun2010
 
Statistics: Chapter One
Saed Jama
 
Basics of data_interpretation
Vasista Vinuthan
 
Descriptive Statistics
CIToolkit
 
Introduction to spss
Manish Parihar
 
Statistics Class 10 CBSE
Smitha Sumod
 
Panel slides
Titus Mutambu Mweta
 
Bbs11 ppt ch02
Tuul Tuul
 
Das20502 chapter 1 descriptive statistics
Rozainita Rosley
 
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Stats Statswork
 

Similar to Lu2 introduction to statistics (20)

PPTX
Presentation1.pptx
IndhuGreen
 
PPTX
MMW (Data Management)-Part 1 for ULO 2 (1).pptx
PETTIROSETALISIC
 
PDF
Data Analysis with SPSS PPT.pdf
Thanavathi C
 
PPTX
Chap001
Sandra Nicks
 
PPTX
01 Introduction (1).pptx
BAVAHRNIAPSUBRAMANIA
 
DOCX
Statistical ProcessesCan descriptive statistical processes b.docx
darwinming1
 
PPT
grade7statistics-150427083137-conversion-gate01.ppt
KayraTheressGubat
 
PPTX
Introduction to Statistics statistics formuls
MaulikVasani1
 
PPTX
Introduction to Data (Data Analytics)...
70147084
 
DOC
Statistics (2).doc
Atoshe Elmi
 
PPTX
Probability_and_Statistics_lecture_notes_1.pptx
AliMurat5
 
PDF
IV STATISTICS I.pdf
PyaePhyoKoKo2
 
PPTX
Basic statistics
Ganesh Raju
 
PPTX
1.2 types of data
Long Beach City College
 
PPTX
Introduction to Statistics
Jahanzaib Shah
 
PPT
Basic Concepts of Statistics & Its Analysis
chachachola
 
PDF
INTRODUCTION TO STATISTICS QUANTITATIVE TECHNIQUES.pdf
Alison Tutors
 
PPTX
Introduction to statistics.pptx
Unfold1
 
DOCX
Statistics  What you Need to KnowIntroductionOften, when peop.docx
dessiechisomjj4
 
PPTX
Basics stat ppt-types of data
Farhana Shaheen
 
Presentation1.pptx
IndhuGreen
 
MMW (Data Management)-Part 1 for ULO 2 (1).pptx
PETTIROSETALISIC
 
Data Analysis with SPSS PPT.pdf
Thanavathi C
 
Chap001
Sandra Nicks
 
01 Introduction (1).pptx
BAVAHRNIAPSUBRAMANIA
 
Statistical ProcessesCan descriptive statistical processes b.docx
darwinming1
 
grade7statistics-150427083137-conversion-gate01.ppt
KayraTheressGubat
 
Introduction to Statistics statistics formuls
MaulikVasani1
 
Introduction to Data (Data Analytics)...
70147084
 
Statistics (2).doc
Atoshe Elmi
 
Probability_and_Statistics_lecture_notes_1.pptx
AliMurat5
 
IV STATISTICS I.pdf
PyaePhyoKoKo2
 
Basic statistics
Ganesh Raju
 
1.2 types of data
Long Beach City College
 
Introduction to Statistics
Jahanzaib Shah
 
Basic Concepts of Statistics & Its Analysis
chachachola
 
INTRODUCTION TO STATISTICS QUANTITATIVE TECHNIQUES.pdf
Alison Tutors
 
Introduction to statistics.pptx
Unfold1
 
Statistics  What you Need to KnowIntroductionOften, when peop.docx
dessiechisomjj4
 
Basics stat ppt-types of data
Farhana Shaheen
 
Ad

More from LamineKaba6 (7)

PPTX
Sadcw 7e chapter04_recorded
LamineKaba6
 
PPTX
Sadcw 7e chapter01-done
LamineKaba6
 
PPTX
Chapter 2
LamineKaba6
 
PPTX
Sadcw 7e chapter04(1)
LamineKaba6
 
PPTX
Sadcw 7e chapter03-done(1)
LamineKaba6
 
PPTX
Sadcw 7e chapter05-done
LamineKaba6
 
PPTX
Sadcw 7e chapter06-done
LamineKaba6
 
Sadcw 7e chapter04_recorded
LamineKaba6
 
Sadcw 7e chapter01-done
LamineKaba6
 
Chapter 2
LamineKaba6
 
Sadcw 7e chapter04(1)
LamineKaba6
 
Sadcw 7e chapter03-done(1)
LamineKaba6
 
Sadcw 7e chapter05-done
LamineKaba6
 
Sadcw 7e chapter06-done
LamineKaba6
 
Ad

Recently uploaded (20)

PDF
Human-centred design in online workplace learning and relationship to engagem...
Tracy Tang
 
PDF
Shuen Mei Parth Sharma Boost Productivity, Innovation and Efficiency wit...
AWS Chicago
 
PDF
Arcee AI - building and working with small language models (06/25)
Julien SIMON
 
PPTX
Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...
Barts Health
 
PDF
TrustArc Webinar - Data Privacy Trends 2025: Mid-Year Insights & Program Stra...
TrustArc
 
PDF
Market Wrap for 18th July 2025 by CIFDAQ
CIFDAQ
 
PPTX
Simplifying End-to-End Apache CloudStack Deployment with a Web-Based Automati...
ShapeBlue
 
PDF
Smart Air Quality Monitoring with Serrax AQM190 LITE
SERRAX TECHNOLOGIES LLP
 
PDF
CloudStack GPU Integration - Rohit Yadav
ShapeBlue
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PPTX
MSP360 Backup Scheduling and Retention Best Practices.pptx
MSP360
 
PDF
How Current Advanced Cyber Threats Transform Business Operation
Eryk Budi Pratama
 
PDF
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
PDF
Meetup Kickoff & Welcome - Rohit Yadav, CSIUG Chairman
ShapeBlue
 
PDF
HydITEx corporation Booklet 2025 English
Георгий Феодориди
 
PDF
Upgrading to z_OS V2R4 Part 01 of 02.pdf
Flavio787771
 
PDF
Bitcoin+ Escalando sin concesiones - Parte 1
Fernando Paredes García
 
PDF
Are there government-backed agri-software initiatives in Limerick.pdf
giselawagner2
 
PDF
NewMind AI Journal - Weekly Chronicles - July'25 Week II
NewMind AI
 
PDF
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
Human-centred design in online workplace learning and relationship to engagem...
Tracy Tang
 
Shuen Mei Parth Sharma Boost Productivity, Innovation and Efficiency wit...
AWS Chicago
 
Arcee AI - building and working with small language models (06/25)
Julien SIMON
 
Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...
Barts Health
 
TrustArc Webinar - Data Privacy Trends 2025: Mid-Year Insights & Program Stra...
TrustArc
 
Market Wrap for 18th July 2025 by CIFDAQ
CIFDAQ
 
Simplifying End-to-End Apache CloudStack Deployment with a Web-Based Automati...
ShapeBlue
 
Smart Air Quality Monitoring with Serrax AQM190 LITE
SERRAX TECHNOLOGIES LLP
 
CloudStack GPU Integration - Rohit Yadav
ShapeBlue
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
MSP360 Backup Scheduling and Retention Best Practices.pptx
MSP360
 
How Current Advanced Cyber Threats Transform Business Operation
Eryk Budi Pratama
 
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
Meetup Kickoff & Welcome - Rohit Yadav, CSIUG Chairman
ShapeBlue
 
HydITEx corporation Booklet 2025 English
Георгий Феодориди
 
Upgrading to z_OS V2R4 Part 01 of 02.pdf
Flavio787771
 
Bitcoin+ Escalando sin concesiones - Parte 1
Fernando Paredes García
 
Are there government-backed agri-software initiatives in Limerick.pdf
giselawagner2
 
NewMind AI Journal - Weekly Chronicles - July'25 Week II
NewMind AI
 
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 

Lu2 introduction to statistics

  • 4. What is Statistics? Statistics is therefore defined as a set of mathematically based and techniques to transform raw (unprocessed) data into a few summary measures that represent useful and usable information to support effective decision making. Information To make sound business decision, a manager needs high-quality information. Information must be timely, accurate, relevant, adequate and easily accessible. However, information to support decision making is seldom readily available in the format, quality and quantity required by the decision maker. More often than not, it needs to be generated from data. Data What is more readily available – from a variety of sources and of varying quality and quantity – is data. Data consists of individual values that each conveys little useful and usable information to management. Three examples of data are: the purchase value of a single transaction at a supermarket (e.g. R214); the time it takes a worker to assemble a single part (e.g. 7.35 minutes).
  • 5. The Language of Statistics A random variable Is any attribute of interest on which data is collected and analysed. Data Data is the actual values (numbers) or outcomes recorded on a random variable. Examples: • the brand of coffee preferred (data: Nescafe, Ricoffy, Frisco). • the daily occupancy rates of hotels in Cape Town (data: 45%, 72%, 54%)
  • 6. Terms and Concepts of Statistics Sampling Unit A sampling unit is the object being measured, counted or observed with respect to the random variable under study. Examples: Consumer, an employee, a household, a company or a product. More than one random variable can be defined for a given sampling unit. Example: an employee could be measured in terms of age, qualification and gender. Population A population is the collection of all possible data values that exist for the random variable under study. Example: • For a study on hotel occupancy levels (the random variable) in Cape Town only. All hotels in Cape Town would represent the target population. • To research the age, gender, and savings levels of banking clients (three random variables being studied), the population would be all savings account holders and all banks.
  • 7. Terms and Concepts of Statistics Population Parameter A population parameter is a measure that describes a characteristic of a population. A population average is a parameter, so is a population proportion. It is called a parameter if it uses all the population data values to compute its value. Sample A sample is a sub set of data values drawn from a population. Samples are used because it is often not possible to record every data values of the population, mainly because of cost, time and possible item destruction. Example: a sample of 25 hotels in Cape Town is selected to study hotel occupancy levels. Sample Statistics A sample statistic is a measure that describes a characteristic of a sample. The sample average and a sample proportion are two typical sample statistics. Example: appropriate sample statistics are: the average hotel occupancy level for the sample of 25 hotels surveyed.
  • 8. Symbols used to describe statistical concepts Statistical terms and symbols to distinguish a sample statistics from a population parameter for a given statistical measure. Statistical measure Sample statistics Population parameter Mean 𝑥 µ Standard deviation S 𝜎 Variance 𝑆2 𝜎2 Size n N Proportion 𝜌 𝜋 Correlation r 𝜌
  • 9. Components of Statistics Descriptive Statistics Descriptive statistics condense sample data into a few summary descriptive measures (only describes the behaviour of a random variable in a sample). Descriptive statistics organise, summarise and extract essential information contained within the data for communication to management. These allows users to identify profiles, patterns, relationships, and trends within the data. Inferential Statistics Inferential statistics generalises sample findings to the broader population. Inferential statistics is that area of statistics that allows managers to understand the bigger population picture of a random variable based on the sample evidence. Statistical Modelling Statistical modelling constructs equations between variables that are related to each other. These equations (called models) are used to estimate or predict values of one of these variables based on values of related variables.
  • 10. Data Types and Measurement Scales Qualitative random variables Qualitative random variables generate categorical (non-numeric) response data. The data is represented by categories only. Examples: the gender of a consumer is either male or female; an employee’s highest qualification is either a matric, a diploma or a degree. Quantitative random variables Quantitative random variables generate numeric response data. These are real numbers that can be manipulated using arithmetic operations (add, subtract, multiply and divide). Examples: the age of an employee (e.g. 46 years, 28 years; 32 years); the price of a product in different stores (e.g. R6.75; R7.45; R7.20; R6.99) Numeric data can be further classified as either discrete or continuous.
  • 11. Types of Numeric Data Discrete data Discrete data is whole number (or integer) data. Examples: the number of students in a class (e.g. 24;37;41;46), the number of cars sold by a dealer in a month (e.g. 14;27;21;16), and the number of machine breakdowns in a shift (e.g. 4;0;6;2). Continuous data Continuous data is any number that can occur in an interval data. Example: the assembly time for a part can be between 27 minutes and 31 minutes (e.g. assembly time = 28.4 min), a passenger’s hand luggage can have a mass between 0.5 kg and 10 kg (e.g. 2.4 kg) and the volume of fuel in a car tank can be between 0 litres and 55 litres (e.g. 42.38 litres).
  • 12. Measurement of Scale Nominal data There must be distinct classes but theses classes have no quantitative properties. Therefore, no comparison can be made in terms of one category being higher than the other. Examples of nominal-scaled categorical data are: • Gender (1 = male; 2 = female) • City of residence (1 = Pretoria; 2 = Durban; 3 = Cape Town; 4 = Bloemfontein) • Home language (1 = Xhosa; 2 = Zulu; 3 = English; 4 = Afrikaans; 5 = Sotho) There are no quantitative properties for this variable or these classes and, therefore, gender is a normal variable. Measurement is the process of assigning numbers or labels to objects, person, or events in accordance with specific rules to represent quantities or qualities of attributes. • We do not measure specific objects, persons, etc., we measure attributes or features that define them. Different types of Data
  • 13. Different types of Data Ordinal data There are distinct classes but these classes have a natural ordering or ranking. The difference can be ordered on the basis of magnitude. Examples of ordinal-scaled categorical data are: • Size of clothing (1 = small; 2 = medium; 3 = large; 4 = extra large) • Product usage level (1 = light; 2 = moderate; 3 = heavy;) • Response to a survey question: ‘Rank your top three TV programmes in order of preference’ (1 = first choice; 2 = second choice; 3 = third choice). • Final position of horses in a race is an ordinal variable. The horses finish first, second, third, fourth, and so on. The different between first and second is not necessarily equivalent to the difference between second and third, or between third and fourth.
  • 14. Interval data It is possible to compare differences in magnitude, but importantly the zero point does not have a natural meaning. It captures the properties of nominal and ordinal data – used by most psychological tests. Examples of rating scales responses are shown in the table below. Statements 1,2 and 3 are illustrations of semantic differential rating scales that use bipolar adjectives (e.g. very slow to extremely fast service), while statement 4 illustrates a likert rating scale that uses a scale that ranges from strongly disagree to strongly agree with respect to a statement or an opinion. 1. How would you rate your chances of promotion after the next performance appraisal? Very poor 1 Poor 2 Unsure 3 Good 4 Very good 5 2. How satisfied are you with your current job description? Very dissatisfied 1 Dissatisfied 2 Satisfied 3 Very satisfied 4 3. What is your opinion of the latest Idols TV series? Very boring 1 Dull 2 Ok 3 Exciting 4 Fantastic 5 4. The performance appraisal system is biased in favour of technically oriented employees. Strongly disagree 1 Disagree 2 Unsure 3 Agree 4 Strongly agree 5 interval data possesses the two properties of rank- order (same as ordinal data) and distance in terms of how much more or how much less an object possesses of a given characteristic. However, it has a no zero point. Therefore it is not meaningful to compare the ratio of interval scaled values with one another. Example, we can’t conclude that a rating of 4 is twice important as a rating of 2.
  • 15. Different types of Data Ratio data • It is the highest level of measurement • This level has all the three attributes: o magnitude o Equal interval o Absolute zero point • It represent continuous values Examples of ratio-scaled data are: employee ages (years), customer income (R), distance travelled (km), door height (cm), product mass (g), volume of liquid in a container (ml), machine speed (rpm), tyre pressure (psi), product price (R), length of service (months) and number of shopping trips per month (0; 1; 2; 4; etc).
  • 16. Different Sources of Data Internal data source Internal data source in a business context, is data sourced from within a company. It is data generated during the normal course of business activities. It is relatively inexpensive to gather, readily available from company databases and potentially of good quality (since it is recorded using internal business systems). Examples of internal data sources are: • Sales vouchers, credit notes, accounts receivable, accounts payable and asset registers for financial data • Production cost records, stock sheets and downtime records for production data • Time sheets, wages and salaries schedules and absenteeism records for human resource data • Product sales records and advertising expenditure budgets for marketing data.
  • 17. Different sources of Data External data source external data source in a business context, is data that exist outside an organisation. They are mainly business associations, government agencies, universities and various research institutions. The cost and reliability of external data is dependent on the source. A wide selection of external databases exist and, in many cases, can be accessed via the internet, either free of charge or for a fee. A few examples of relevant to managers are: • Statistics South Africa – for macroeconomic data • South African Chamber of Business – for trade surveys, • I-Net Bridge and etc.
  • 18. Different sources of Data Primary data source Primary data is data that is recorded for the first time at source and with specific purpose in mind. Primary data can be either internal (if it is recorded directly from an internal business process, such as machine speed settings, sales invoices, stock sheets and employee attendance records) or external (e.g. obtained through surveys such as human resources surveys, economic surveys and consumer surveys (market research)). Advantage of primary sourced data It is its high quality (i.e. relevancy and accuracy). This is due to
  • 20. End of Learning Unit One!!!!

Editor's Notes

  • #5: The quantities measured in a study are called random variables, and a particular outcome is called an observation. Several observations are collectively known as data. The collection of all possible outcomes is called the population.
  • #6: The quantities measured in a study are called random variables, and a particular outcome is called an observation. Several observations are collectively known as data. The collection of all possible outcomes is called the population.
  • #7: In practice, we cannot usually observe the whole population. Instead we observe a sub-set of the population, known as a sample. In order to ensure that the sample we take is representative of the whole population, we usually take a random sample in which all members of the population are equally likely to be selected for inclusion in the sample.
  • #8: For example, if we are interested in conducting a survey of the amount of physical exercise undertaken by the general public, surveying people entering and leaving a gymnasium would provide a biased sample of the population, and the results obtained would not generalise to the population at large.
  • #10: Descriptive Statistics – is used to profile data Inferential Statistics – is used to test genuine patterns or relationships in the population based on sample data. Statistical modelling – is used to explore relationships.
  • #11: The type of data available for analysis is determined by the nature of its random variable. A random variable is either qualitative (categorical) or quantitative (numerical) in nature.
  • #12: The type of data available for analysis is determined by the nature of its random variable. A random variable is either qualitative (categorical) or quantitative (numerical) in nature.
  • #13: There are four types of measurement scales: nominal, ordinal, interval and ratio. Nominal data is the weakest form of data to analyse since the codes assigned to the various categories have no numerical properties.
  • #14: Ordinal data = is stronger than nominal data because the data possesses the numeric property of order ( but the distances between the ranks are not equal)
  • #15: Interval data (rating scales) possesses sufficient numeric properties to be treated as numeric data for the purpose of statistical analysis. A much wider range of statistical techniques can therefore be applied to interval data compared with nominal and ordinal data.
  • #16: Ratio data = is the strongest data for statistical analysis. Compared to the other data types, the most amount of statistical information can be extracted from it. Also, more statistical methods can be applied to ratio data than to any other data type.
  • #17: Data Source = data for statistical analysis is available from many different sources. A manager must decide how reliable and accurate a set of data from a given source is before basing decisions on findings derived from it. Unreliable data results in invalid findings.