SlideShare a Scribd company logo
5
Most read
6
Most read
8
Most read
Lecture 1
Introduction to Biostatistics
Biostatistics=biology+statistics
Other definitionsfor“Statistics”
Frequentlyusedtoreferredtorecordeddata
Denotescharacteristicscalculatedforaset of data : sample mean
Statistical procedurescanbe dividedintotwomajorcategories:descriptive statisticsandinferential statistics.
Descriptive statistics
Descriptive statisticsincludesstatistical proceduresthatwe use to describe the populationwe are studying.The data
couldbe collectedfromeitherasample ora population,butthe resultshelpusorganize anddescribedata.
Descriptive statisticscanonlybe usedtodescribe the groupthat isbeingstudying.Thatis,the resultscannotbe
generalizedtoanylargergroup.
Frequencydistributions, measuresof central tendency (mean,median,andmode),andgraphslike pie chartsandbar
charts that describe the dataare all examplesof descriptive statistics.
Inferential Statistics
Inferentialstatisticsisconcernedwithmakingpredictionsorinferencesaboutapopulationfromobservationsand
analysesof a sample.Thatis,we can take the resultsof ananalysisusinga sample andcan generalizeittothe larger
populationthatthe sample represents.Inordertodothis,however,itisimperativethatthe sample is
representative of the grouptowhichitis beinggeneralized.
Examplesof inferentialstatisticsincludelinearregressionanalyses, ANOVA,correlationanalyses,andsurvival
analysis,toname a few.
Parameter
A parameterisa value,usuallyunknown(andwhichtherefore hastobe estimated),usedtorepresentacertain
populationcharacteristic.Forexample,the populationmeanisaparameterthatisoftenusedto indicate the
average value of a quantity.
Withina population,aparameterisafixedvalue whichdoesnotvary.Eachsample drawnfromthe populationhas
itsown value of anystatisticthat isusedto estimate thisparameter.Forexample,the meanof the datain a sample
isusedto give informationaboutthe overallmeaninthe populationfromwhichthatsample wasdrawn.
Parametersare oftenassignedGreekletters(e.g.µ )
Statistic
A statisticisa quantitythat iscalculatedfroma sample of data.It is usedtogive informationaboutunknownvalues
inthe correspondingpopulation.Forexample,the average of the dataina sample isusedtogive informationabout
the overall average inthe populationfromwhichthatsample wasdrawn.
Statisticsare oftenassignedRomanletters(e.g.mands)
Population
A populationisanyentire collectionof people,animals,plantsorthingsfromwhichwe maycollectdata.It isthe
entire groupwe are interestedin,whichwe wishtodescribeordraw conclusionsabout.
The numberof observationsinapopulationiscalledthe size of the populationandisdenotedbythe letterN.
Example
The populationfora studyof infanthealthmightbe all childrenborninthe UK inthe 1980's. The sample mightbe all
babiesbornon 7th May in any of the years.
Sample
A sample isa groupof unitsselectedfromalargergroup(the population).Bystudyingthe sample itishopedtodraw
validconclusionsaboutthe largergroup.
A sample isgenerallyselectedforstudybecause the populationistoolarge tostudyin itsentirety.The sample
shouldbe representativeof the general population.Thisisoftenbestachievedbyrandomsampling.Also,before
collectingthe sample,itisimportantthatthe researchercarefullyandcompletelydefinesthe population,includinga
descriptionof the memberstobe included.
The numberof observationsinasample iscalledthe size of the sample andisdenotedbythe letter n.
Example
The populationfora studyof infanthealthmightbe all children borninthe UK inthe 1980's. The sample mightbe all
babiesbornon 7th May in any of the years.
Lecture 2
Data:
The raw material of Statistics is data.
We may define data as figures. Figures result from the process of counting or from taking a measurement.
For example:
1. When a hospital administrator counts the number of patients (counting).
2. When a nurse weighs a patient (measurement)
We search for suitable data to serve as the raw material for our investigation.
Such data are available from one or more of the following sources:
1- Routinely kept records.
For example:
1. Hospital medical records contain immense amounts of information on patients.
2. Hospital accounting records contain a wealth of data on the facility’s business activities.
2- External sources.
The data needed to answer a question may already exist in the form of published reports, commercially available data
banks, or the research literature, i.e. someone else has already asked the same question.
3- Surveys:
The source may be a survey, if the data needed is about answering certain questions.
For example: If the administrator of a clinic wishes to obtain information regarding the mode of transportation used by
patients to visit the clinic, then a survey may be conducted among patients to obtain this information.
4- Experiments.
Frequently the data needed to answer a question are available only as the result of an experiment.
For example:If a nurse wishes to know which of several strategies is best for maximizing patient compliance, she
might conduct an experiment in which the different strategies of motivating compliance are tried with different
patients.
Applications ofbiostatistics
It has particular applications to medicine and to agriculture, public health, including epidemiology, health services
research,nutrition, and environmental health,
Design and analysis of clinical trials in medicine
Statistical methods are beginning to be integrated into medical informatics, public health informatics, and
bioinformatics
Presentation ofdata
Massive volume of data
It is Difficult to learn data by examining unorganized data
Data should be organized and condensed into a form that can be more rapidly and easily understood and interpreted
Classification
Process of dividing a set of observations or objects into classes or groups in such a way that
1. Observations or objects in the same class or group are similar
2. Observations or objects in each class or group are dissimilar to observations or objects in other class or group.
Types ofclassification
1. Data are sorted according to one criterion … one way classification
2. When data are sorted according to two criteria… two way classification.
3. When data are sorted according to severalcriteria… manifold classification.
Data may be classified according to qualitative, temporal and geographic characteristic.
Arrangement of data according to the values of a variable characteristic is called DISTRIBUTION.
When the defining variable is expressed in terms of location ….spatial or geographical distribution.
Temporal arrangement of values is called a time series.
Tabulation
Systematic presentation of data classified under suitable heads and subheads , and placed in columns and rows.
Data is easy to understand, facilitate comparison,and effective way to conveyinformation to a reader.
Types oftables
1. general purpose table..large in size,are extensive with vast coverage and are constructed for for reference purpose.
2.specific purpose table….simple in structure and deal with one or two criteria of classification .. Used ton analyse or
to assist in analyzing data.
Parts of a table
A statistical table has at least four major parts and some other minor parts.
(1) The Title
(2) The Box Head (column captions)
(3) The Stub (row captions)
(4) The Body
(5) Prefatory Notes
(6) Foots Notes
(7) Source Notes
Main parts of table and its construction
Title :
 table must have a self explanatory title.
 It should be brief in the form of phrases.
 Abbreviations shouldn’t be used
 Should be in capitals throughout
 The different parts of a title should be separated by commas but no full stop at the end
 If in two or more lines .. Inverted pyramid arrangement of the lines should be used.
Column captions and boxhead
 The heading of each column is called column caption while the section of a table that contains the column
aption is called boxhead
 The heading should be clear but concise
 Most important characteristic is placed in 1st
column
 Only first word in each column caption shoud be capitalized. No full stop should be put at the end.
Row caption and Stub
 The heading or title for a row caption is called a row caption and the section containing the row caption is
called stub.
 The principles of column caption s apply to row caption in the sub
Prefatory notes and footnotes
 Explanatory notes incorporated in the table bene ath the title and below the bdy is calledPrefatory notes and
footnotes respectively.
 Prefatory note give additional specification of the data indicative of items included and excluded for all data
of the table, statements of the box etc.
 Footnotes are used to clarify anything in the table by giving a fuller description
Source notes
 Every table should have a source note unless the table is an original tabulation and its source is clear from the
context
 It is placed immediately below the table and below the footnote.
Body and arrangement of data
 The most important part.
 Taking in consideration the basis of classification arrangement of data is made.
 The data may be arranged according to
o alphabetical order
o time of occurance
o location
o magnitude
o by customary arrangement i.e men,women and children
Spacing and ruling
Thick or double lines are used for emphasis and for separating the title, the boxhead, the stub,etc
General Rules ofTabulation:
A table should be simple and attractive. There should be no need of further explanations (details).
Proper and clear headings for columns and rows should be need.
Suitable approximation may be adopted and figures may be rounded off.
The unit of measurement should be well defined.
If the observations are large in number they can be broken into two or three tables.
Thick lines should be used to separate the data under big classes and thin lines to separate the sub classes of data.
Lecture 3
FrequencyDistribution
A frequencydistributionisanorderlyarrangementof dataclassifiedaccordingtothe magnitude of the observations.
Whenthe data are groupedintoclassesof appropriate size indicatingthe numberof observationsineachclasswe
geta frequencydistribution.Byformingfrequencydistribution,we cansummarize the dataeffectively.Itisa
methodof presentingthe dataina summarizedform.FrequencydistributionisalsoknownasFrequencytable.
Usesof FrequencyDistribution
1. Frequencydistributionhelpsustoanalyze the data.
2. Frequencydistributionhelpsustofacilitate the computationof variousstatisticalmeasures
UngroupedFrequencyDistribution
A frequencydistributionwithan interval widthof 1is refferedtoanungroupedfrequencydistribution.Ungroped
frequencydistributionisanarrangementof the observedvaluesinascendingorder.The ungroupedfrequency
distributionare those data,whichare notarrangedin groups.They are knownasindividual series.Whenthe
ungroupeddataare grouped,we getthe groupedfrequencydistribution.
GroupedFrequencyDistributionTable
Frequencydistributiontable (alsoknownasfrequencytable) consistsof variouscomponents.
Classes:A large numberof observationsvaryinginawide range are usuallyclassifiedinseveralgroupsaccordingto
the size of theirvalues.Eachof these groupsisdefinedbyaninterval calledclassinterval.The classinterval between
10 and 20 is definedas10-20.
Class limits:The smallestandlargestpossible valuesineachclassof a frequencydistributiontableare knownas
classlimits.Forthe class10-20, the classlimitsare 10 and 20. 10 iscalledthe lowerclasslimitand20 iscalledthe
upperclass limit.
Class mark: Classmark is the midmostvalue of the classinterval.Itisalsoknownasthe midvalue. Midvalue of each
class= (lowerlimit+Upperlimit)
If the classis 0-10, lowerlimitis0 and upperlimitis10. So the midvalue is (0+10)2 = 102 = 5.
Magnitude of a class interval: The difference betweenthe upperandlowerlimitof aclassis calledthe magnitude of
a class interval.
Class frequency:The numberof observationfallingwithinaclassinterval iscalledclassfrequencyof thatclass
interval.
Creatinga GroupedFrequencyDistribution
Findthe largestand smallestvalues
Compute the Range = Maximum - Minimum
Selectthe numberof classesdesired.Thisisusuallybetween5and20.
Findthe classwidthby dividingthe range bythe numberof classesandroundingup.There are twothingsto be
careful of here.Youmust round up,notoff.Normally3.2wouldroundto be 3, butin roundingup,itbecomes4. If
the range dividedbythe numberof classesgivesanintegervalue (noremainder),thenyoucaneitheraddone tothe
numberof classesor addone to the classwidth.Sometimesyou're lockedintoacertainnumberof classesbecause
of the instructions.The Blumantextfailstomentionthe case whenthere isnoremainder.
Picka suitable startingpointlessthanorequal tothe minimumvalue.Youwill be able tocover:"the classwidth
timesthe numberof classes"values.Youneedtocoverone more value thanthe range.Follow thisrule andyou'll be
okay:The starting pointplus the numberof classestimes the class width mustbe greaterthan the maximumvalue.
Your startingpointisthe lowerlimitof the firstclass.Continue toaddthe class widthtothislowerlimittogetthe
restof the lowerlimits.
To findthe upperlimitof the firstclass,subtractone from the lowerlimitof the secondclass.Thencontinue toadd
the class widthtothisupperlimittofindthe restof the upperlimits.
Findthe boundariesbysubtracting0.5 unitsfromthe lowerlimitsandadding0.5 unitsfrom the upperlimits.The
boundariesare alsohalf-waybetweenthe upperlimitof one classandthe lowerlimitof the nextclass.Depending
on whatyou're tryingto accomplish,itmaynot be necessarytofindthe boundaries.
Tallythe data.
Findthe frequencies.
Findthe cumulative frequencies.Dependingonwhatyou're tryingto accomplish,itmaynotbe necessarytofindthe
cumulative frequencies.
If necessary,findthe relativefrequenciesand/orrelativecumulative frequencies
LECTURE 4
Graphical representation
The visual displayof statistical datainthe formof points, lines,areasandothergeometrical formandsymbolsis
calledgraphical representation
Two maingroups;Graphs & diagrams
Difference betweenagraph and a diagram
Graph is a representationof databya continuouscurve usuallyshownona graphpaper.
Diagram isany otherone, twoor three dimensional formof visual representation.
Diagrams
Bestsuitedtospatial series anddatasplitintodifferentcategories
For comparisonof same type of data at differentplaces
Advantages
More attractive thanfigures.
Beinga visual display,leave alonglastingandmore effective impressiononthe mindof a reader
Comparisoniseasierwithdiagrams.
Disadvantage isthat itis lessaccurate than tables.
Drawing a diagram
An appropriate scale shouldbe chosen
Must have a title,akeyfootnote or source note is alsonecessary
It shouldbe shaded,coloredorcrosshatchedto show the differentparts,if any.
Types ofdiagram
1. Simple barchart
2. Multiple barchart
3. Componentbarchart
4. Rectanglesandsubdividedrectangles
5. Pictograms
6. Pie diagrams
Simple bar chart
Consistsof horizontal orvertical barsof equal widthsor
lengthsproportional tothe valuestheyrepresent.
The space separating the barsshouldn’texceedthe width
of the bar and shouldn’tbe lessthanhalf of itswidth.
The bar shouldneitherbe exceedinglylongandnarrow
nor shortand broad.
Multiple bar chart
It showstwoor more characteristicscorrespondingtothe value of a commonvariable inthe formsof groupedbars,
whose lengthsare proportional tothe valuesof characteristics,andeachof whichisshadedor coloreddifferentlyto
aididentification.
It isa gooddevice forthe comparisonof twoor three kindsof information.
Componentbar chart
It isan effective technique inwhicheachbarisdividedintotwoormore sections,proportional insizetothe
componentpartsof a total beingdisplayedbyeach bar.
Variouscomponentsare shadedorcoloreddifferentlyto increase the overall effectivenessof the diagram
It isusedto representthe accumulationof the variouscomponentsof dataandthe percentages.
Theyare alsoknownas subdividedbars.
Rectanglesand subdividedrectangles
The area of rectangle isequal tothe productof itslengthandbreadth.
To representaquantitybya rectangle,bothlengthandbreadthof the rectangle are used.
Subdividedrectanglesare drawnforthe data where the quantitiesalongwiththeircomponentsare tobe
compared.
For construction:
Change eachcomponentintothe percentage of the correspondingtotal.
Draw one rectangle foreachtotal,takingequal lengthsandbreadthsproportional tothe total.
Pictograms
It isa device forportrayingstatistical databymeansof picturesorsmall symbols.
Representaunitvalue of the data by a standardsymbol or a picture andthe whole quantitybybyanappropriate
numberof repetitionsof symbolconcerned.
A quantitylessthana unitisrepresentedbyapart of the symbol used.
Pie diagrams/sector diagrams
It consistsof a circle dividedintosectorsorpie shapedpieceswhose areasare proportional tothe variouspartsinto
whichthe whole quantityisdivided.
Sectorsare shadedor coloreddifferentlytoshow the relationshipof partsto whole.

More Related Content

What's hot (20)

PPTX
Biostatistics
Dr. Senthilvel Vasudevan
 
PPT
Research methodology - Analysis of Data
The Stockker
 
PPTX
Biostatistics khushbu
khushbu mishra
 
PPTX
Data collection,tabulation,processing and analysis
RobinsonRaja1
 
PPTX
Data analysis copy
KAVITHAMONTADKA
 
PPTX
A seminar on quantitave data analysis
Bimel Kottarathil
 
PPT
Wynberg girls high-Jade Gibson-maths-data analysis statistics
Wynberg Girls High
 
PPTX
Fundamentals of biostatistics
Kingsuk Sarkar
 
PPTX
Applications of biostatistics
K040798s
 
PDF
Basic statistics concepts
ECRD2015
 
PPTX
Biostatistics and data analysis
David Enoma
 
PPTX
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Stats Statswork
 
PPT
Chapter 1 biostat
ayanle maigag
 
PPT
Introduction to statistics
Kapil Dev Ghante
 
PPTX
Data representation and analysis - Mathematics
Nayan Dagliya
 
PPTX
RESEARCH METHODOLOGY- PROCESSING OF DATA
jeni jerry
 
PPT
Ebd1 lecture 3 2010
Reko Kemo
 
PPTX
Processing of research data
Ashish Sahu
 
PPTX
analysis and presentation of data
WISDOM WEALTH INTERNATIONAL SCHOOL, TAMILNADU
 
Research methodology - Analysis of Data
The Stockker
 
Biostatistics khushbu
khushbu mishra
 
Data collection,tabulation,processing and analysis
RobinsonRaja1
 
Data analysis copy
KAVITHAMONTADKA
 
A seminar on quantitave data analysis
Bimel Kottarathil
 
Wynberg girls high-Jade Gibson-maths-data analysis statistics
Wynberg Girls High
 
Fundamentals of biostatistics
Kingsuk Sarkar
 
Applications of biostatistics
K040798s
 
Basic statistics concepts
ECRD2015
 
Biostatistics and data analysis
David Enoma
 
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Stats Statswork
 
Chapter 1 biostat
ayanle maigag
 
Introduction to statistics
Kapil Dev Ghante
 
Data representation and analysis - Mathematics
Nayan Dagliya
 
RESEARCH METHODOLOGY- PROCESSING OF DATA
jeni jerry
 
Ebd1 lecture 3 2010
Reko Kemo
 
Processing of research data
Ashish Sahu
 
analysis and presentation of data
WISDOM WEALTH INTERNATIONAL SCHOOL, TAMILNADU
 

Viewers also liked (12)

PDF
Biostatistics and Statistics Career opportunities
Dr.Kumud Sarin
 
PPTX
Lec. biostatistics
Riaz101
 
PDF
Essentials of Biostatistics - Second Edition
Dr. Indranil Saha
 
PPT
Biostatistics in Bioequivalence
Bhaswat Chakraborty
 
PPTX
Advanced Biostatistics - Simplified
Mohammed Alhefzi
 
PPSX
Biostatistics
Pritam Gupta
 
PPTX
Intro biostat1&2
Lucidante1
 
PDF
Nicholas Jewell MedicReS World Congress 2014
MedicReS
 
PDF
1. Introduction to biostatistics
Razif Shahril
 
PPT
Biostatistics
priyarokz
 
PPTX
Research methodology & Biostatistics
Kusum Gaur
 
PDF
Top 100 Linux Interview Questions and Answers 2014
iimjobs and hirist
 
Biostatistics and Statistics Career opportunities
Dr.Kumud Sarin
 
Lec. biostatistics
Riaz101
 
Essentials of Biostatistics - Second Edition
Dr. Indranil Saha
 
Biostatistics in Bioequivalence
Bhaswat Chakraborty
 
Advanced Biostatistics - Simplified
Mohammed Alhefzi
 
Biostatistics
Pritam Gupta
 
Intro biostat1&2
Lucidante1
 
Nicholas Jewell MedicReS World Congress 2014
MedicReS
 
1. Introduction to biostatistics
Razif Shahril
 
Biostatistics
priyarokz
 
Research methodology & Biostatistics
Kusum Gaur
 
Top 100 Linux Interview Questions and Answers 2014
iimjobs and hirist
 
Ad

Similar to Biostatistics (20)

PPTX
2dk9spxsgkmbj3llxgrw-signature-942e20f9f4d90e588b512ceb917b4542d6b0e98ab1d79a...
drpriyankaswasthavri
 
PPTX
Biostatics
Navneet Randhawa
 
PPTX
Biostatistics ppt
santhoshikayithi
 
PDF
Basic Statistics, Biostatistics, and Frequency Distribution
Gaurav Patil
 
PPTX
Intro to statistics
Ratheeshkrishnakripa
 
PPTX
Introduction to the concepts of Biostatics 2.pptx
shanehart
 
PPTX
Biostatistics
PRIYAG63
 
PPTX
STATISTICS.pptx
Dr. Senthilvel Vasudevan
 
PPTX
Biostatistics
khushbu mishra
 
DOCX
BIOSTATISTICS hypothesis testings ,sampling
hridyahp
 
PPTX
biostatistics-210618023858.pptx bbbbbbbbbb
RAMJIBANYADAV2
 
PPTX
Data Organizarion and presentation (1).pptx
MuhammadAsif297069
 
PPTX
Biostatistics
Vaibhav Ambashikar
 
PPTX
Biostatistic 2.pptx
imrantestmails
 
PPT
1- introduction,data sources and types1 (1).ppt
Caramel40
 
PPTX
Biostatistics
anju mathew
 
PPTX
Medical Statistics.pptx
Siddanna B Chougala C
 
PPT
Biostatistics Concept & Definition
Southern Range, Berhampur, Odisha
 
PPTX
Biostatistics pt 1
BipulBorthakur
 
PPT
Data types by dr najeeb
muhammed najeeb
 
2dk9spxsgkmbj3llxgrw-signature-942e20f9f4d90e588b512ceb917b4542d6b0e98ab1d79a...
drpriyankaswasthavri
 
Biostatics
Navneet Randhawa
 
Biostatistics ppt
santhoshikayithi
 
Basic Statistics, Biostatistics, and Frequency Distribution
Gaurav Patil
 
Intro to statistics
Ratheeshkrishnakripa
 
Introduction to the concepts of Biostatics 2.pptx
shanehart
 
Biostatistics
PRIYAG63
 
STATISTICS.pptx
Dr. Senthilvel Vasudevan
 
Biostatistics
khushbu mishra
 
BIOSTATISTICS hypothesis testings ,sampling
hridyahp
 
biostatistics-210618023858.pptx bbbbbbbbbb
RAMJIBANYADAV2
 
Data Organizarion and presentation (1).pptx
MuhammadAsif297069
 
Biostatistics
Vaibhav Ambashikar
 
Biostatistic 2.pptx
imrantestmails
 
1- introduction,data sources and types1 (1).ppt
Caramel40
 
Biostatistics
anju mathew
 
Medical Statistics.pptx
Siddanna B Chougala C
 
Biostatistics Concept & Definition
Southern Range, Berhampur, Odisha
 
Biostatistics pt 1
BipulBorthakur
 
Data types by dr najeeb
muhammed najeeb
 
Ad

More from Aftab Badshah (20)

PPTX
Economic Crisis in SriLanka.pptx
Aftab Badshah
 
DOCX
Classification of cat
Aftab Badshah
 
DOCX
Zahavi handicapp theory
Aftab Badshah
 
PPTX
What is evolution
Aftab Badshah
 
DOCX
The theory of endosymbiosis says that eukaryote cells have evolved from a sym...
Aftab Badshah
 
DOCX
Special creation theory
Aftab Badshah
 
DOCX
Selection pressure
Aftab Badshah
 
DOCX
Rats
Aftab Badshah
 
DOCX
Rabbits
Aftab Badshah
 
PDF
Pap.macroevolution
Aftab Badshah
 
DOCX
Paleontological evidence of evolution
Aftab Badshah
 
DOCX
Mimicry
Aftab Badshah
 
DOCX
Mutation pressure
Aftab Badshah
 
DOCX
Microevolution
Aftab Badshah
 
DOCX
Macroevolution
Aftab Badshah
 
DOCX
Genetic drift
Aftab Badshah
 
DOCX
Fisher theory
Aftab Badshah
 
DOCX
Extinction
Aftab Badshah
 
DOCX
Evolution theories
Aftab Badshah
 
DOCX
Evidences from comparative embryology
Aftab Badshah
 
Economic Crisis in SriLanka.pptx
Aftab Badshah
 
Classification of cat
Aftab Badshah
 
Zahavi handicapp theory
Aftab Badshah
 
What is evolution
Aftab Badshah
 
The theory of endosymbiosis says that eukaryote cells have evolved from a sym...
Aftab Badshah
 
Special creation theory
Aftab Badshah
 
Selection pressure
Aftab Badshah
 
Rabbits
Aftab Badshah
 
Pap.macroevolution
Aftab Badshah
 
Paleontological evidence of evolution
Aftab Badshah
 
Mimicry
Aftab Badshah
 
Mutation pressure
Aftab Badshah
 
Microevolution
Aftab Badshah
 
Macroevolution
Aftab Badshah
 
Genetic drift
Aftab Badshah
 
Fisher theory
Aftab Badshah
 
Extinction
Aftab Badshah
 
Evolution theories
Aftab Badshah
 
Evidences from comparative embryology
Aftab Badshah
 

Recently uploaded (20)

DOCX
Table - Technique selection matrix in CleaningValidation
Markus Janssen
 
PPT
Conservation-of-Mechanical-Energy-Honors-14.ppt
exieHANNAHEXENGaALME
 
PDF
crestacean parasitim non chordates notes
S.B.P.G. COLLEGE BARAGAON VARANASI
 
PPTX
Gene Therapy. Introduction, history and types of Gene therapy
Ashwini I Chuncha
 
PPTX
Pharmaceutical Microbiology (sem-3) unit 1.pptx
payalpilaji
 
PPTX
Akshay tunneling .pptx_20250331_165945_0000.pptx
akshaythaker18
 
PDF
Polarized Multiwavelength Emission from Pulsar Wind—Accretion Disk Interactio...
Sérgio Sacani
 
PDF
Primordial Black Holes and the First Stars
Sérgio Sacani
 
PPTX
Structure and uses of DDT, Saccharin..pptx
harsimrankaur204
 
PPTX
formations-of-rock-layers-grade 11_.pptx
GraceSarte
 
PDF
GK_GS One Liner For Competitive Exam.pdf
abhi01nm
 
PDF
Pharmaceutical Microbiology (sem-3) UNIT IV.pdf
payalpilaji
 
PDF
Is the Interstellar Object 3I/ATLAS Alien Technology?
Sérgio Sacani
 
PPTX
PEDIA IDS IN A GIST_6488b6b5-3152-4a4a-a943-20a56efddd43 (2).pptx
tdas83504
 
PPTX
Diuretic Medicinal Chemistry II Unit II.pptx
Dhanashri Dupade
 
PPT
Human physiology and digestive system
S.B.P.G. COLLEGE BARAGAON VARANASI
 
PDF
Continuous Model-Based Engineering of Software-Intensive Systems: Approaches,...
Hugo Bruneliere
 
PPTX
MODULE 2 Effects of Lifestyle in the Function of Respiratory and Circulator...
judithgracemangunday
 
PDF
THE MOLECULAR GENETICS OF TYPE 1 DIABETES
ijab2
 
PPTX
Anatomy and physiology of digestive system.pptx
Ashwini I Chuncha
 
Table - Technique selection matrix in CleaningValidation
Markus Janssen
 
Conservation-of-Mechanical-Energy-Honors-14.ppt
exieHANNAHEXENGaALME
 
crestacean parasitim non chordates notes
S.B.P.G. COLLEGE BARAGAON VARANASI
 
Gene Therapy. Introduction, history and types of Gene therapy
Ashwini I Chuncha
 
Pharmaceutical Microbiology (sem-3) unit 1.pptx
payalpilaji
 
Akshay tunneling .pptx_20250331_165945_0000.pptx
akshaythaker18
 
Polarized Multiwavelength Emission from Pulsar Wind—Accretion Disk Interactio...
Sérgio Sacani
 
Primordial Black Holes and the First Stars
Sérgio Sacani
 
Structure and uses of DDT, Saccharin..pptx
harsimrankaur204
 
formations-of-rock-layers-grade 11_.pptx
GraceSarte
 
GK_GS One Liner For Competitive Exam.pdf
abhi01nm
 
Pharmaceutical Microbiology (sem-3) UNIT IV.pdf
payalpilaji
 
Is the Interstellar Object 3I/ATLAS Alien Technology?
Sérgio Sacani
 
PEDIA IDS IN A GIST_6488b6b5-3152-4a4a-a943-20a56efddd43 (2).pptx
tdas83504
 
Diuretic Medicinal Chemistry II Unit II.pptx
Dhanashri Dupade
 
Human physiology and digestive system
S.B.P.G. COLLEGE BARAGAON VARANASI
 
Continuous Model-Based Engineering of Software-Intensive Systems: Approaches,...
Hugo Bruneliere
 
MODULE 2 Effects of Lifestyle in the Function of Respiratory and Circulator...
judithgracemangunday
 
THE MOLECULAR GENETICS OF TYPE 1 DIABETES
ijab2
 
Anatomy and physiology of digestive system.pptx
Ashwini I Chuncha
 

Biostatistics

  • 1. Lecture 1 Introduction to Biostatistics Biostatistics=biology+statistics Other definitionsfor“Statistics” Frequentlyusedtoreferredtorecordeddata Denotescharacteristicscalculatedforaset of data : sample mean Statistical procedurescanbe dividedintotwomajorcategories:descriptive statisticsandinferential statistics. Descriptive statistics Descriptive statisticsincludesstatistical proceduresthatwe use to describe the populationwe are studying.The data couldbe collectedfromeitherasample ora population,butthe resultshelpusorganize anddescribedata. Descriptive statisticscanonlybe usedtodescribe the groupthat isbeingstudying.Thatis,the resultscannotbe generalizedtoanylargergroup. Frequencydistributions, measuresof central tendency (mean,median,andmode),andgraphslike pie chartsandbar charts that describe the dataare all examplesof descriptive statistics. Inferential Statistics Inferentialstatisticsisconcernedwithmakingpredictionsorinferencesaboutapopulationfromobservationsand analysesof a sample.Thatis,we can take the resultsof ananalysisusinga sample andcan generalizeittothe larger populationthatthe sample represents.Inordertodothis,however,itisimperativethatthe sample is representative of the grouptowhichitis beinggeneralized. Examplesof inferentialstatisticsincludelinearregressionanalyses, ANOVA,correlationanalyses,andsurvival analysis,toname a few. Parameter A parameterisa value,usuallyunknown(andwhichtherefore hastobe estimated),usedtorepresentacertain populationcharacteristic.Forexample,the populationmeanisaparameterthatisoftenusedto indicate the average value of a quantity. Withina population,aparameterisafixedvalue whichdoesnotvary.Eachsample drawnfromthe populationhas itsown value of anystatisticthat isusedto estimate thisparameter.Forexample,the meanof the datain a sample isusedto give informationaboutthe overallmeaninthe populationfromwhichthatsample wasdrawn. Parametersare oftenassignedGreekletters(e.g.µ ) Statistic A statisticisa quantitythat iscalculatedfroma sample of data.It is usedtogive informationaboutunknownvalues inthe correspondingpopulation.Forexample,the average of the dataina sample isusedtogive informationabout the overall average inthe populationfromwhichthatsample wasdrawn. Statisticsare oftenassignedRomanletters(e.g.mands) Population A populationisanyentire collectionof people,animals,plantsorthingsfromwhichwe maycollectdata.It isthe entire groupwe are interestedin,whichwe wishtodescribeordraw conclusionsabout. The numberof observationsinapopulationiscalledthe size of the populationandisdenotedbythe letterN.
  • 2. Example The populationfora studyof infanthealthmightbe all childrenborninthe UK inthe 1980's. The sample mightbe all babiesbornon 7th May in any of the years. Sample A sample isa groupof unitsselectedfromalargergroup(the population).Bystudyingthe sample itishopedtodraw validconclusionsaboutthe largergroup. A sample isgenerallyselectedforstudybecause the populationistoolarge tostudyin itsentirety.The sample shouldbe representativeof the general population.Thisisoftenbestachievedbyrandomsampling.Also,before collectingthe sample,itisimportantthatthe researchercarefullyandcompletelydefinesthe population,includinga descriptionof the memberstobe included. The numberof observationsinasample iscalledthe size of the sample andisdenotedbythe letter n. Example The populationfora studyof infanthealthmightbe all children borninthe UK inthe 1980's. The sample mightbe all babiesbornon 7th May in any of the years. Lecture 2 Data: The raw material of Statistics is data. We may define data as figures. Figures result from the process of counting or from taking a measurement. For example: 1. When a hospital administrator counts the number of patients (counting). 2. When a nurse weighs a patient (measurement) We search for suitable data to serve as the raw material for our investigation. Such data are available from one or more of the following sources: 1- Routinely kept records. For example: 1. Hospital medical records contain immense amounts of information on patients. 2. Hospital accounting records contain a wealth of data on the facility’s business activities. 2- External sources. The data needed to answer a question may already exist in the form of published reports, commercially available data banks, or the research literature, i.e. someone else has already asked the same question. 3- Surveys: The source may be a survey, if the data needed is about answering certain questions. For example: If the administrator of a clinic wishes to obtain information regarding the mode of transportation used by patients to visit the clinic, then a survey may be conducted among patients to obtain this information. 4- Experiments. Frequently the data needed to answer a question are available only as the result of an experiment. For example:If a nurse wishes to know which of several strategies is best for maximizing patient compliance, she might conduct an experiment in which the different strategies of motivating compliance are tried with different patients. Applications ofbiostatistics
  • 3. It has particular applications to medicine and to agriculture, public health, including epidemiology, health services research,nutrition, and environmental health, Design and analysis of clinical trials in medicine Statistical methods are beginning to be integrated into medical informatics, public health informatics, and bioinformatics Presentation ofdata Massive volume of data It is Difficult to learn data by examining unorganized data Data should be organized and condensed into a form that can be more rapidly and easily understood and interpreted Classification Process of dividing a set of observations or objects into classes or groups in such a way that 1. Observations or objects in the same class or group are similar 2. Observations or objects in each class or group are dissimilar to observations or objects in other class or group. Types ofclassification 1. Data are sorted according to one criterion … one way classification 2. When data are sorted according to two criteria… two way classification. 3. When data are sorted according to severalcriteria… manifold classification. Data may be classified according to qualitative, temporal and geographic characteristic. Arrangement of data according to the values of a variable characteristic is called DISTRIBUTION. When the defining variable is expressed in terms of location ….spatial or geographical distribution. Temporal arrangement of values is called a time series. Tabulation Systematic presentation of data classified under suitable heads and subheads , and placed in columns and rows. Data is easy to understand, facilitate comparison,and effective way to conveyinformation to a reader. Types oftables 1. general purpose table..large in size,are extensive with vast coverage and are constructed for for reference purpose. 2.specific purpose table….simple in structure and deal with one or two criteria of classification .. Used ton analyse or to assist in analyzing data. Parts of a table A statistical table has at least four major parts and some other minor parts. (1) The Title (2) The Box Head (column captions) (3) The Stub (row captions) (4) The Body (5) Prefatory Notes (6) Foots Notes (7) Source Notes Main parts of table and its construction Title :  table must have a self explanatory title.  It should be brief in the form of phrases.
  • 4.  Abbreviations shouldn’t be used  Should be in capitals throughout  The different parts of a title should be separated by commas but no full stop at the end  If in two or more lines .. Inverted pyramid arrangement of the lines should be used. Column captions and boxhead  The heading of each column is called column caption while the section of a table that contains the column aption is called boxhead  The heading should be clear but concise  Most important characteristic is placed in 1st column  Only first word in each column caption shoud be capitalized. No full stop should be put at the end. Row caption and Stub  The heading or title for a row caption is called a row caption and the section containing the row caption is called stub.  The principles of column caption s apply to row caption in the sub Prefatory notes and footnotes  Explanatory notes incorporated in the table bene ath the title and below the bdy is calledPrefatory notes and footnotes respectively.  Prefatory note give additional specification of the data indicative of items included and excluded for all data of the table, statements of the box etc.  Footnotes are used to clarify anything in the table by giving a fuller description Source notes  Every table should have a source note unless the table is an original tabulation and its source is clear from the context  It is placed immediately below the table and below the footnote. Body and arrangement of data  The most important part.  Taking in consideration the basis of classification arrangement of data is made.  The data may be arranged according to o alphabetical order o time of occurance o location o magnitude o by customary arrangement i.e men,women and children Spacing and ruling Thick or double lines are used for emphasis and for separating the title, the boxhead, the stub,etc General Rules ofTabulation: A table should be simple and attractive. There should be no need of further explanations (details). Proper and clear headings for columns and rows should be need. Suitable approximation may be adopted and figures may be rounded off. The unit of measurement should be well defined. If the observations are large in number they can be broken into two or three tables. Thick lines should be used to separate the data under big classes and thin lines to separate the sub classes of data. Lecture 3 FrequencyDistribution
  • 5. A frequencydistributionisanorderlyarrangementof dataclassifiedaccordingtothe magnitude of the observations. Whenthe data are groupedintoclassesof appropriate size indicatingthe numberof observationsineachclasswe geta frequencydistribution.Byformingfrequencydistribution,we cansummarize the dataeffectively.Itisa methodof presentingthe dataina summarizedform.FrequencydistributionisalsoknownasFrequencytable. Usesof FrequencyDistribution 1. Frequencydistributionhelpsustoanalyze the data. 2. Frequencydistributionhelpsustofacilitate the computationof variousstatisticalmeasures UngroupedFrequencyDistribution A frequencydistributionwithan interval widthof 1is refferedtoanungroupedfrequencydistribution.Ungroped frequencydistributionisanarrangementof the observedvaluesinascendingorder.The ungroupedfrequency distributionare those data,whichare notarrangedin groups.They are knownasindividual series.Whenthe ungroupeddataare grouped,we getthe groupedfrequencydistribution. GroupedFrequencyDistributionTable Frequencydistributiontable (alsoknownasfrequencytable) consistsof variouscomponents. Classes:A large numberof observationsvaryinginawide range are usuallyclassifiedinseveralgroupsaccordingto the size of theirvalues.Eachof these groupsisdefinedbyaninterval calledclassinterval.The classinterval between 10 and 20 is definedas10-20. Class limits:The smallestandlargestpossible valuesineachclassof a frequencydistributiontableare knownas classlimits.Forthe class10-20, the classlimitsare 10 and 20. 10 iscalledthe lowerclasslimitand20 iscalledthe upperclass limit. Class mark: Classmark is the midmostvalue of the classinterval.Itisalsoknownasthe midvalue. Midvalue of each class= (lowerlimit+Upperlimit) If the classis 0-10, lowerlimitis0 and upperlimitis10. So the midvalue is (0+10)2 = 102 = 5. Magnitude of a class interval: The difference betweenthe upperandlowerlimitof aclassis calledthe magnitude of a class interval. Class frequency:The numberof observationfallingwithinaclassinterval iscalledclassfrequencyof thatclass interval. Creatinga GroupedFrequencyDistribution Findthe largestand smallestvalues Compute the Range = Maximum - Minimum Selectthe numberof classesdesired.Thisisusuallybetween5and20. Findthe classwidthby dividingthe range bythe numberof classesandroundingup.There are twothingsto be careful of here.Youmust round up,notoff.Normally3.2wouldroundto be 3, butin roundingup,itbecomes4. If the range dividedbythe numberof classesgivesanintegervalue (noremainder),thenyoucaneitheraddone tothe numberof classesor addone to the classwidth.Sometimesyou're lockedintoacertainnumberof classesbecause of the instructions.The Blumantextfailstomentionthe case whenthere isnoremainder. Picka suitable startingpointlessthanorequal tothe minimumvalue.Youwill be able tocover:"the classwidth timesthe numberof classes"values.Youneedtocoverone more value thanthe range.Follow thisrule andyou'll be okay:The starting pointplus the numberof classestimes the class width mustbe greaterthan the maximumvalue. Your startingpointisthe lowerlimitof the firstclass.Continue toaddthe class widthtothislowerlimittogetthe restof the lowerlimits.
  • 6. To findthe upperlimitof the firstclass,subtractone from the lowerlimitof the secondclass.Thencontinue toadd the class widthtothisupperlimittofindthe restof the upperlimits. Findthe boundariesbysubtracting0.5 unitsfromthe lowerlimitsandadding0.5 unitsfrom the upperlimits.The boundariesare alsohalf-waybetweenthe upperlimitof one classandthe lowerlimitof the nextclass.Depending on whatyou're tryingto accomplish,itmaynot be necessarytofindthe boundaries. Tallythe data. Findthe frequencies. Findthe cumulative frequencies.Dependingonwhatyou're tryingto accomplish,itmaynotbe necessarytofindthe cumulative frequencies. If necessary,findthe relativefrequenciesand/orrelativecumulative frequencies LECTURE 4 Graphical representation The visual displayof statistical datainthe formof points, lines,areasandothergeometrical formandsymbolsis calledgraphical representation Two maingroups;Graphs & diagrams Difference betweenagraph and a diagram Graph is a representationof databya continuouscurve usuallyshownona graphpaper. Diagram isany otherone, twoor three dimensional formof visual representation. Diagrams Bestsuitedtospatial series anddatasplitintodifferentcategories For comparisonof same type of data at differentplaces Advantages More attractive thanfigures. Beinga visual display,leave alonglastingandmore effective impressiononthe mindof a reader Comparisoniseasierwithdiagrams. Disadvantage isthat itis lessaccurate than tables. Drawing a diagram An appropriate scale shouldbe chosen Must have a title,akeyfootnote or source note is alsonecessary It shouldbe shaded,coloredorcrosshatchedto show the differentparts,if any.
  • 7. Types ofdiagram 1. Simple barchart 2. Multiple barchart 3. Componentbarchart 4. Rectanglesandsubdividedrectangles 5. Pictograms 6. Pie diagrams Simple bar chart Consistsof horizontal orvertical barsof equal widthsor lengthsproportional tothe valuestheyrepresent. The space separating the barsshouldn’texceedthe width of the bar and shouldn’tbe lessthanhalf of itswidth. The bar shouldneitherbe exceedinglylongandnarrow nor shortand broad. Multiple bar chart It showstwoor more characteristicscorrespondingtothe value of a commonvariable inthe formsof groupedbars, whose lengthsare proportional tothe valuesof characteristics,andeachof whichisshadedor coloreddifferentlyto aididentification. It isa gooddevice forthe comparisonof twoor three kindsof information. Componentbar chart It isan effective technique inwhicheachbarisdividedintotwoormore sections,proportional insizetothe componentpartsof a total beingdisplayedbyeach bar. Variouscomponentsare shadedorcoloreddifferentlyto increase the overall effectivenessof the diagram It isusedto representthe accumulationof the variouscomponentsof dataandthe percentages. Theyare alsoknownas subdividedbars. Rectanglesand subdividedrectangles The area of rectangle isequal tothe productof itslengthandbreadth. To representaquantitybya rectangle,bothlengthandbreadthof the rectangle are used. Subdividedrectanglesare drawnforthe data where the quantitiesalongwiththeircomponentsare tobe compared. For construction: Change eachcomponentintothe percentage of the correspondingtotal. Draw one rectangle foreachtotal,takingequal lengthsandbreadthsproportional tothe total.
  • 8. Pictograms It isa device forportrayingstatistical databymeansof picturesorsmall symbols. Representaunitvalue of the data by a standardsymbol or a picture andthe whole quantitybybyanappropriate numberof repetitionsof symbolconcerned. A quantitylessthana unitisrepresentedbyapart of the symbol used. Pie diagrams/sector diagrams It consistsof a circle dividedintosectorsorpie shapedpieceswhose areasare proportional tothe variouspartsinto whichthe whole quantityisdivided. Sectorsare shadedor coloreddifferentlytoshow the relationshipof partsto whole.