Statistics Random Data And Numerical Data

This Statistics chapter is about random data and numerical data.

statistics

a branch of mathematics dealing with the collection, analysis, interpretation and presentation of masses of numerical data

descriptive statistics

organizes, presents, summarizes: graphically, numerically, pictorially..convenient and informative, simplifies comparisons

Inferential

estimates, predicts, decides, draw conclusions or inferences, USES SMALLER AMOUNTS OF DATA

population

is the group (individuals,items, measurements) of interest which is usually not easy to access directly

paramater

describes some characteristics of the population. paramaters are usually unknown

sample

a part of the population that we actually examine and for which we collect data. it is smaller (sometimes more accessible) group

statistic

is a number that describes a characteristic of a sample. often, a sample is used to estimate an unknown parameter

statistical inference

the process of making an estimate, prediction, or decision about a population based on a sample

confidence levels

tells the proportion of time that our conclusion will be correct

significance levels

a numerical measure of how often a result will be wrong

variable

a characteristic of a population or sample of interest
interval
nominal
ordinal

interval

real numbers..heights weights prices- quantitative or numerical histograms

nominal

qualitative or categorical. the values of nominal data are categories. ex: single=1 married=2 pie charts and bar graphs

ordinal

values have an order a ranking to them. order is maintained no matter what numeric values are assigned.

bar chart

used to display frequencies
the bar represents each category, height of the bar represents the frequency
the base of the bar is arbitrary

pie chart

shows relative frequencies. the pies represent categories.

histogram stem and leaf and ogive

are used when the data is interval

contingency table

used to describe the relationship between two nominal variables. lists the frequencies of each combination of the values of the two variables. the data can then be summarized in a bar cart graphically

two interval variables related?

scatter diagram or scatterplot

independent variable

predictor, explanatory- stays the same. is labeled x on the horizontal axis

dependent variable

is the outcome or the response and is labeled y on the vertical axis

one nominal and one interval

bar chart is an effective way to summarize this.

time series plot

observations measured at successive points in time.graphed on a line chart. time periods go on the horizontal axis

mean

average. appropriate for describing interval data.

x-bar

mean for a sample

mean for a population

median

appropriate for interval or ordinal data. best for data dealing with extreme values. computed the same for population and sample. (n+1)/2

mode

occurs most frequently. useful for all data mainly for identifying the group with the highest frequency for nominal data

geometric mean

used when the variable is a growth rate of change.

range

largest observation-smallest observation

variance

population- sigma squared
sample s^2

standard deviation

square root of the variance

Cross-sectional data

data that is collected at a certain point in time. starting salaries of mba students.

longitudinal data

is collected over a period of time. weekly starting prices of gold.

prospective

collected from the current point in the future

retrospective or historical

collected on events that have happened in the past.

sampling

the process of selecting a subset of a whole population. cost efficient. and practical. the sample and the target population should be similar to each other.

sampling plan.

a method or procedure for specifying how a sample will be taken from a population.
simple random
stratified random
cluster sampling

simple random

everyone has an equal chance of being selected

stratified random

separating the population into mutually exclusive set. split into groups first and then use random sampling

cluster sample

a simple random sample of groups or clusters. may increase error

Homepage