• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
Front

How to study your flashcards.

Right/Left arrow keys: Navigate between flashcards.right arrow keyleft arrow key

Up/Down arrow keys: Flip the card between the front and back.down keyup key

H key: Show hint (3rd side).h key

image

PLAY BUTTON

image

PLAY BUTTON

image

Progress

1/40

Click to flip

40 Cards in this Set

  • Front
  • Back
residual
difference of
observed y - predicted y
= y - ŷ
residual plot
scatter plot of the regression residual against the explanatory variable.
outliers
a point that lies outside the overall pattern of the scatter plot.
influential outliers
if removing it would make a markedly change to the results of calculations

outliers in the x direction are usually influential to the least squares regression line
lurking variable
a variable that is not the explanatory or the response variable but it may influence the interpretation of the relationships among those variables
Simpson's paradox
an association or comparison that holds for all several groups can reverse directions when the data are combined to form a single group.
causation
x causes y
common response
z causes x and y
confounding
z, and x are influential to y but do not cause y
anecdotal evidence
comes from haphazardly selected individual cases, are striking in some way. these cases need not be representative of any larger group of cases
shows bias
available data
data produced in the past that may help answer a present question
observational study
observe individual and measure variables of interest but do not attempt to influence the response
experiment
deliberately impose some treatment on individuals and observe their response
random allocation
assigning people to groups randomly
compare
two or more treatments to control the effect of lurking variables
randomize
use impersonal chance to assign experimental units to treatments
repeat
repeat to reduce chance of variation in the results
block
group of experimental units that are known before the experiment to be similar in some way that is expected to affect the response to the treatment.
block design
the random assignment of units to treatments for each block

subject --> men--> random assignment--> group 1,2,3--> treatment 1,2,3--> compare survival

the random assignment of units to treatments for each block

subject --> women--> random assignment--> group 1,2,3--> treatment 1,2,3--> compare survival
population
the entire group of individuals that we want information about
sample
part of the population that we actually examine in order to gather information
voluntary response sample
people who choose to respond, are bias b/c people with strong opinions are more likely to respond
simple random sample
individuals from the population chosen in such a way that every set of individuals has an equal chance to be the sample actually selected
probability sample
chosen by chance. know what samples are possible and what chance each possible sample has
stratified random sample
first divide the population into groups of similar individuals called strata. then choose a separate SRS in each stratum and combine these to form a full sample
undercoverage
when a population is left out of the process of choosing the sample
nonresponse
individual cant be contacted or doesn't cooperate
parameters
number that describes population
a fixed number that we do not know
statistic
describes a sample is known when we take a sample, can change from each sample
used to estimate unknown parameters
sampling distribution
of a statistic, is the distribution of values taken by the statistic in all possible samples of the same size and from the same population
reduce bias
use random sampling
reduce the variability
use a larger sample

variability: the spread of the sampling design and the sample size
institutional review board
reviews the planned studies in advanced in order to protect the subjects from possible harm
residual plot, pattern
don't want a patten
joint distribution
always use total
% of people out of total
group total divided by total.
is/of= %/100
marginal distribution
always use total
% of people out of one variable
individual total divided by total
conditional distribution
of education level for each pet ownership group

when you see for that is the part you are talking about.

given that we are talking about pet ownership __% have ____education
experimental design
The plan of the experiment which specifies the treatment conditions (independent variables ), what is to be measured (dependent variables ) and methods of assigning subjects to groups.
parameter
mean= μ
standard deviation= σ
proportion= p
statistic
mean= xˉ
standard deviation= s
proportion=p^
p hat