• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/40

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

40 Cards in this Set

  • Front
  • Back
A numerical measure of the strength of the association between two variables representing quantitative data:
the linear correlation coefficient r
Paired sample data is aka:
bivariate data
This data is used to find the value of r:
paired sample data
The value of r is used to conclude:
that there is or isn't a linear correlation between two paired variables.
A correlation exists between two variables when:
the values of one variable are somehow associated with the values of the other variable.
This measures the strength of the linear correlation between the paired quantitative x-and y-values in a sample.
The linear correlation coefficient r
The linear correlation coefficient is sometimes referred to as:
the Pearson product moment correlation coefficient in honor of Karl Pearson who originally developed it.
The linear correlation coefficient r is calculated using _________ data.
sample data.
∑ denotes:
addition of the items indicated
∑ₓ =
the sum of all x-values
∑ₓ² indicates:
that each x-value should be squared and then those squares added.
(∑ₓ)² indicates:
that the x-values should be added and the total then squared.
∑ₓᵧ indicates:
that each x-value should first be multiplied by its corresponding y-value. After obtaining all such products, find their sum.
r =
the linear correlation coefficient for a sample of paired data.
ρ =
the linear correlation coefficient for a population of paired data.
What are the requirements when making a conclusion about correlation in the population?
1) The sample of paired (x,y) data is a simple random sample of quantitative data.
2) Visual examination of the scatterplot must confirm that the points approximate a straight-line pattern.
3) The pairs of (x,y) data must have a bivariate normal distribution. ( No outliers)
Conclude that "there is a strong linear correlation between x and y" if the P-value computed from r is:
P-v ≤ α
Conclude that "there is no linear correlation between x and y" if the P-value computed from r is:
P-v > α
x is the __________ or _____________ variable.
y is the __________ or _____________ variable.
x is the Explanatory or Independent variable.
y is the Response or Dependent variable.
The value of r is always between:
-1 and 1 That is,
-1 ≤ r ≤ 1
If all values of either are converted to a different scale, does the value of r change?
no, it doesn't change.
Is the value of r affected by the choice of x or y? Meaning if you interchange all x and y values, will r change?
No, interchange all x and y values and the value of r will not change.
Does r measure the strength of a relationship that isn't linear?
No, r measures the strength of a linear relationship. It is not designed to measure the strength of a relationship that is not linear.
Is r sensitive to outliers?
Yes, r is very sensitive to outliers in the sense that a single outlier can dramatically affect its value.
The value of r² is the:
proportion of the variation in y that is explained by the linear relationship between x and y.
If x is the cost of pizza and y the cost of fare, and r = 0.988 and r² = 0.976, then about 98% of the variation in the cost of a subway fare can be explained by the linear relationship between the costs of pizza and subway fares. This implies that about 2% of the variation in costs of subway fares cannot be explained by the costs of pizza.
What are the three common errors involving correlation?
1) Concluding that correlation implies causality.
2) Calculating correlation based on avgs. (they can inflate correlation).
3) Thinking that because there is no LINEAR correlation, that there is no correlation at all. (There could be some other non-linear correlation.)
What is the procedure for hypothesis testing for correlation using the P-v method?
1) Verify the requirements are met.
2) Find H₀ and Hₐ.
3) Find ttest.
4) Calculate P-v. (Use tcdf and n - 2 df)
5) Determine the relationship between P-v and α.
6) Make statement.
H₀: ρ = 0 means:
Hₐ: ρ ≠ 0 means:
H₀: ρ = 0 means: No linear correlation between x and y.
Hₐ: ρ ≠ 0 means: There is a strong linear correlation between x and y.
For a claim of correlation,
H₀:
Hₐ:
H₀: ρ = 0 (oc)
Hₐ: ρ ≠ 0 (cc)
For a claim of a negative correlation,
H₀:
Hₐ:
H₀: ρ = 0 (cc)
Hₐ: ρ < 0 (oc)
For a claim of a positive correlation,
H₀:
Hₐ:
H₀: ρ = 0 (cc)
Hₐ: ρ > 0 (oc)
Conclude that "there is a strong linear correlation between x and y" if |rtest|:
if the absolute value of rtest is greater than rcr. ie...
|rtest| > rcr.
Conclude that "there is no linear correlation between x and y" if|rtest|:
if the absolute value of rtest is less than rcr. ie...
|rtest| ≤ rcr.
How do you use the TI-84 to find ∑ₓ, ∑ₓ², ∑ᵧ, ∑ᵧ², ∑ₓᵧ?
TI-84 to find ∑ₓ, ∑ₓ², ∑ᵧ, ∑ᵧ², ∑ₓᵧ: stat, calc, 2-var stats, L₁,L₂
How do you use the TI-84 to find the Ttest statistic and
P-v for the P-v method of hypothesis testing?
stat, test, LinRegTTest, L₁, L₂
How do you use the TI-84 to find r?
stat, calc, LinReg(ax + b)
n =
the number of pairs of sample data
What distribution is used when testing correlation?
The Correlation Coefficient chart (table A - 6)
df when testing correlation?
There are no degrees of freedom. Use n (the number of pairs of sample data) to search for rcr on table A-6.
What is the procedure for hypothesis testing for correlation using the rtest method?
1) Verify the requirements are met.
2) Find H₀ and Hₐ.
3) Find rtest.
4) Find rcr.
5) Determine the relationship between rtest and rcr.
6) Make statement.