Chapters 5-8
Farmingdale State College
Which of the following values for Pearson’s correlation coefficient (r) represents the strongest relationship?
If two variables have a correlation coefficient of r = 0, what does this indicate?
If two variables have a correlation coefficient of r = 0, what does this indicate?
What does the slope of the regression line represent?
What does the slope of the regression line represent?
Which of the following best describes the meaning of an R-squared value of 0.85?
Which of the following best describes the meaning of an R-squared value of 0.85?
a) 85% of the variation in the dependent variable is explained by the independent variables
b) The correlation between the variables is 0.85
c) The model’s predictions are 85% accurate
d) The independent variables account for 15% of the error in predictions
True or False: A correlation coefficient of -0.85 indicates a weaker relationship than a correlation of 0.50.
True or False: A correlation coefficient of -0.85 indicates a weaker relationship than a correlation of 0.50.
False
True or False: Linear regression is appropriate only when the relationship between the variables is nonlinear.
True or False: Linear regression is appropriate only when the relationship between the variables is nonlinear.
False
True or False: An outlier can have a significant impact on both the correlation coefficient and the slope of the regression line.
True or False: An outlier can have a significant impact on both the correlation coefficient and the slope of the regression line.
True
What is the difference between simple linear regression and multiple linear regression?
What is the difference between simple linear regression and multiple linear regression?
Answer: Simple linear regression uses one independent variable to predict the dependent variable, while multiple linear regression uses two or more independent variables.
Describe the difference between correlation and causation. Why is it important not to infer causation directly from correlation?
Answer: Correlation refers to a relationship or association between two variables, while causation indicates that one variable directly affects another. It is important not to infer causation from correlation because there may be confounding variables or the relationship may be coincidental.
Suppose you perform a regression analysis and the p-value of one of your predictors is 0.15. How would you interpret this result? Should you include this predictor in your model?
Answer: A p-value of 0.15 suggests that the predictor is not statistically significant at conventional levels (e.g., 0.05). You may consider removing this predictor, though you might keep it if there is theoretical justification
A researcher collects data on study hours and exam scores and finds a correlation coefficient of r = 0.65.
What does this tell you about the relationship between study hours and exam scores?
Would you expect the relationship to be strong, moderate, or weak? Why?
A researcher collects data on study hours and exam scores and finds a correlation coefficient of r = 0.65.
You run a simple linear regression analysis and get the following equation:
\(\hat{Y} = 2.5 + 1.8X\)
What is the predicted value of Y if X is 4?
What does the coefficient 1.8 represent in this context?
\(\hat{Y} = 2.5 + 1.8X\)
\(\hat{Y} = 2.5 + 1.8(4) = 2.5 + 7.2 = 9.7\)
What does the coefficient 1.8 represent in this context?
1.8
represents the predicted change in Y for each additional unit of X. In this context, it means that for each additional unit of X, Y is expected to increase by 1.8
units.Consider the output of a regression analysis where the R-squared value is 0.70 and the coefficient of the independent variable is 3.2 with a p-value of 0.02.
Interpret the R-squared value in terms of the model’s explanatory power.
Based on the coefficient and p-value, would you consider this predictor variable significant? Why or why not?
Consider the output of a regression analysis where the R-squared value is 0.70 and the coefficient of the independent variable is 3.2 with a p-value of 0.02.
Answer: An R-squared value of 0.70 means that 70% of the variance in the dependent variable is explained by the independent variable(s) in the model.
Answer: Yes, the predictor is significant. The p-value of 0.02 is less than 0.05, indicating that the predictor variable is statistically significant at the 5% level.