Highlights
The dataset CSDATA.xlsx Case Study,
Scenario: The dataset CSDATA.xlsx (attached to the case study in Blackboard) contains survey responses of 500 heads of households in the Philadelphia area. The data contains information on their annual consumption expenditures and other information regarding the characteristics of their household. Your job is to construct the best regression model to explain how household characteristics impact consumption expenditures. The variables in the dataset are as follows:
Yi: Annual consumption expenditures of household i in thousands of US dollars.
X1i: Annual gross income of household i in thousands of US dollars.
X2i: Hourly wage rate of household i in US dollars.
X3i: Number of dependents in household i (i.e. number of children)
X4i: Average age of dependents in household i in years.
X5i: College education (1 = respondent has a college education, 0 = respondent does not have a college education)
1. Test your independent variables (X1i through X5i) for potential multicollinearity. Report the variance inflation factor (VIF) for each independent variable, and state your conclusion. If your test suggests that you should remove an independent variable from your analysis, then remove it and test the remaining independent variables for multicollinearity again.
2. Run a regression with consumption expenditure (Yi) as your dependent variable and the remaining X variables as your independent variables. Check if any of the regression coefficients are insignificantly different from zero. If so, remove the corresponding independent variables (one at a time) from your analysis and arrive at a model where all regression coefficients are statistically significant. Only state your final regression results.
3. Using your results from question 2, perform a partial residual analysis by testing the linearity assumption of your regression. Does there appear to be a pattern in any of the relationships between the residuals and your independent variables? If so, then extend your dataset by adding a nonlinear term of the independent variable in question (i.e. add a squared term). State your final regression results which successfully pass a residual analysis.
4. Using your results from question 3, answer the following questions.
a. What is the R2 value? What does this say in the context of the scenario?
b. Interpret each of the relationships between consumption expenditure and the independent variables. In the occurrence of a linear and quadratic term, be sure to interpret both terms jointly and not separately.
c. Test the following hypothesis: For every $1 increase in income on average, a household will increase their consumption expenditure by MORE THAN $0.25. Be sure to state the null and alternative hypotheses, the p-value of the test, and your conclusion.
This Statistics Assignment has been solved by our Statistics Experts at My Uni Paper. Our Assignment Writing Experts are efficient to provide a fresh solution to this question. We are serving more than 10000+Students in Australia, UK & US by helping them to score HD in their academics. Our Experts are well trained to follow all marking rubrics & referencing style.
Be it a used or new solution, the quality of the work submitted by our assignment Experts remains unhampered. You may continue to expect the same or even better quality with the used and new assignment solution files respectively. There’s one thing to be noticed that you could choose one between the two and acquire an HD either way. You could choose a new assignment solution file to get yourself an exclusive, plagiarism (with free Turnitin file), expert quality assignment or order an old solution file that was considered worthy of the highest distinction.
© Copyright 2026 My Uni Papers – Student Hustle Made Hassle Free. All rights reserved.