Remember that the first and second assumptions of the regression analysis implies that residuals (or errors) must have constant variance for different values of the explanatory variable. The below figure shows three behaviors of residuals. Notice that although residuals have a mean of zero in all three cases, only the third one shows a constant variance for residuals. The first and second cases show a varying constant for residuals. This situation is referred to as heteroscedasticity.
Search the internet to find a real-world dataset that shows heteroscedasticity. Show the corresponding scatter diagram to visualize the varying variance for the residuals. Your scatter diagram showing the relationship between the dependent and explanatory variable should look like the following (in case of an increasing trend for variance of residuals). Note how observations of the variable get wider and wider around the regression line for larger values of .
Why do you think the dataset that you found shows a heteroscedasticity behavior? You need to justify this behavior based on your understanding of the data.
Search the internet to find out what issues heteroscedasticity causes in a regression analysis. Make sure you use your own words in your response.
Propose at least three ways to deal with heteroscedasticity and resolve it. Make sure you use your own words in your response.