1 (a). Since the amount garnished is accumulating at a rate that will pay off the loan at around 12 months, in six months it should have been paid half the total amount. Hence the slope should be 45 degrees (value=1/2) and intercept should be 0.
1 (b). Plot of the variables are shown below along with the LS regression line:
Bivariate Fit of Amt.Repaid.at.6.Months By Total.Amt.to.be.Repaid
Hence,
Slope = 0.49 ; Intercept = 2926.25.
The slope is fairly close to 1/2. An intercept of 0 can almost never be expected in actual data (unless simulated to perfection), hence the value should be expected below or above 0. Intercept value of 2926 is, however, well above 0 and could be explained by a big residual at point 36 (case number 450819117) which has a relatively high value for the amount repaid at six months. This point could explain the big shift in intercept.
1 (c). The residual plot is shown below:
Bivariate Fit of Residuals Amt.Repaid.at.6.Months By Total.Amt.to.be.Repaid
Linear Fit
Residuals Amt.Repaid.at.6.Months = -3.52e-13 + 5.816e-18*Total.Amt.to.be.Repaid
The normal Quantile Plot for the residual is shown below:
Distributions
Residuals Amt.Repaid.at.6.Months
For the data to conform to the assumptions of SRM, it needs to agree to the following points:
i) Residuals are independent ii) Residuals have mean zero and have constant variance. iii) Residuals are normally distributed.
From the Normal Quantile plot, it appears that the residual violates the third assumption as a normal distribution is not seen. Because of the scale of the plot, it is hard to observe the independence and constant variance assumptions; scaling has been distorted because of the big residual point (#36).
It should be noted that point #36 (case number 450819117) appears to have a data error, specifically the amount repaid at six months ($3,533,140) appears to be significantly higher than total amount to be repaid ($67,952). It is possible that a decimal point could have been missed and the actual amount could be $35,331.40 which is approximately half of the total amt to be paid.
Assuming that this point has wrong data, we will exclude this point from our plot to observe a better scaled version of the residual. The fact that this is not highly leveraged and hence not influential (observed from the fact that the fit line appear similar before and after excluding the point), we will proceed with the exclusion plot as shown below:
Bivariate Fit of Residuals Amt.Repaid.at.6.Months By Total.Amt.to.be.Repaid
Linear Fit
Residuals Amt.Repaid.at.6.Months = -4342.507 - 0.0266809*Total.Amt.to.be.Repaid
We observe some level of Heteroscedascity thus invalidating assumption ii.
Finally, the Normal Quantile plot of the residual (excluding the big residual point): Distributions Residuals Amt.Repaid.at.6.Months
Clearly, the residual plot doesn’t appear to be Normally distributed and also seem to follow a sinusoidal pattern thus invalidating independence assumption.
Hence, the data doesn’t conform to SRM. It appears, the residuals get bigger as the total amount to be repaid gets bigger. This phenomenon should however be expected because of the selection of the plot. As the total amount to be paid increases, the amount paid in six months is expected to increase thus displaying a “fan” effect (Heteroscedascity).
2 (a). If all the loans are performing comparably on a constant pay-out scale and accumulating at a rate that will pay off the loans at around 12 months, then Amount repaid at six months should be about half of the total amounts due in 12 months. Thus PRSM should be 1 (2*1/2); this plotted against total amount should give a straight line with slope around 0 and intercept about 1.
2 (b). The plot for PRSM vs Total Amt to be repaid is shown below with the LS regression line:
Bivariate Fit of PRSM By
Vito Ricci - R Functions For Regression Analysis – 14/10/05 (vito_ricci@yahoo.com) R FUNCTIONS FOR REGRESSION ANALYSIS Here are some helpful R functions for regression analysis grouped by their goal. The name of package is in parentheses. Linear model Anova: Anova Tables for Linear and Generalized Linear Models (car) anova: Compute an analysis of variance table for one or more linear model fits (stasts) coef: is a generic function which extracts model coefficients from objects returned by…
Regression Analysis of Graduation % Dependent Variable: Graduation % The Graduation percentage is the dependent variable that the independent variable will influence. In this scenario the independent variables will be SAT math (SATM) scores, SAT vocabulary (SATV) scores, percentage of the faculty that have a PHD (%FacPhD), and percentage of high school graduates who graduated in the top 20% of their class (%Top20%), which will help to determine the most logical forecast of a colleges graduation…
Risk is the one of the core concept of finance industry. The subprime mortgage crisis that happened in the United Stats since the end of 2007 queried the ability and the methods of the more efficient risk management. The traditional risk management that using variance to measure risk has been unable to play a part in present capital market. In other words, basing on a risk management sentiment, the occurrence of crisis required new approaches capable of precisely evaluating the systematic risk under…
Introduction: Correlation analysis: Sales and price: Price is an important factor for the success of a business. Price is amount of money someone is willing to pay in exchange for an asset or service. It is crucial that the price is not too low that the company does not make profit or if the price is too high less people will be able to purchase the product. Price plays a significant impact on sales. Price and sales tend to have a negative relationship. In other words when price increase demand…
Smithers the correct analysis of “year 4” projected cell phone orders, Paul needs to put together a regression analysis by looking at the orders for the past 36 months (the past 3 years). When looking at the past orders, Paul will then be able to analyze months 37-49 (year 4) and give this analysis to Mr. Smithers with a spread sheet and facts that prove these numbers accordingly. In order to come up with “Year 4” numbers I used my graphing calculator. First I clicked Stat, Edit, Then put in numbers…
This report summarizes findings from an analysis of optimal output and pricing for baguettes for Rob’s Select Markets. You provided me with data from the previous 20 weeks in four different categories: quantity produced, variable cost, price, and total sales. Using this data, I created a multiplicative exponential regression model and linear regression model that allowed me to find the optimal output and pricing for baguette production. Below are the multiplicative exponential and linear functions…
How FOREIGN OWNERSHIP has affected the performance and revenues of teams in the English Premier League Prof. Callan EC 275 INTRODUCTION In this project I will be exploring if foreign ownership has affected the performance and revenues of teams in the English Premier League (EPL) looking at statistical performances of x10 teams (Arsenal, Aston Villa, Chelsea, Everton, Liverpool, Manchester City, Manchester United, Newcastle United, Tottenham Hotspur and West Ham United) from 2002 to 2012. My decision…
To: Dr. Babb From: Richard Follett Table of Contents Summary of Analysis of stats from Major League Baseball’s 1997 season…………………………………………… 1 Figures…………………………………………………………………………………………………………………………………………………..3 Summary The 1997 season of Major League Baseball ended in an epic fashion. The Florida Marlins, the National League wild card that year, ended up going on to win the World Series in movie worthy fashion over the Cleveland Indians. The Marlins finished the regular season nine games behind…
Linear Regression Analysis Aarian Charania Professor Habibullah Section: 8am Introduction The independent research company Consumer Research, Inc. conducted research and collected data of annual income and household size, and annual credit card charges. In this report I plan find how much our independent variables (household size and income) affects our dependent variable annual credit card charges. By using descriptive statistics such as mean and standard deviation in order to look…