Einstein Headline Animator

Thursday, August 13, 2009

Inferential Presentation: Statistics

Inferential Presentation: Statistics

This exploratory presentation looks at expense account data of a company’s seven sales representatives over the past four months. The study investigated the relationships between the variables of amounts claimed and amounts reimbursed in relation to legitimate expense claims in order to test the null hypothesis that reimbursement is dependent upon claims fulfilling legitimate expense claim criteria. The value of alpha is equal to a 0.05 level of significance. The inferential analysis follows that outlined by the previous descriptive analysis which demonstrated a strong and dependable relationship between the dependent variable ‘claimed’ and the explanatory variable ‘reimbursed’.

This shows that the data were correctly processed by simply using clustering for the company’s seven sales representatives over the previous four months from the previous data set.

Table 1

Claimed

Reimbursed

Correlation Table

Data Set #1

Data Set #1

Claimed

1.000

0.857

Reimbursed

0.857

1.000

It shows that the correlation data 0.857 is not correct based on computation using Microsoft excel programs which revealed the value of 0.8768125

Claimed

Reimbursed

Covariance Table

Data Set #1

Data Set #1

Claimed

78383.46

64843.84

Reimbursed

64843.84

73014.63

This computation for the covariance is absolutely wrong. The actual computations for covariance using Microsoft excel program is

Covariance

Claimed

Reimburse

Claimed

8330.798469

Reimburse

2468.482143

951.3928571

The above table 1 shows the positive correlation between the variables of claims and reimbursed claims as derived from the StatTools output for calculation and covariance.

This statement is correct because there was a positive correlation with a matrix value of 0.8768125.

Looking further into the relationship using regression analysis we find the following below in table 2:

Table 2

Multiple

R-Square

Adjusted

Std Err of

Summary

R

R-Square

Estimate

0.8575

0.7353

0.7330

139.7820051

Degrees of

Sum of

Mean of

F-Ratio

p-Value

ANOVA Table

Freedom

Squares

Squares

Explained

1

6188762.843

6188762.843

316.7388

<>

Unexplained

114

2227447.019

19539.00894

Coefficient

Standard

t-Value

p-Value

Lower

Upper

Regression Table

Error

Limit

Limit

Constant

-18.57178804

19.19132541

-0.9677

0.3352

-56.58965587

19.44607978

Claimed

0.82859208

0.046557551

17.7972

<>

0.736361931

0.920822229

The coefficient of correlation of 0.828 for claimed, shows a high correlation and therefore a strong relationship between the dependent variable (reimbursement) and the explanatory variable (claimed).

Using the above the tables (Table 2), the value for the correlation is not 0.828, but, 0.8575. The correlation statement is only quite correct. It should be very high correlation because it is near to 1 and above 0.75.

The confidence limits start at a positive lower limit value of 0.736 at p-value of <0.0001 style="mso-bidi-font-weight: normal">

If the computed data was correct the statement was correct, however, based on computation the regression slope of the confidence interval is very small with 0.060265684 to 0.507232254 which is closer to zero, and has significant relation to the “legitimate” expense claims.

The R2 0.735 at a p-value of <0.0001 style="mso-bidi-font-weight: normal">

If the computed data was correct the statement was correct however based on the computation, the resultant R Square 0.7565 is very close to 1 and it is above 0.75 which means that the correlation is near normal curve distribution, so, it is interpreted as very high positive correlation. Thus, in percentile (%), 75.65 is an indicator of significant relationship between the variables of amounts claimed and amounts reimbursed in relation to “legitimate” expense claims. Finally, the R square of 0.756494365is quite near to the adjusted R square 0.695617957. This means that the regression model approximately fits the data.

From the data above a simple linear regression equation can be calculated thus;

Y= 0.828 X2 18.571

(Y= claimed and X2 = reimbursed)

In this data you can see that the values are inconsistent with those in table 1. This means that the data were absolutely wrong.

The hypothesis test in table 3 below, we look at the alternative hypothesis that submitted claims i.e. those deemed to legitimate are not dependant upon the reimbursement criteria being fulfilled. The results illustrate that at an output at 1% significance it fails to reject the null hypothesis as the p-value is just greater than 0.05 at 0.0519. Indeed, the failure to reject the null is maintained at 5% level of significance.

Table 3

StatTools

(Core Analysis Pack)

Analysis:

Hypothesis Test

Performed By:

Computing Services

Date:

25-Jun-09

Updating:

Live

Claimed

Reimbursed

Sample Summaries

Data Set #1

Data Set #1

Sample Size

116

116

Sample Mean

$303.66

$233.03

Sample Std Dev

$279.97

$270.53

Equal

Unequal

Hypothesis Test (Difference of Means)

Variances

Variances

Hypothesized Mean Difference

0

0

Alternative Hypothesis

<> 0

<> 0

Sample Mean Difference

$70.62

$70.62

Standard Error of Difference

36.14719623

36.14719623

Degrees of Freedom

230

229

t-Test Statistic

1.9537

1.9537

p-Value

0.0519

0.0520

Null Hypoth. at 10% Significance

Reject

Reject

Null Hypoth. at 5% Significance

Don't Reject

Don't Reject

Null Hypoth. at 1% Significance

Don't Reject

Don't Reject

Equality of Variances Test

Ratio of Sample Variances

1.0710

p-Value

0.7135

In addition, when we look at the reimbursed outputs in table 4 below, we can see that the coefficient of correlation for reimbursed is a high correlation value at 0.858, again indicating a strong relationship the dependent variable (claimed) and the explanatory variable (reimbursed). The confidence limits begin in the positive 0.736 at <0.0001>2 0.735 at a p-value of <0.0001>

Using the given tables, correlation was not high correlation, but, it is very high correlation because it is above 0.75. The R2 0.735 is very near to the adjusted R 0.733 which means that the regression model approximately fits the data very well.

From the data in table 4 below a simple linear regression equation can be calculated thus;

Y+ 0.887 X2 96.847

(Y= reimbursed and X2 = claimed)

In this data you can see that the values are inconsistent with those in table 1. This means that the data were absolutely wrong.

The hypothesis table 3 shows the alternative hypothesis i.e. that successfully reimbursed claims - those confirmed legitimate are not dependant upon the claims criteria being fulfilled. The results show that the p-value for reimbursed is just above 0.05 at 0.0520 and therefore at a 1% significance fails to reject the null hypothesis. Again, failure to reject the null is maintained at the 5% level of significance.

This statement is correct because the null hypothesis of this study was accepted in the computation of the analysis of variance. The result of the “Analysis of Variance” (ANOVA) shows that the computed F (12.43) is less than the tabular values of F-statistics (230) at 0.05 degree of freedom (1, 4). This means that there is no significant difference between the variables of amounts claimed and amounts reimbursed in relation to “legitimate” expense claims.

Table 4

Multiple

R-Square

Adjusted

StErr of

Summary

R

R-Square

Estimate

0.8575

0.7353

0.7330

144.6618885

Degrees of

Sum of

Mean of

F-Ratio

p-Value

ANOVA Table

Freedom

Squares

Squares

Explained

1

6628413.141

6628413.141

316.7388

<>

Unexplained

114

2385685.066

20927.06198

Coefficient

Standard

t-Value

p-Value

Lower

Upper

Regression Table

Error

Limit

Limit

Constant

96.84747643

17.76052565

5.4530

<>

61.66401192

132.0309409

Reimbursed

0.887455339

0.049865003

17.7972

<>

0.788673152

0.986237527

Conclusion

The data set is limited in that it contains only two numeric variables upon which to undertake regression analysis. The inferential analysis follows that outlined by the previous descriptive analysis which demonstrated a strong and dependable relationship between the dependent variable (claimed) and the explanatory variable (reimbursed).

The regression method was already presented in the tables. The computed value of the regression was 316.7388 while the tabulated value 253 at 0.05 level of significant. This implies that the null hypothesis was rejected because computed value was greater than the tabulated value.

This is to be expected given that it is a truism that claims made would be expected to be reimbursed assuming that they fulfill the claims criteria. We have learned that 23.3% of claims were not reimbursed in the 4 month period due to not being considered “legitimate”. This therefore proves the null hypothesis that the successful reimbursement of expense claims are indeed dependant upon the claim criteria being fulfilled.

This statement is true because there is no significant difference between the variables of amounts claimed and amounts reimbursed in relation to “legitimate” expense claims.

Reference

Albright, S., Winston, W., & Zappe, C. (2006). Data Analysis and Decision Making with Microsoft Excel. Manson, US: Thomson South-Western, USA.


1 comment:

Anonymous said...

[url=http://vtyupdr.com]YyQOyki[/url] - mHwUPwboLOUyf - http://iluubcb.com

Bidvertise Sponsor

Blog Advertising - Advertise on blogs with SponsoredReviews.com
SEO ss_blog_claim=76ee0d03e1b915adca3caa2ed8142e7c