Since these four values are higher than 5, we can assume normality.
Next we calculate the confidence interval
Α=0.02
We divide 0.02 by 2 so =0.01
CI = (Pyes – Pno) +/- Z0.01 [SE(Pyes-Pno)]
=0.00735699 +/- 2.33 [0.002537518] this value was calculated earlier
=(0.00738699 + 2.33 [0.002537518]) (0.00738699 -2.33[0.002537518]
= (0.00738699 + 0.0059124169), (0.00738699 -0.005912416)
CI=(0.013299406, 0.001474574)
Since 0 is not in our confidence interval we can assume that there is a statistically significant difference between the two groups. B)
Ho : πyes – πno = 0
H1 :πyes – πno ≠ 0 α=0.01 Continuity Correlation:
Ppooled = Ryes + Rno/Nyes + Nno
=(40+23)/(2904+3601)
=0.009684857
SE(Pyes-Pno) = √Ppooled (1-Ppooled)1/nyes+1/nno
=√0.009684857(1-0.009684857)(1/2904+1/3601)
=√0.009684857(0.990315143)(0.000344352+0.0002777)
=√0.00959106(0.000622052)
=√0.000005966
SE=0.002442539
Next, Test Statistic
(|Pyes-Pno| - [1/2(1/nyes +1/nno)] Pyes – Pno|H0 / SE(Pyes-Pno)H0
=|0.00738699| - [1/2(0.000344352+0.000277)-0]/0.002442539
=0.00738699-[1/2(0.00622052)|)/0.002442539
=(0.00738699-0.000311026)/0.002442539
=0.007075964/0.002442539
Test Statistic (Zobs) = 2.896970734, round to 2.90
Pvalue from table is 0.0037
Our alpha was 0.01 but since our Pvalue is 0.0037 we strongly reject the null hypothesis and whether you are exposed/not expose has an impact on whether you develop bladder cancer.
C)
RR = [a/(a+b)]/[c/(c+d)]
=40/(40+2864)/23(23+3578)
=0.013774104/0.006387114
RR=2.15654582 = We will round to 2.16
Next:
SE(logRR) = 1/a-1/a+b+1/c-1/c+d
= 1/40-1/2904+1/23-1/3601
=0.26
Next:
Take log of RR = log2.16
=0.7701
Because we must use a 99% confidence interval for RR :
Log (RR) +/- Zci * SE (logRR)
=0.7701 +/- 2.58 * 0.26 which is 0.7701 +/- 0.6708
Then we take the log of 0.0993 which is 1.4409
To find Confidence interval we take e0.0993 and e1.4409 which gives us [1.104397569, 4.224496153]
Since the value of 1 does not appear, there is a significant relative risk.
D)
OR=A*D/B*C
=(40 * 3578)/(2864*23)
=143120/65872
OR=2.172698567
PE of Log0R = √1/a+1/b+1/c+1/d
=√1/40+1/2864+1/23+1/3578
=√0.025+0.000349162+0.04347826+0.000279485
=0.2629
CI: 0.7620994 +/- 2.33 (0.2629)
0.7620994 + (0.612557) = 1.3746564
0.7620994 - (0.612557) = 0.1495424
Next to get our interval we e1.3746564 and e 0.1495424 [ 1.454491565, 1.161302709]
The value of 1 isn’t in our interval, which suggests a significant odds ratio.
E)
X2obs= (nyes+nno)([a*d-c*b]2)/(a+b)(a+c)(b+d)(c+d)
= 6505 * [40*3578-23*2864]2/(2904)(40+23)(2864+3578)(3601)
= 6505*[143120-65872]2/4.244054999 10^12
=38816984043520/4.244054999 10^12
=9.15 and the corresponding P value is 0.002487
Since the P value is less than our alpha (0.02) we can reject the H0 establishing that there is a relationship between the exposure to the solvent and the risk of developing cancer.
F)
The tests that we were required to do from parts A-E show that we must reject the null hypothesis and accept the
Related Documents: Essay on Science: Statistics and Regression Model
Correlation and Regression with SPSS RSCH/8200X2: Quantitative Reasoning and Analysis Correlation and Regression with SPSS Week 10 Application This week’s application was designed to obtain the statistical results on the dataset used in previous weeks. For this assignment I chose to use the Bivariate Linear Regression model. I chose this model because according to, SPSS for Windows and Macintosh: Analyzing and Understanding Data “Linear regression can be used…
www.ccsenet.org/ijbm International Journal of Business and Management Vol. 7, No. 3; February 2012 The Effects of the External Environment on Internal Management Strategies within Micro, Small and Medium Enterprises; Kenyan Case W L Njanja School of Business, Kabarak University (ex student, UNISA) Nakuru-Eldama Ravine Highway Private Bag - 20157 KABARAK, Kenya E-mail: lnjanja@kabarak.ac.ke Martin Ogutu Department of Business Administration University of Nairobi, Kenya E-mail: Ogutum@uonbi…
Provide analytic and modeling support to Bayview’s servicing acquisition and whole loan acquisition business. Responsibilities include a variety of pricing analysis, modeling, reporting and programming tasks Develop proprietary prepayment and credit models for agency residential mortgage-backed securities; Analyze and price US Agency MBS and Credit-Risk-Sharing Bonds Conduct statistical analysis using SAS for multi-million loan level data on SQL database Research fundamental factors and policy changes…
A.P. Statistics – Vocabulary List Add your own definitions and examples or references, as are appropriate, to the following vocabulary. (Please note: These lists may be collected on a random basis for credit.) Vocabulary Term Definition, Examples, References table a set of data arranged in rows and columns 表格 68-95-99.7 Rule In statistics, the 68–95–99.7 rule, also known as the three-sigma rule or empirical rule, states that nearly all values lie within three standard deviations of the mean…
Stock Prices and Exchange Rate, Evidence from Pakistan Usman Azhar 08-arid-1606 Abid Hussain 08-arid-1608 Faisal Shahzad 08-arid-1620 Usman Fazal 08-arid-1634 MBA Finance University Institute of Management Sciences Dedication We would like to dedicate this accomplishment to our beloved and caring parents, and to our teachers with the support of whom we are standing at this step of our life stairs. Acknowledgement…
the main factors that caused the development difference in CEE and CIS countries are analyzed. The time focused on is from 1990 to 2002, which was exactly the transition period of both CEE and CIS countries. The basic theory used in analyze is Solow model. 2. The General Description on Economic Growth The data quoted from Penn World Table can give Fig.2.1 and Fig.2.2 below which indicate the economic growth rate for CEE and CIS countries. Figure 2.1 --Data from Penn World Table Figure 2.2—Data…
pilot programme live in December 2011 by a smart mobile application called ParkPGH, which provide a parking finder service and prediction of the spot availability with day in advance. The nature and focus of the model The nature of the model consist in a multivariable prediction model, based on quantitative and qualitative data collected from surveys, historical data, events data and other external resources (i.e. articles). The analysis concern on designing parking spot utilisation path and predict…
to pursue the thesis track. Final consideration and approval may be given to students who have obtained an ASU graduate grade point average of 3.20 or greater. Industrial engineering combines knowledge from the physical, mathematical and social sciences to design systems that integrate people, equipment and information in the most efficient way. Improvements made by industrial engineers in automation, information control and process quality revolutionized manufacturing in the past century and greatly…
transactions, stocks, … n Science: Remote sensing, bioinformatics, scientific simulation, … n Society and everyone: news, digital cameras, YouTube n We are drowning in data, but starving for knowledge! n “Necessity is the mother of invention”—Data mining—Automated analysis of massive data sets 3 Evolution of Sciences: New Data Science Era n Before 1600: Empirical science n 1600-1950s: Theoretical science n n 1950s-1990s: Computational science n n n Over the last…
Walk - How Randomness Rules Our Lives" by Mlodinow and pay special attend to the following questions. Some of these questions may appear on quizzes and exams. Chapter 1 Peering through the Eyepiece of Randomness 1. Explain the phenomenon "regression toward the mean." In any series of random events an extraordinary event is most likely to be followed, due purely to chance, by a more ordinary one. 2. What factors determine whether a person will be successful in career, investment, etc…