Premium Essay

Stat Correlation Analysis

In:

Submitted By lolla
Words 922
Pages 4
a) If 10 subscribers are chosen at random, probability that 4 or more plan to make real estate purchases in the next two years is 71.56% P(x>=4)? Excel functions: =1-BINOMDIST(3,10,0.4415,1) b) There are 6 subscribers whose household income would be considered an outlier (with the z-score more than 3 or less than -3). They are: Income Z-score 179700 | | | 3.02257 | 200500 | | | 3.619959 | 201700 | | | 3.654424 | 202400 | | | 3.674528 | 205900 | | | 3.77505 | 322500 | | | 7.123872 | Excel functions: Income Mean: =AVERAGE(Data!G2:G411) Standard Deviation: =STDEV(Data!G2:G411) z-score calculation: =(G2-Sheet2!$B$1)/Sheet2!$B$2 or =(G2-74459.51)/34818.21

c) Age of subscribers approximates normal distribution, and according to empirical rule, 68% of subscribers can be expected to be between ages 26 and 34, 95% of subscribers between ages 22 and 38, and 99% of subscribers between ages 18 and 42. Even though ages of subscribers range from 19 to 42, potential advertisers should be aware that data has the expected value 30.11, mode of 32, and the bulk of the subscribers will be between ages of 26 and 34, which determines the potential target market.

Excel Functions:
Variance=(x-m)^2 * f(x)
Standard Deviation=M5^0.5
Expected Value = Weighted Average= age*f(x), where f(x) is % frequency 68.3% subscribers will be within 1 STD of the mean (30 + / -4) | 95% within 2 std (30 + / - 8) | 99% within 3 std (30 + / - 16)Mode=MODE(A2:A411) |

d) The probability of a subscriber having broadband access given that the subscriber has children (conditional probability) is 63.93% P(children)=219/410=0.5341 P(children and access)=140/410=0.3415 P(access given children)=P(children and access)/P(children)=0.3415/0.5341=0.6393

Similar Documents

Premium Essay

Project

...results: Dependent Variable: Size Independent Variable: Income ($1000) Size = 2.3893113 + 0.023563983 Income ($1000) Sample size: 50 R (correlation coefficient) = 0.1984 R-sq = 0.039351758 Estimate of error standard deviation: 1.7220922 Parameter estimates: Parameter | Estimate | Std. Err. | Alternative | DF | T-Stat | P-Value | Intercept | 2.3893113 | 0.77432936 | ≠ 50 | 48 | -61.486355 | <0.0001 | Slope | 0.023563983 | 0.016804602 | > 50 | 48 | -2973.9731 | 1 | Analysis of variance table for regression model: Source | DF | SS | MS | F-stat | P-value | Model | 1 | 5.8311434 | 5.8311434 | 1.9662601 | 0.1673 | Error | 48 | 142.34886 | 2.9656012 | | | Total | 49 | 148.18 | | | | Simple linear regression results for Location=Urban: Dependent Variable: Income ($1000) Independent Variable: Size Income ($1000) = 39.63889 - 3.3611112 Size Sample size: 13 R (correlation coefficient) = -0.3589 R-sq = 0.12878229 Estimate of error standard deviation: 7.597348 Parameter estimates: Parameter | Estimate | Std. Err. | Alternative | DF | T-Stat | P-Value | Intercept | 39.63889 | 5.117386 | ≠ .40 | 11 | 7.66776 | <0.0001 | Slope | -3.3611112 | 2.6358569 | < .40 | 11 | -1.4269027 | 0.0907 | Analysis of variance table for regression model: Source | DF | SS | MS | F-stat | P-value | Model | 1 | 93.85256 | 93.85256 | 1.6260059 | 0.2285 | Error | 11 | 634.9167 | 57.719696 | | | Total | 12 | 728.7692 | | |...

Words: 1608 - Pages: 7

Premium Essay

Stats Project

...rural and suburban areas; thus, we also can say that these areas you will find that they carry higher credit balances versus urban areas, which will carry lower credit balances since they have smaller families. This part of the project gives us a great visual on size and credit balances and allows us to see which way the scale would move if we increase the sizes or decrease or sizes. APPENDIX A As you look at this graph you can see that as the size get bigger the credit balance increase also, so as you move to the right and up the linear line will move diagonal higher. The equation of the “best fit line” is y= 2591.4x+403.22. The coefficient of correlation equals .752442. The correlation coefficient is a number between 0 and 1.  If there is no relationship between the predicted values and the actual values the correlation coefficient is 0 or very low. As the strength of the relationship between the predicted values and actual values increases so does the...

Words: 1413 - Pages: 6

Premium Essay

Mpact of Workforce Diversity on Organizational Performance in the Education Sector of Karachi Pakistan

...IMPACT OF WORKFORCE DIVERSITY ON ORGANIZATIONAL PERFORMANCE IN THE EDUCATION SECTOR OF KARACHI PAKISTAN 1Hafiza Sumaiyyah Iqbal, 2Faiza Maqbool Shah (Supervisor) Department of Business Administration, Jinnah University for Women (JUW) Karachi Pakistan ABSTRACT Diversity is gradually used and accepted as a significant organizational resource in esteems to whether the objective is to be an employer of choice, to offer outstanding customer service, or to sustain a competitive advantage. It also has verified to have controlled to an opinion of being essential for organizational performance. This ultimate faith forces managers to hold and understand the theory of workplace diversity, its benefits and barriers. The purpose of this research is to discover the impact of diversify workforce towards organizational performance which focus into the education sector. The research also emphases on workforce diversity which contains the gender, ethnic and education background of the employees which is the utmost critical variables amongst all the others. The research was done by distributing 100 questionnaires to the faculty members of 5 different universities of Karachi. The questionnaire outcomes show that there is an impact on performance when diverse workforce is working in the education sector. Key words: Workforce Diversity, Organization, Performance, Gender, Ethnic, Qualification, Karachi, Universities. ___________________________________________________________________________ ...

Words: 7503 - Pages: 31

Premium Essay

Regression Analysis

...Individual Project BUSN311-1301B-10: Quantitative Methods and Analysis Instructor Leonidas Murembya April 23, 2013, Abstract This paper will be discussing regression analysis using AIU’s survey responses from the AIU data set in order to complete a regression analysis for benefits & intrinsic, benefits & extrinsic and benefit and overall job satisfaction. Plus giving an overview of these regressions along with what it would mean to a manager (AIU Online).   Introduction Regression analysis can help us predict how the needs of a company are changing and where the greatest need will be. That allows companies to hire employees they need before they are needed so they are not caught in a lurch. Our regression analysis looks at comparing two factors only, an independent variable and dependent variable (Murembya, 2013). Benefits and Intrinsic Job Satisfaction Regression output from Excel SUMMARY OUTPUT Regression Statistics Multiple R 0.018314784 R Square 0.000335431 The portion of the relations explained Adjusted R Square -0.009865228 by the line 0.00033% of relation is Standard Error 1.197079687 Linear. Observations 100 ANOVA df SS MS F Significance F Regression 1 0.04712176 0.047122 0.032883 0.856477174 Residual 98 140.4339782 1.433 Total 99 140.4811 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 4.731133588 1.580971255...

Words: 830 - Pages: 4

Premium Essay

Regression Analysis

...Introduction Regression analysis was developed by Francis Galton in 1886 to determine the weight of mother/daughter sweet peas. Regression analysis is a parametric test used for the inference from a sample to a population. The goal of regression analysis is to investigate how effective one or more variables are in predicting the value of a dependent variable. In the following we conduct three simple regression analyses. Benefits and Intrinsic Job Satisfaction Regression output from Excel SUMMARY OUTPUT Regression Statistics Multiple R 0.616038 R Square 0.379503 Adjusted R Square 0.371338 Standard Error 0.773609 Observations 78 ANOVA df SS MS F Significance F Regression 1 27.81836 27.81836 46.48237 1.93E-09 Residual 76 45.48382 0.598471 Total 77 73.30218 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept 2.897327 0.310671 9.326021 3.18E-14 2.278571 3.516082 2.278571 3.516082 X Variable 1 0.42507 0.062347 6.817798 1.93E-09 0.300895 0.549245 0.300895 0.549245 Graph Benefits and Extrinsic Job Satisfaction Regression output from Excel SUMMARY OUTPUT Regression Statistics Multiple R 0.516369 R Square 0.266637 Adjusted R Square 0.256987 Standard Error 0.35314 Observations 78 ANOVA ...

Words: 684 - Pages: 3

Premium Essay

Unit 5 Regression Analysis

...Unit 5 Regression Analysis American Intercontinental University Regression Analysis Independent Variable: Benefits Dependent Variable: Intrinsic Regression Statistics |   | Multiple R | 0.252916544 | R Square | 0.063966778 | Adjusted R Square | 0.045966139 | Standard Error | 0.390066747 | Observations | 54 | ANOVA |   |   |   |   |   |   | df | SS | MS | F | Significance F | Regression | 1 | 0.540685116 | 0.540685116 | 3.553583771 | 0.065010363 | Residual | 52 | 7.911907477 | 0.152152067 |   |   | Total | 53 | 8.452592593 |   |   |   |   | Coefficients | Standard Error | t Stat | P-value | Lower 95% | Upper 95% | Lower 95.0% | Upper 95.0% | Intercept | 4.88865703 | 0.188506099 | 25.93368096 | 2.04938E-31 | 4.510391881 | 5.266922187 | 4.510391881 | 5.266922187 | 1.4 | 0.06958624 | 0.036913916 | 1.885095162 | 0.065010363 | -0.004486945 | 0.143659433 | -0.004486945 | 0.143659433 | Independent Variable: Benefits Dependent Variable: Extrinsic Regression Statistics |   | Multiple R | 0.332749251 | R Square | 0.110722064 | Adjusted R Square | 0.093620565 | Standard Error | 0.405766266 | Observations | 54 | ANOVA |   |   |   |   |   |   | df | SS | MS | F | Significance F | Regression | 1 | 1.065986925 | 1.065987 | 6.474407048 | 0.013952455 | Residual | 52 | 8.561605668 | 0.164646 |   |   | Total | 53 | 9.627592593 |   |   |   |   | Coefficients | Standard Error | t Stat | P-value | Lower 95% | Upper 95% | Lower 95...

Words: 463 - Pages: 2

Premium Essay

Nba Statistics

...Table of Contents Introduction……………………………………………………………………………………………………………………2 Descriptive Analysis and Frequency Distribution…………………………………………………………….2 Hypothesis Testing………………………………………………………………………………………………………….7 Hypothesis 1…….……………………………………………………………………..……………………......7 Hypothesis 2……………………………………………………………………………………………………...8 Hypothesis 3……………………………………………………………………………………………………...9 Multiple Regression Analysis…………………………………………………..…………………………………….11 Summary ……….…………………………………………………………………………..………………………………..16 Reference……………………………………………………………….…………………………………………………….19 Appendices……………………………………………………………………….…………………………………………..20 Introduction For my statistical data analysis project, I chose to analyze the National Basketball Association (NBA) 2013 regular season teams. The analysis looks at the total team and reviewed the information such as games played, field goals attempt and percentage, free throw attempts and percentage, blocks and steals. The data was obtained from the NBA website. For the 2013 NBA stats there were 30 teams that played on the average of 82 games. Based on statistical analysis, the most important keys for team success in basketball and their relative weights, in parentheses, are field goal percentage, turnovers, offensive rebounds, free throw attempts and percentage, blocks and steals. Coaches are always looking for a better understanding of what makes up a winning team. This knowledge would help them improve the team statistics in the areas listed...

Words: 3661 - Pages: 15

Free Essay

Unit 5 - Regression Analysis

...Abstract This paper describes the application of regression analysis for the workplace. Three sets of variables are investigated - benefits and intrinsic job satisfaction, benefits and extrinsic job satisfaction, and finally benefits and overall job satisfaction. The regression analysis is performed using Excel and the results are shown in this paper, along with a graph for each set. The results are analyzed for recommendation to the company.   Introduction Regression analysis is performed on three sets of variables – benefits and intrinsic job satisfaction, benefits and extrinsic job satisfaction, and finally benefits and overall job satisfaction. The results of the regression analysis are used to determine whether any relationship exists for the three sets of variables and the strength of the relationship. Benefits and Intrinsic Job Satisfaction Regression output from Excel Regression Statistics Multiple R 0.069642247 R Square 0.004850043 Adjusted R Square -0.004718707 Standard Error 0.893876875 Observations 106 ANOVA df SS MS F Significance F Regression 1 0.404991362 0.404991 0.506863 0.478094147 Residual 104 83.09765015 0.799016 Total 105 83.50264151 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept 5.506191723 0.363736853 15.13784 4.79E-28 4.784887914 6.227496 4.784888 6.227496 Benefits -0.057165607 0.080295211 -0.71194 0.478094 -0.216394019 0.102063 -0.21639 0.102063 Graph Benefits...

Words: 770 - Pages: 4

Free Essay

Quantitative Methods and Analysis Unit 5 Ip

...Unit 5 – Regression Analysis American InterContinental University Abstract In this scenario, Microsoft Excel has been utilized in order to perform a regression analysis therefore; each one has a chart in order to show the correlations in the data. However, satisfaction: overall, intrinsic, and extrinsic had been used.   Introduction An analysis has been given to employees for the benefits satisfaction and compared to three different job types such intrinsic, extrinsic, as well as the over all. However, the regression analysis that was performed had been done in excel as well as there were charts made up. Benefits and Intrinsic Job Satisfaction Regression output from Excel Regression Statistics Multiple R 0.022301 R Square 0.000497 Adjusted R Square -0.0093 Standard Error 0.656922 Observations 104 ANOVA df SS MS F Significance F Regression 1 0.021902 0.021902 0.050753 0.822209 Residual 102 44.01771 0.431546 Total 103 44.03962 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept 5.270871 0.348709 15.11541 8.66E-28 4.579209 5.962532 4.579209 5.962532 X Variable 1 0.017947 0.079664 0.225284 0.822209 -0.14007 0.175959 -0.14007 0.175959 It did not want to add my 2 to the answer of 5.962532 or did it add the 9 to the answer of 0.175959 Graph Benefits and Extrinsic...

Words: 910 - Pages: 4

Premium Essay

Laksaldk

...COURSE OUTLINE Term: Fall 2012 Subject Code: Course Number: Course Name: Grade Mode: Length: STAT 2001 Statistics 2 Numeric 42.0 Hours PLA Applicable: Y Credit Value: 3.0 Level: Post Secondary Prerequisites(s): STAT 2000 Statistics 1 or BUS 2238 Stats 1 Co-requisite(s) and Concurrent Prerequisite(s): None Equivalent(s): BUS 2206 Inferential Statistics and BUS 2239 Statistics 2 Students are advised to retain course outlines for future use in support of applications for employment or transfer of credits Course Description This course builds on the topics covered in Statistics 1, extending hypothesis testing and other inferential techniques to a range of new problems. Applications of statistical techniques to quality and productivity management are covered. Students gain further experience with the use of computer-based statistical analysis tools. Learning Outcomes: Upon completion of this course, the student will have reliably demonstrated the ability to: 1. Identify the appropriate technique, then conduct and interpret inference (both estimation and hypothesis tests) of population means when the population variance is known or unknown. 2. Perform hypothesis tests on the differences between two means using both independent samples and matched pairs experiments. STAT 2001 “Statistics 2” 3. Use sampling distributions to calculate interval estimates and select appropriate sample sizes for proportions. 4. Test the significance of a population proportion and...

Words: 766 - Pages: 4

Premium Essay

House Prices Ii

...HOUSE PRICES II CASE: 28 Olusegun Abebayo TAKSAMAI TANAPAISANKIT STACEYANN BARTON GM533 Applied Managerial Statistics Abstract Pricing your home competitively is an important factor in determining your selling price. As a seller, the aim is to get the best asking price. To prevent losing money, one has to be careful not to underprice their home. As mentioned in the article Selling Your Home – The Importance of Pricing Correctly, the most important factor when selling your home is not what your home is listed for, but rather what similar homes have recently sold for. This is the statistic that will properly tell you what buyers are willing to pay for a similar home, in a comparable neighborhood. In the article entitled Pricing Houses-Pricing Houses to Sell, Elizabeth Weintraub provided a few guidelines that can be effective in pricing one’s home. She suggested that a seller looks at every similar home that was or is listed in the same neighborhood over the past six months. Compare similar square footage, within 10% up or down from the subject property, if possible. Compare apples to apples. The objective of this study is to use the data given in Case 28 – Housing Prices 11 to determine the selling price for a house in Eastville, Oregon and prepare and establish the description of how the findings might be used as a general method for estimating the selling price of any house in my neighborhood. In doing so, we had to figure out what factors determine the selling...

Words: 6813 - Pages: 28

Premium Essay

Amaf

...Assignment on Regression Objective of the Analysis To find a regression equation to forecast the closing price of sensex by estimating its relationship with various other independent parameters Description of Data Data Frequency: Monthly Data Data Span: From year 2010 to 2014 Following independent variables are used to estimate the GDP: 1: Call Money Rate: This variable 2: Dollar Price: 3: Oil Price: 4: Gold Price: 5: IIP: Empirical Analysis Following steps were performed to find out the relationship between Sensex Closing Price and other independent variables: 1: Regression command entered in E-Views: “LS SensexClosePrice C CallMoeyRate DollarPrice OilPrice GoldPrice IIP” Dependent Variable: GDP | | | Method: Least Squares | | | Date: 08/01/14 Time: 23:09 | | | Sample: 1 41 | | | | Included observations: 41 | | | | | | | | | | | | | Variable | Coefficient | Std. Error | t-Statistic | Prob.   | | | | | | | | | | | C | 2027.956 | 2505.872 | 0.809281 | 0.4238 | POWER_C | 0.015238 | 0.009099 | 1.674679 | 0.1029 | EXPORTS | -0.026773 | 0.022733 | -1.177689 | 0.2469 | IMPORTS | -0.001196 | 0.009135 | -0.130908 | 0.8966 | HIGHWAY | 0.006676 | 0.002728 | 2.446933 | 0.0196 | TAX | 0.072515 | 0.013733 | 5.280252 | 0.0000 | | | | | | | | | | | R-squared | 0.995336 |     Mean dependent var | 34045.72 | Adjusted R-squared | 0.994670 |     S.D. dependent var | 23515.70 | S.E. of regression...

Words: 1203 - Pages: 5

Premium Essay

Statistics

...which elements are associated with poverty. The purpose of this paper is to evaluate the key determinants of American household poverty in 1980. The four possible determinants will be analyzed in this project, the average numbers of every family (FAMSIZE), URB is the percent of people live in urban, UR is the level of people have no job over 16 years and the median family income in US dollars (INCOME). Descriptive statistics, correlation and regression will be used in this project. 2. Descriptive statistics Variable | Mean | Median | Mode | VAR | STDEV | URB | 58.76034483 | 66.15 | 0 | 1012.828049 | 31.82495953 | FAMSIZE | 3.140172414 | 3.135 | 2.93 | 0.033377163 | 0.182694178 | UR | 9.293103448 | 8.95 | 5.8 | 10.92696915 | 3.30559664 | INCOME | 19240.43103 | 18512 | N/A | 10889936.04 | 329.990309 | POV | 9.120689655 | 9.05 | 8.8 | 6.230792498 | 2.496155544 | 3. Correlation Correlation and regression are techniques for investigating the statistical relationship between two, or more, variables (Barrow, 2013, pp. 238). * Correlation defines the degree to which there is a linear relationship between pairs of variables. Firstly, it is useful to graph the variables to see if anything useful is revealed. In this case, XY graphs are the most suitable and they are shown in following figures. In Figure 3.1, there is an almost flat line in the graph of poverty against URB. The graph between poverty and family size (Figure 3.2) looks much like a random scatter...

Words: 1666 - Pages: 7

Premium Essay

Economic Financial Forecasting

...answer the following questions: 1. Choose the real US export of goods and services (REXPGS), real import of goods and services (RIMPGS) and disposable personal income (DPI) variables. View their descriptive statistics. Analyze skewness, kurtosis and volatility (measured by the coefficient of variation) of each of them. Discuss possible economic factors underlying the data asymmetry, kurtosis and relative volatility. In this example, all three variables are left skewed, while kurtosis is between 1.99 and 2.29 for each of the three variables. I measured volatility based on the standard deviation of the three variables. DPI is extremely volatile, while RIMPGS and REXPGS are not as volatile as DPI. It seems as if there is a strong correlation between the amounts of standard deviations compared to the kurtosis. There is a lack of asymmetry between the three variables, as there is no equivalence in various measurements. | |REXPGS |RIMPGS |DPI | | Mean | 747.1115 | 962.0551 | 4682.076 | | Median | 447.5000 | 636.0000 | 3400.400 | | Maximum | 2123.900 | 2667.200 | 13506.80 | | Minimum | 86.60000 | 116.3000 | 353.6000 | | Std. Dev. | 618.7146 | 822.1677 | 4047.948 | | Skewness | 0.772824 | 0.733214...

Words: 2014 - Pages: 9

Premium Essay

Corporate Finance

...Accounting Standards Harmonization and Financial Statement Comparability: Evidence from Transnational Information Transfer Clare Wang The Wharton School University of Pennsylvania wclare@wharton.upenn.edu Current Version: January 2011 Abstract This study investigates whether harmonization of accounting standards enhances the comparability of Önancial information across countries. First, I statistically deÖne and link comparability to Örm value in a two-Örm, sequential information release framework. I then empirically test the prediction that a Örm yet to announce earnings reacts more strongly to the earnings announcement of a foreign Örm when both report under the same rather than di§erent accounting standards. My analysis of abnormal price and volume reactions for a global sample of Örms supports this prediction. Next, in an attempt to control for the e§ects of changes in reporting quality, I use a di§erence-in-di§erences design around the mandatory introduction of International Financial Reporting Standards (IFRS). I Önd that mandatory adopters experience a signiÖcant increase in market reactions to the release of earnings by voluntary adopters compared to pre-mandatory adoption. This increase is not observed for non-adopters over the same period. Taken together, my study shows that accounting standards harmonization facilitates transnational information transfer, and suggests comparability as a direct mechanism. I thank my dissertation committee members Brian Bushee,...

Words: 19979 - Pages: 80