# Stats Midterm

In: Other Topics

Submitted By crunkboy117
Words 305
Pages 2
Here are the questions to be answered for the Take Home portion of the Midterm Exam. This assignment is due on Friday, February 28, 2014. Use the Midterm Dataset to answer the questions. Please include all SAS code and output for full credit. You may not receive any help for anyone for this assignment, but you may refer to books, manuals, etc.

1) Create a 99% for the difference of the means of the variable WAR (wins above replacement) for those players who did not make the playoffs compared to those players who made the playoffs. Based on this interval, could you make the conclusion that the players who made the playoffs had a higher mean WAR than those who did not make the playoffs? Explain.

2) Create a 90% confidence interval for the percent of players who were considered to be outfielders. For position players, 37.5% of position players on the field are outfielders. Based on the sample provided, explain why this is (is not) a good estimate for the percentage of players who are outfielders.

3) Using hypothesis testing, explain if there is enough evidence at the 5% level of significance to say that the mean number of homeruns (HR) is less than 25.

4) At the 2% level of significance, is there any evidence to support the claim an individual’s batting average in September is going to be less than his batting average in April. Explain your answer.

5) Create a new variable called “TeamPlayer” where the variable has the value “Yes” if the player has more than 80 runs batted in (RBI) and has the value of “No” if not. Is there a difference in the percent of players in each league who are considered to be a “TeamPlayer”? Use a 5% level of significance and explain your...

### Similar Documents

#### Fgsg;

...Stat 141 R1 - Lecture #35 Announcements: 1) Assignment #11 Question 5: The answer is wrong … should be “fail to reject” but MyStatLab wants “reject”... so give the wrong answer for full marks in this question 2) Exam: STAT 141 R1 3 hrs 1400 Wed Apr 17 MAIN GYM, ~45 Multiple Choice Questions Chapters 7, 8, 18-28 …. some pre MT skills will be required. Simple Linear Regression …. continued Last time: Ex) Predicting final exam marks (%) from midterm exam marks (%) in a class of 88 students: Student | #1 | #2 | … | #88 | Midterm mark | 67% | 72% | … | 88% | Final mark | 62% | 50% | … | 91% | Stat 141 R1 - Lecture #35 page 2 Given x = midterm percentage, y = final percentage, n=88, x = 67.812, y = 52.643, sx = 17.922, sy = 25.430, r = 0.718 ∑(yi–ŷi)2 = 27278.82 We had calculated: The slope and intercept of the sample line of best fit: * sample line of best fit: y= -16.443 + 1.019 x An estimate for σ (standard deviation about the population line): se Given SSE = ∑(yi–ŷi)2 = 27278.82 : se2=SSEn-2=27278.8288-2=317.196 →σ≈se=317.196=17.810 Stat 141 R1 - Lecture #35 page 3 Inference for the population slope β1 When the 4 basic assumptions of the SLR model are satisfied: o The relationship between x and y is sufficiently linear. Presuming linearity, this means, at any x, με = 0. o The std. dev. of ε is the same for any particular x (constant). o The...

Words: 691 - Pages: 3

...expected to be positive. The fact that one of them has the wrong sign (βS) may reflect negative omitted variable bias. Multicollinearity obviously is a problem and may be responsible for the wrong sign of βS too. Variable S looks irrelevant but it should be relevant in theory. Its small t-stat may be due to omitted variable or multicollinearity. Q2. Using the intuition from simple case of omitted variable: (a) y = apple consumption, x1 = price of banana, x2 = price of orange yt = 0 + 1 x1,t + 2 x2,t + ut If we leave out x2, bias on β1 depends on (1) whether x1 and x2 are correlated in the sample which mostly likely is positive (e.g. inflation), and whether price of orange has positive or negative effect on the annual consumption of apple in the population (sign of 2) which should be positive too (law of demand). Bias is (+) (+) = (+) (b) y = student grades on the midterm, x1 = hours studies for the midterm x2 = hours slept before the midterm If we leave out x2, bias on β1 depends on (1) whether x1 and x2 are correlated in the sample which mostly likely is negative, and (2) whether hours slept before the midterm has positive or negative effect on the midterm grade in...

Words: 734 - Pages: 3

#### Midterm Paper

...recent study of 460 drivers age 70 and over by the National Highway Traffic Safety Administration reported that 75% of those drivers had uncorrected vision problems. Answer: The report of the ‘traffic safety administration’ reported that 75% (345 people) was the reported statistic from the 460 drivers of 70 or older drivers. I feel the assumed parameter would be the population of drivers that had uncorrected vision that were 70 years old or older. 4. What type of sampling procedure was used to collect the data in the MM207 Student Data Set? Answer: The sampling procedure used, believe was the ‘Conveyance’ sampling because this was a population which was readily available. STATISTICS, MM207, ‘Midterm Project’ Andre Bergman, 4-20-2014 Page, 2 of 3 5. From the MM207 Student Data Set identify one...

Words: 733 - Pages: 3

Free Essay

#### Math 121

Words: 1464 - Pages: 6

Free Essay

#### Case

...STAT 2000 Midterm Exam # 1 Review Sheet Descriptive Statistics 1. Data, categorical variable, quantitative variable, identiﬁer 2. Population, sample, census, parameter, statistic 3. Sampling designs: simple random, stratiﬁed and cluster samplings 4. Data presentation (a) Categorical variable: frequency tables, bar chart, pie chart (b) Quantitative variable: histogram, ﬁve-number summary, boxplots 5. Descriptive statistics: mean x, median, mode, range, interquartile range (IQR), variance s2 , ¯ standard deviation s. They are used to describe shape, center and spread of the distribution. 6. Percentiles: 25th = Q1 , 50th = median, 75th = Q3 etc. Correlation and Linear Regression 1. Scatterplot, y (dependent, response) variable, x (independent, explanatory) variable 2. Correlation (linear association between 2 variables) and correlation coeﬃcient r = interpretations, properties, conditions, lurking variables, eﬀect of outliers 3. Linear model: y = b0 + b1 x, where b1 = r(sy /sx ) and b0 = y − b1 x. ˆ ¯ ¯ 4. Interpretations of predicted value y , slope b1 and intercept b0 in problem context. ˆ 5. R2 = r2 : fraction of y’s variability accounted for by linear regression on x Probability 1. Trial, sample space, sample points, events 2. Three types of probability: theoretical, empirical and personal 3. Contingency table: joint probability and marginal probability 4. Notation: A, Ac , A ∩ B, A ∪ B, Venn diagram 5. Complement rule: P (Ac ) = 1 − P (A) 6. Addition rule: P (A ∪ B) = P (A)...

Words: 292 - Pages: 2

#### Cheat Sheet for Statistics 1

...Question 1 # sold | P(x) | 0 | 0.1 | 1 | 0.11 | 2 | 0.2 | 3 | 0.23 | 4 | 0.3 | 5 | 0.06 | Question 2: The average rate of occurrence of accidents was 8.15 per week. A) during that time, what is the probability of getting less than 3 accidents a week? Use poisson for this question. Since it is less that 3 accidents, x = 0,1,2 (Px=o) = [(8.15^0) (e^8.15)] / 0! = 0.00288735. (Px=1) = [(8.15^1) (e^8.15)] / 1! = 0.002353193. (Px=2) = [(8.15^2) (e^8.15)] / 2! = 0.009589362. Add all these results together, and p(x<3) is 0.01223119. B) The probability if an accident on the road is 0.295. Out of 10 reportable accidents, what is the probability that 2 occurred on the road? Use Binomial for this. Since it is exactly 2, we need to find P(x=2). Use the binomial formula, n = 10, x = 2, p = 0.295, and q is 0.705. = 10!/2!(10-2)! * 0.205 ^2 * q^ 10-2. Answer is 0.23898436. Question 3: 15% of drivers have a probationary license and 85% have a regular license. In a given year, 10% of drivers with probationary licenses are involved in an accident. 2.5% of regular drivers are involved in an accident. For this question it is better to draw a tree diagram. Question 4: Mean values for income for men are 100K, for woman, 80K. SD deviations of 20K and 15K respectively. Variables are normally distributed and independent. Calculate the 95th percentile, for men income. 95th p = 0.95. In the zscore table, we find a value of 1.64. Us e the formula for z-score (population) : ......

Words: 9115 - Pages: 37

#### Harvard Management Company

... The calculation we made to compute the debt is as follows: = Total current liabilities + Total long-term debt and other non-current liabilities - Cash and cash equivalents - Trades payables = 2647961 + 6410751 - 1330441 - 1202398 D = 11.5915 billions The proportion of debt is then: D/D+E = 0.3328 B. Estimation of the cost of debt rD According to the financial statement, the average interest rate for Debt and Financing Arrangements, which corresponds actually to long-term notes, was 4.28% for the fiscal year 2012. After that we wanted to estimate the cost of bonds. To do so we found the Heinz’s credit rating, which is BBB . We then looked at the future bonds that Heinz is going to issue, and decided to take the bond with a midterm maturity as representative debt of the company. P=400 C=3.125% Price=101.4 Since the company is rated BBB, the default rate is d=0.4%. We have found a y = 3.38% using the formula seen in the course. To get the final rD we...

Words: 773 - Pages: 4

#### Engineer

...EECE 350/450 Computer Networks Spring 2013 Course information   Catalogue Description   A course that outlines data communications; wide area networks; circuit and packet switching; routing; congestion control; local area networks; communications architecture and protocols; internetworking.   Pre-requisites   By topic: •  Programming language (such as C++) •  Basic understanding of probability theory and data structures •  Basic understanding of the internal operations of computers   By course: •  STAT 230 Probability and Random Variables, and EECE 330 Data Structures and Algorithms. Instructor   Ayman Kayssi                   Office: 404, Raymond Ghosn Building (RGB) Office Hours: Monday 2 – 3:30 pm and by appointment Extension: 3499 Email: ayman@aub.edu.lb Web: aub.edu/~ayman Messaging: +961 3 965 258 LinkedIn: lb.linkedin.com/in/aymankayssi Twitter: akayssi Facebook: akayssi Course objectives   The objectives of this course are to give students:   An understanding of the basic principles of computer networking   An overview of the main technologies used in computer networks   An overview of internetworking principles and how the Internet protocols, routing, and applications operate   The basic background in computer networks that will allow them to practice in this field, and that will form the foundation for more advanced courses in networking   The basic skills needed to write network applications. Topics covered in this......

Words: 1522 - Pages: 7

#### Student

Words: 1327 - Pages: 6

#### Accounting

Words: 4291 - Pages: 18

#### Statistics

...COMM 291 Midterm Review Package Prepared by Angelica Cabrera 1. INTRODUCTION TO DATA AND VARIABLES Categorical vs. Quantitative Data Categorical Limited number – distinct categories No Quantitative Large number Yes Possible values for variable Measurement units? EXAMPLE. Which variables are quantitative and which are categorical? Employee # Age (years) Annual Income (in Performance 1,000s of dollars) Rating (1-5 scale) 5543 48 50 – 100 4.5 2431 34 20 – 49 3.9 7281 31 0 – 19 3.4 Job Type Management Clerical Maintenance 2. SURVEYS AND SAMPLING Population: _______ individuals with a common characteristic that you want to generalize about Parameter: fact or characteristic about _____________ Sample: ________ of population Statistic: fact or characteristic about ______________ EXAMPLE. Mattel claims that less than 5% of all its Hot Wheels toys are defective. When testing 100 Hot Wheels toys from a production run of 7000 toys, 7% were found to be defective. What is the: a) Population? c) Parameter? Poor (Biased) Sampling   Convenience sampling: Choosing respondents that are __________ to obtain Voluntary response: Respondents volunteer, so those with __________ opinions are more likely to respond b) Statistic? d) Sample? Sampling Designs 1. Simple Random Sampling (SRS): Every individual has an equal chance of being selected 2. Stratified Random Sampling: Divide population into ______________ subgroups and randomly select from each stratum 3. Cluster......

Words: 2677 - Pages: 11

Free Essay

#### Statistics

Words: 865 - Pages: 4

#### Management Trainee

...3100 Intermediate Financial Accounting I Course Outline – Fall 2014 Instructor: Carla Carnaghan, Ph.D., CISA, CMA Office & Hours: Calgary Campus office, 5:00 – 6:00pm Thursdays. Please email for appointment, and for alternative times. Office hours may be done in person, with Webex, or by phone, and for smaller issues with email. E-mail: Carla.Carnaghan@uleth.ca (emails received in the evening, particularly after 10 PM, may not be responded to until the next day) Telephone: 1- 403-329-2351 (Lethbridge office – Please use e-mail where possible) Class Times: Y Section: Th 1800-2050 (S6013A) Course Web Page: on Moodle Course Pre-requisites* Prerequisites: MGT 2400; ECON 1010 and 1012 and STAT 1770 or admission to the CA Bridging Program/Professional Diploma in Accounting; a minimum C- grade is required in all prerequisite courses; MGT 3040 (co-requisite) * It is the responsibility of students to check the prerequisites for courses before registration. If students are registered in a course for which they do not have the prerequisites, and they have not received prior permission from the appropriate program chair, they can be de-registered. If this occurs after the drop/add date, tuition will not be refunded. Course Objectives Building from MGT 2100, this course examines the accounting concepts, principles, judgments, and techniques used in recording business events and preparing and presenting......

Words: 6524 - Pages: 27

#### Summer Syllabus

...|Topic |Reading Assignment | |Introduction to OM and SCM |Chapter 1 | |Competitiveness, Strategy, & Productivity |Chapter 2 | |Supply Chain Management |Chapter 15 | |Forecasting |Chapter 3 | |Product and Service Design |Chapter 4 | |Exam 1 – Online 6/11 from noon to midnight |Chapters 1-4, & 15 | |Capacity Planning |Chapter 5 | |Process Selection and Facility Layout |Chapter 6 | |Location Planning and Analysis |Chapter 8 | | Quality Management |Chapter 9 | |Exam 2– Online 6/19 from noon to midnight |Chapters 5, 6, 8, 9 | |Statistical Quality Control |Chapter...

Words: 2512 - Pages: 11