Monday, March 12, 2007

Chapter 12 Inference for proportions

Statistics in action. . .

Here's the basketball video.
http://viscog.beckman.uiuc.edu/grafs/demos/15.html

NCAA Brian's out in front with no way for anyone to catch up (I think). Pretty amazing. http://linnerstats.mayhem.sportsline.com/e
You'll need the password, which tells you who I think will win: gogators


Please try out this quiz and let me know how it works for you.
http://www.proprofs.com/quiz-school/quizview.php?id=567 :Basic stuff quiz

http://www.proprofs.com/quiz-school/quizview.php?id=585 :Which test do we do?

Cool sites for playing with proportions:
http://http://www.ltcconline.net/greenl/java/Statistics/HypTestProp/HypTestProp.htm

http://www.math.csusb.edu/faculty/stanton/m262/proportions/proportions.html
List of top engineering schools for recruiting as discussed in class (not in any particular order):
Cal Poly, Penn State, Penn, MIT, Florida A&M, Florida, RPI, Morgan State, Maryland, UCLA, Virginia, VA Tech, Iowa State, GA Tech, Howard, Colorado, Arizona, Cal – Berkley, North Carolina A&T, Puerto Rico, Michigan, Carnegie Mellon, Ohio State, Purdue, Illinois, Cornell, Texas, Texas A&M, Stanford, USC

This chapter is more of the same methods we saw in the last two chapters. You perform hypothesis tests and confidence intervals for proportions and for differences between proportions.

The tricky bits: (1) you have to keep track of which version of the proportion you will use for testing assumptions and for calculating standard deviations/std errors. Simply use the "best" information available. (2) Recognize when the inference is about proportions and when it is about measurements (chapter 11 methods). If you use X when you should have used p you let the reader know that you are confused.

When you have a 1 proportion hypothesis test, you have a hypothesized value for p that you use for both checking assumptions (conditions) and calculating the std dev.

When you are constructing a 1 proportion confidence interval, use the best info you have--the sample proportion. This is the lucky case where you just record the number of successes and the number of failures when you are checking the conditions. Because the estimator is used, we call the sqrt(p-hat(1- p-hat)/n) the standard error. Estimate------>>>>std error.

When you have a 2 proportion hypothesis test and you are testing to see if the two proportions are the same, well, doesn't that mean that the two proportions that you use in the std error calculation should be the same? In this case you generate a "pooled" estimator (Pooled sample proportion = sum of x / sum of n)to use for condition checking and for std error calculations. When checking conditions, use the pooled proportion * each value of n and (1 - the pooled proportion) * each value of n and make sure that each product is 5 or more.

On the other hand, when you are creating a 2 proportion confidence interval for the difference, you are not assuming that the proportions are the same, so the proportions must be checked separately and the formula for the std error resembles the formulas for two-sample conf interval std errors from Ch 11 a little bit. Checking conditions: for each sample check p-hat for that sample * sample size and (1-p-hat for that sample) * sample size.

7 comments:

Mrs.L said...

In answer to the question (that mysteriously disappeared!!!), the multiple choice answers that show up in the answer section are different ways that the answer could have been typed in and recognized as correct. It's the next best thing to artificial intelligence, don't ya think?

ihavenoideawhatsgoingon said...

i got a 60 and a 40


guess i need to study

ConfusedInISS said...

45 and a 72... odd distribution. Pun intended

ok so im totally lost.... is there a possiblility of you being available tomarrow morning?????

ps. i have the money for the ap exam sry i forgot 2 bring it today

Mrs.L said...

I will be available at about 8:00AM.

Ms. Gasaway says she needs all checks to be delivered by 8:00 Wednesday morning. Let's get them in, stat people.

Mrs.L said...

Clarification of what we discussed in class today:

The formula for the SE in a confidence interval and in a HT are actually very similar. You recognize that the SE in the CI is the SQRT of the sum of two variances, but did you realize that the variance you use for HT is ALSO the SQRT of the sum of two vcariances??? That's why it is so important that the two samples be independent.

Consider SQRT(p-hat (1 - phat)(1/n1 + 1/n2)).

If you distribute the p-hat (1-phat) (let's call them ph and qh for the sake of simplifying this stuff)

you get

SQRT(ph*qh/n1 + ph*qh/n2)which is the same as the SE for the CI EXCEPT for the values you use in the numerators. For HTs you use the pooled sample proportion: for CIs the individual sample p-hats.

Mrs.L said...

How about this for Confidence Intervals for the difference between two proportions. . .?

p hat1(n1)>5, (1-p hat1)(n1)>5

p hat2(n2)>5, (1-p hat2)(n2)>5

Independent

SRS

Notice that The second set of inequalities involves phat2 and n2.

For a hypothesis test use the pooled sample proportion instead of both phat1 and phat2. Terefore, the conditions are
p hat(n1)>5, (1-p hat)(n1)>5

p hat(n2)>5, (1-p hat)(n2)>5

Independent

SRS

:)

Mrs.L said...

If you only have the sample standard deviation (here, you are testing means), you have not satisfied conditions for using a z-test, so you have to use the t-test.

When testing proportions, you have a good guess at the population std deviation, so you use Z<.