title | subtitle | author | job | framework | highlighter | hitheme | widgets | mode | |||
---|---|---|---|---|---|---|---|---|---|---|---|
Homework 2 for Stat Inference |
Extra problems for Stat Inference |
Brian Caffo |
Johns Hopkins Bloomberg School of Public Health |
io2012 |
highlight.js |
tomorrow |
|
selfcontained |
- These are some practice problems for Statistical Inference Quiz 1
- They were created using slidify interactive which you will learn in Creating Data Products
- Please help improve this with pull requests here (https://github.com/bcaffo/courses) runif(1)
--- &radio The probability that a manuscript gets accepted to a journal is 12% (say). However, given that a revision is asked for, the probability that it gets accepted is 90%. Is it possible that the probability that a manuscript has a revision asked for is 20%?
- Yeah, that's totally possible.
- No, it's not possible.
- It's not possible to answer this question.
*** .hint
*** .explanation
--- &radio Suppose that the number of web hits to a particular site are approximately normally distributed with a mean of 100 hits per day and a standard deviation of 10 hits per day. What's the probability that a given day has fewer than 93 hits per day expressed as a percentage to the nearest percentage point?
- 76%
- 24%
- 47%
- 94%
*** .hint
Let
*** .explanation
round(pnorm(93, mean = 100, sd = 10) * 100)
[1] 24
--- &radio Suppose 5% of housing projects have issues with asbestos. The sensitivity of a test for asbestos is 93% and the specificity is 88%. What is the probability that a housing project has no asbestos given a negative test expressed as a percentage to the nearest percentage point?
- 0%
- 5%
- 10%
- 20%
- 50%
- 100%
*** .hint
*** .explanation We want $$ P(A^c | T_-) = \frac{P(T_- | A^c) P(A^c)}{P(T_- | A^c) P(A^c) + P(T_- | A) P(A)} $$
(.88 * .95) / (.88 * .95 + .07 * .05)
[1] 0.9958
--- &multitext Suppose that the number of web hits to a particular site are approximately normally distributed with a mean of 100 hits per day and a standard deviation of 10 hits per day.
- What number of web hits per day represents the number so that only 5% of days have more hits? Express your answer to 3 decimal places.
*** .hint
Let
*** .explanation 116.449
round(qnorm(.95, mean = 100, sd = 10), 3)
[1] 116.4
round(qnorm(.05, mean = 100, sd = 10, lower.tail = FALSE), 3)
[1] 116.4
--- &multitext Suppose that the number of web hits to a particular site are approximately normally distributed with a mean of 100 hits per day and a standard deviation of 10 hits per day. Imagine taking a random sample of 50 days.
- What number of web hits would be the point so that only 5% of averages of 50 days of web traffic have more hits? Express your answer to 3 decimal places.
*** .hint
Let
*** .explanation 102.326
round(qnorm(.95, mean = 100, sd = 10 / sqrt(50) ), 3)
[1] 102.3
round(qnorm(.05, mean = 100, sd = 10 / sqrt(50), lower.tail = FALSE), 3)
[1] 102.3
--- &multitext
You don't believe that your friend can discern good wine from cheap. Assuming that you're right, in a blind test where you randomize 6 paired varieties (Merlot, Chianti, ...) of cheap and expensive wines
- what is the change that she gets 5 or 6 right expressed as a percentage to one decimal place?
*** .hint
Let
*** .explanation
89.1
round(pbinom(4, prob = .5, size = 6, lower.tail = TRUE) * 100, 1)
[1] 89.1
--- &multitext
Consider a uniform distribution. If we were to sample 100 draws from a
a uniform distribution (which has mean 0.5, and variance 1/12) and take their
mean,
- what is the approximate probability of getting as large as 0.51 or larger expressed to 3 decimal places?
*** .hint
Use the central limit theorem that says
*** .explanation
0.365
round(pnorm(.51, mean = 0.5, sd = sqrt(1 / 12 / 100), lower.tail = FALSE), 3)
[1] 0.365
--- &multitext
If you roll ten standard dice, take their average, then repeat this process over and over and construct a histogram,
- what would it be centered at?
*** .hint
*** .explanation
The answer will be 3.5 since the mean of the sampling distribution of iid draws will be the population mean that the individual draws were taken from.
--- &multitext
If you roll ten standard dice, take their average, then repeat this process over and over and construct a histogram,
- what would be its variance expressed to 3 decimal places?
*** .hint
*** .explanation
The answer will be 0
since the variance of the sampling distribution of the mean is
mean((1 : 6 - 3.5)^2)
[1] 2.917
--- &multitext The number of web hits to a site is Poisson with mean 16.5 per day.
- What is the probability of getting 20 or fewer in 2 days expressed as a percentage to one decimal place?
*** .hint
Let
*** .explanation 1
round(ppois(20, lambda = 16.5 * 2) * 100, 1)
[1] 1