Top




nextupprevious
Next: About this document

  1. In a randomized controlled experiment,
    1. the control group always gets a placebo.
    2. the subjects randomly assign themselves to the treatment and control groups.
    3. the treatment group is similar to the control group, apart from the treatment.
    4. (a), (b) and (c) are all true.
    5. none of the above are true.
  2. In an observational study,
    1. the control group always gets a placebo.
    2. the experimenter randomly assigns the subjects to the treatment and control groups.
    3. the treatment group is similar to the control group, apart from the treatment.
    4. an association between two things can be established.
    5. none of the above are true.
  3. In a double blind experiment,
    1. a placebo is not necessary.
    2. neither the subjects nor the people who evaluate the responses actually know who got the treatment.
    3. only the people who evaluate the responses know who got the treatment.
    4. both (a) and (b) are true.
    5. both (a) and (c) are true.
  4. Confounding with unrelated factors can be a serious problem in
    1. controlled experiments.
    2. randomized controlled experiments.
    3. observational studies.
    4. both (a) and (c).
    5. both (b) and (c).
  5. Which of the following is a qualitative measurement?
    1. GPA (grade point average).
    2. Hair color.
    3. Weekly income.
    4. Both (a) and (c) are qualitative measurements.
    5. None of the above are qualitative measurements.
  6. In a series of repeated measurements, bias
    1. changes from measurement to measurement, sometimes up and sometimes down.
    2. does not affect the long run average of repeated measurements.
    3. can easily be detected just by looking at the measurements.
    4. satisfies both (a) and (b).
    5. satisfies none of the above statements.
  7. When data is collected from a series of repeated measurements,
    1. both the average and the standard deviation of the data are strongly influenced by outliers in the data.
    2. chance error changes from measurement to measurement; bias does not.
    3. the likely size of the chance error can be estimated by the SD of the sequence of measurements.
    4. both (b) and (c) are true.
    5. all of the above statements are true.
  8. A histogram
    1. represents percents by area.
    2. consists of a set of blocks where the area of each block equals the percentage of cases in the corresponding class interval.
    3. of quantitative data always has about 68% of the observations within 1 SD of the average.
    4. satisfies both (a) and (b).
    5. satisfies all of the above.
  9. For a list of observations following a normal curve,
    1. about 95% of the observations are within 2 SDs of the average.
    2. about 50% of the observations are less than the median.
    3. converting an observation into standard units tells how many SDs it is above (+) or below (-) the average.
    4. both (a) and (c) are true.
    5. all of the above statements are true.
  10. The correlation coefficient between two variables
    1. is not affected by outliers.
    2. is always between 0 and 1.
    3. is 0 when there is tight clustering of the points around the SD line.
    4. is a measure of the linear association between the two variables.
    5. satisfies all of the above.
  11. Which one of the following statements is true?
    1. If 3.5 is the 25th percentile of a set of data then the interquartile range for the data must be 7.
    2. The standard deviation of a set of data can never be zero.
    3. The slope of a straight line is equal to run/rise.
    4. If there is a strong association between two variables then the correlation coefficient must be close to one.
    5. None of the statements (a) - (d) are true.
  12. In a completely randomized design,
    1. the subjects assign themselves to the treatment and control groups.
    2. the investigators use their best judgement to assign the subjects to the treatment and control groups.
    3. historical controls may sometimes be used if the investigators have good reason for doing so.
    4. there is a serious possibility of confounding with unknown factors.
    5. none of the above are true.
  13. In an observational study,
    1. confounding is a potential problem.
    2. a placebo must always be used.
    3. the subjects are assigned at random by the investigators to the treatment and control groups.
    4. the investigators use their best judgement to assign the subjects to the treatment and control groups.
    5. confounding is not a potential problem.
  14. A qualitative variable is one which
    1. generally takes on numerical values.
    2. generally does not take on numerical values.
    3. generally takes on discrete numerical values.
    4. is never used in statistical studies.
    5. satisfies both (b) and (d) above.
  15. An outlier
    1. has no effect on the standard deviation of a variable.
    2. is an observation with an extreme value
    3. is caused by confounding of variables in the experiment.
    4. has no effect on the average of a variable.
    5. always means that there is an error in recording experimental data.
  16. Bias in statistical measurements
    1. generally varies from measurement to measurement.
    2. is caused by random measurement errors.
    3. is a systematic error in all measurements and effects all measurements the same way.
    4. can generally be ignored because of its random nature.
    5. can be readily detected by looking at the measurements themselves.
  17. In a double blind experiment,
    1. a control group is not necessary.
    2. neither the subjects nor the people who evaluate the responses actually know who got the treatment.
    3. only the subjects know who got the treatment.
    4. the possibility of bias is increased.
    5. both (a) and (b) are true.
  18. In a completely randomized experiment,
    1. the treatment group is similar to the control group, apart from the treatment.
    2. the treatment group always gets a placebo.
    3. the subjects always know which group they belong to.
    4. there is a serious possibility of confounding with unknown factors.
    5. none of the above are true.
  19. In an observational study,
    1. the control group always gets a placebo.
    2. the subjects choose which group they belong to.
    3. the treatment group is similar to the control group, apart from the treatment.
    4. confounding is not a potential problem.
    5. none of the above are true.
  20. A confounding factor
    1. can never be controlled for in an observational experiment.
    2. always increases the standard deviation of the data.
    3. may cause the results of an experiment to be misinterpreted.
    4. satisfies both (a) and (b).
    5. satisfies both (b) and (c).
  21. Confounding with unrelated factors can be a serious problem in
    1. randomized controlled experiments.
    2. controlled experiments.
    3. observational studies.
    4. both (a) and (c).
    5. both (b) and (c).
  22. In a series of repeated measurements, bias
    1. generally varies from measurement to measurement, sometimes up and sometimes down.
    2. is a systematic error in all measurements and affects all measurements the same way.
    3. can generally be ignored because of its random nature.
    4. can be readily detected by looking at the measurements themselves.
    5. satisfies both (a) and (c).
  23. When data is collected from a series of repeated measurements,
    1. both the median and the standard deviation of the data are strongly affected by outliers in the data.
    2. the median is used as a numeric measure of the spread of the data.
    3. the standard deviation of the data can be used to estimate the bias in the measurements.
    4. chance error in the measurements affects all measurements the same way.
    5. none of the above statements are true.
  24. A histogram
    1. is a graph which is used to summarize data.
    2. represents percents by area.
    3. consists of a set of blocks where the height of each block equals the percentage of cases in the corresponding class interval.
    4. satisfies both (a) and (b).
    5. satisfies all of the above.
  25. When the histogram of a set of data follows the normal curve,
    1. about 68% of the data is within one standard deviation (SD) of the average.
    2. about 90% of the data is within two SDs of the average.
    3. about 95% of the data is within two SDs of the average.
    4. both (a) and (b) are true.
    5. both (a) and (c) are true.
  26. Which one of the following statements (a) - (d) is not true?
    1. If 3.5 is the 25th percentile of a set of data then the interquartile range for the data must be 7.
    2. The standard deviation of a set of data can never be negative.
    3. The normal curve is symmetric about zero.
    4. The 50th percentile of a set of data is the same as the median of the data.
    5. All of the above statements are true.
  27. For a straight line plotted on an x-y graph,
    1. the slope equals rise/run.
    2. the slope equals run/rise.
    3. the intercept is where the line crosses the x- axis.
    4. both (a) and (c) are true.
    5. both (b) and (c) are true.
  28. The correlation coefficient
    1. measures association in a scatter plot.
    2. always lies between 0 and 1.
    3. implies causation if its value is close to 1.
    4. is used to determine the slope of the SD line.
    5. satisfies none of the above.
  29. The correlation coefficient
    1. is not affected by outliers.
    2. has the property that a value of -1 indicates the weakest possible association and a value of +1 indicates the strongest possible association.
    3. is reliable if the scatter plot consists of two completely separate clusters.
    4. is a good measure of nonlinear association.
    5. satisfies none of the above.
  30. In a regression setting for predicting y from x,
    1. the R.M.S. error is generally smaller than the SD of y.
    2. the R.M.S. error is generally smaller than the SD of x.
    3. the R.M.S. error is generally larger than the SD of y.
    4. the R.M.S. error is generally larger than the SD of x.
    5. both (a) and (b) are true.
  31. In a regression setting for predicting y from x,
    1. the regression line for the residual plot is vertical.
    2. the average of the residual values is 1.
    3. the residuals should show a strong pattern if the analysis is to be reliable.
    4. the average of the residual values is 0.
    5. both (c) and (d) are true.
  32. Heteroscedasticity in a regression setup
    1. makes the R.M.S. error more reliable.
    2. makes the prediction error more uniform.
    3. means that the scatter diagram has a generally elliptical (football) shape.
    4. means that the scatter diagram does not have an elliptical shape.
    5. both (a) and (c) are true.
  33. When summarizing the relationship between two variables x and y
    1. the average of the x values is needed.
    2. the average of the y values is needed.
    3. the SD of the x values is needed.
    4. the SD of the y values is needed.
    5. the correlation coefficient is needed.
    6. all of the above are true.
  34. The following is not a property of Chance (or probability),
    1. Chance lies between 0% and 100%.
    2. The chance of an event is 100% minus the chance of the opposite event.
    3. The chance of at least one of two events happening is always the sum of the chances of the separate events.
    4. If two events are independent, the chance that both will happen equals the product of chances of the separate events.
    5. Chance can be thought of as the percent of the time that an event occurs when the basic process is repeated over and over.
  35. For two mutually exclusive events
    1. the probability of both events occurring is the product of the individual probabilities.
    2. the probability of both events occurring is the sum of the individual probabilities.
    3. the probability of both events occurring is 1.
    4. the probability of both events occurring is 0.
    5. none of the above are true.
  36. The event A has a 40% chance of happening. The event B has a 60% chance of happening. If the chance that both events happen together is 24% then
    1. the two events are mutually exclusive.
    2. the two events are independent.
    3. the two events are dependent.
    4. both (a) and (b) are true.
    5. no conclusion can be drawn because there isn't enough information.
  37. The event A has a 30% chance of happening. The event B has a 60% chance of happening. If the chance that at least one of the events happens is 80% then
    1. the two events are not mutually exclusive.
    2. the two events are mutually exclusive.
    3. the two events are independent.
    4. the regression line between the events is not linear.
    5. none of the above is true.
  38. If there is a strong association between two variables, then
    1. there is a cause and effect relationship between the variables.
    2. the correlation coefficient must be close to one.
    3. knowing one variable helps a lot in predicting the other.
    4. both (b) and (c) are true.
    5. all of the above statements are true.
  39. In a regression setting for predicting y from x,
    1. the r.m.s. error can be calculated as tex2html_wrap_inline338.
    2. the units for the r.m.s. error are the same as the units for the variable being predicted.
    3. if a regression is appropriate for the data, then the residual plot should show a heteroscedastic pattern.
    4. both (b) and (c) are true.
    5. all of the above statements are true.
  40. In virtually all test-retest situations,
    1. the bottom group on the first test will on average show some improvement on the second test.
    2. the top group on the first test will on average show some improvement on the second test.
    3. the regression fallacy consists in thinking that the regression effect shown in the exam scores on the two tests must be due to the spread of the data around the SD line.
    4. both (a) and (b) are true.
    5. all of the above statements are true.
  41. In a regression setting for predicting y from x, the regression line and the SD line are the same
    1. if the correlation coefficient is 0.
    2. if the correlation coefficient is -1.
    3. if the SD of y-values equals the SD of the x-values.
    4. if the average of the y-values equals the average of the x-values.
    5. if both (c) and (d) hold true for the data.
  42. If the correlation coefficient between two variables is zero (r = 0), then
    1. the scatter plot of the two variables shows no pattern whatsoever.
    2. none of the points on the scatter plot lie within 1 SD of the regression line.
    3. there is a tight clustering about the SD line.
    4. there is no linear association between the variables.
    5. all of the above statements are true.
  43. The correlation coefficient between two variables
    1. is affected by interchanging the variables.
    2. is affected by multiplying all of the values of one variable by the same positive number.
    3. is affected by adding the same number to all of the values of one variable.
    4. is affected by multiplying all of the values of one variable by the same negative number.
    5. always lies between 0 and 1.
  44. In a regression setting for predicting y from x,
    1. the regression line is just a smoothed version of the graph of averages.
    2. the regression line passes through the point of averages.
    3. the r.m.s. error measures the deviation of the scatter diagram's points from the SD line.
    4. both (a) and (b) are true.
    5. all of the above statements are true.
  45. If A and B are independent events,
    1. the chance that A happens given that B happened is the same as the chance that B happens.
    2. the two events A and B must be mutually exclusive.
    3. the chance of A and B both happening is equal to the chance of A plus the chance of B.
    4. both (b) and (c) are true.
    5. none of the above statements are true.
  46. For two mutually exclusive events,
    1. the probability that at least one of the events occurs is the sum of the individual probabilities.
    2. the probability that at least one of the events occurs is the product of the individual probabilities.
    3. the probability that both of the events occur is the sum of the individual probabilities.
    4. the probability that both of the events occur is the product of the individual probabilities.
    5. none of the above statements are true.
  47. The event A has a 30% chance of happening. The event B has a 60% chance of happening. If the chance that both events happen together is 18% then
    1. the two events are mutually exclusive.
    2. the two events are dependent.
    3. the chance that A happens given that B happens is 30%.
    4. the chance that A happens given that B happens is 18%.
    5. both (a) and (b) are true.
  48. When drawing tickets at random with replacement from a box,
    1. the draws are independent.
    2. the draws are mutually exclusive.
    3. the draws are dependent.
    4. both (a) and (b) are true.
    5. both (b) and (c) are true.
  49. Under certain requirements the binomial formula, tex2html_wrap_inline386, can be used to calculate the chance that an event occurs exactly k times in n trials. Which of the following is not one of the requirements for using the binomial formula?
    1. The trials must be independent.
    2. The value of n must be fixed in advance.
    3. The value of p must be the probability that the event will occur on any particular trial.
    4. The value of p cannot change from trial to trial.
    5. All of the above requirements are needed to use the binomial formula.
  50. The binomial formula can be used to calculate
    1. the probability of getting exactly 3 heads in 10 tosses of a coin.
    2. the probability of getting exactly 3 aces before getting a six when rolling a fair die.
    3. the probability of having exactly 3 red cards among the top 10 cards in a well shuffled standard deck of cards.
    4. both (a) and (b).
    5. both (a) and (c).
  51. When tossing a fair coin and counting the number of heads obtained, the number of heads equals half the number of tosses plus a chance error. The law of averages says that
    1. the chance error is small if the number of tosses is large.
    2. the chance of getting 50 heads out of 100 tosses is the same as getting 500 heads out of 1000 tosses.
    3. the number of heads equals half the number of tosses if the number of tosses is large.
    4. the percentage of heads is likely to be close to 50% if the number of tosses is large.
    5. both (a) and (d) are true.
  52. Which of the following is  not a property of probability histograms?
    1. A probability histogram represents chance by area.
    2. The total area under a probability histogram is 100%.
    3. A probability histogram represents data obtained from a sample.
    4. A probability histogram always follows the normal curve.
    5. Both (c) and (d) are not properties of probability histograms.
  53. Draws are made at random with replacement from a box, and the number of draws is getting larger and larger. Which of the following statements is not true?
    1. The probability histogram for the sum (when put in standard units) follows the normal curve more and more closely.
    2. The histogram for the numbers drawn follows the probability histogram for the numbers in the box more and more closely.
    3. The histogram for the contents of the box (when put in standard units) follows the normal curve more and more closely.
    4. The probability histogram for the numbers in the box remains the same.
    5. The expected value of the sum locates the center of the probability histogram for the sum.
  54. A fair coin is tossed 100 times landing heads 53 times. The last 9 tosses of the coin were all heads. Which of the following best characterizes what will happen if the coin is tossed one more time?
    1. The chance of getting a head is greater than 50%.
    2. The chance of getting a head is less than 50%.
    3. The chance of getting a head is exactly 50%.
    4. The chance of getting a head is exactly 53%.
    5. Both (a) and (d) are true.
  55. When sampling from a population, which of the following is generally not true?
    1. The method of choosing a sample matters a lot regardless of the sample size.
    2. The best methods of choosing a sample are probability methods which involve an objective chance process to pick the sample.
    3. The parameters are estimated by statistics which are numbers computed from the sample.
    4. The larger the population is the larger the sample size should be.
    5. Sampling is used because it is impractical to study the entire population.
  56. The Literary Digest poll of the 1936 presidential election
    1. was an example of an excellent sampling procedure.
    2. was the first poll to accurately predict the next president of the United States.
    3. had a very low non-response rate.
    4. was an example of a biased sampling procedure.
    5. satisfied items (a), (b) and (c).
  57. When sampling from a population, which of the following is true?
    1. Non-response bias is not important.
    2. Non-response bias is important but can be overcome by increasing the sample size.
    3. Selection bias is not important.
    4. Selection bias is important and cannot be overcome by increasing the sample size.
    5. Selection bias is important but can be overcome by increasing the sample size.
  58. Simple random sampling
    1. means drawing at random with replacement.
    2. is very similar to quota sampling.
    3. means drawing at random without replacement.
    4. introduces selection bias.
    5. both (c) and (d) are true.
  59. In a simple random sample the difference between a sample percentage and a population percentage is
    1. due to sample selection bias, the size of which cannot be estimated.
    2. due to sample selection bias, the size of which can be estimated by the SE.
    3. due to chance error, the size of which cannot be estimated.
    4. due to chance error, the size of which can be estimated by the SE.
    5. explained by (a) and (d) combined.
  60. When estimating a population percentage from a sample,
    1. the absolute size of the sample determines the accuracy of the estimate.
    2. a correction factor is never needed if the sample is a simple random sample.
    3. doubling the sample size makes the SE increase by a factor of 2.
    4. doubling the sample size makes the SE decrease by a factor of 2.
    5. the size of the sample relative to the population determines the accuracy of the estimate.
    6. Both (a) and (e) are true.
  61. Under certain requirements the binomial formula, tex2html_wrap_inline386, can be used to calculate the chance that an event occurs exactly k times in n trials. Which of the statements (a) - (d) is not one of the requirements for using the binomial formula?
    1. The trials must be dependent.
    2. The value of n must be fixed in advance.
    3. The value of p must be the probability that the event will occur on any particular trial.
    4. The value of p cannot change from trial to trial.
    5. All of the above requirements are needed to use the binomial formula.
  62. A single die is rolled five times. We wish to use the binomial formula to calculate the probability that exactly two of the rolls had a three showing. In this case
    1. the correct value for n is 2.
    2. the correct value for p is 1/2.
    3. the correct value for k is 2.
    4. both (b) and (c) are true.
    5. it is improper to use the binomial formula.
  63. When tossing a fair coin and counting the number of heads obtained, the law of averages states that
    1. the number of heads equals the number of tosses plus a chance error.
    2. the number of heads equals half the number of tosses if the number of tosses is large.
    3. the percentage of heads equals 50% if the number of tosses is large.
    4. both (a) and (c) are true.
    5. none of the above are true.
  64. When drawing one hundred numbered tickets at random with replacement from a box,
    1. the sum of the tickets is equal to the expected value for the sum of the draws plus a chance error, where the size of the chance error is given by the SD of the box.
    2. the expected value for the sum of the draws is given by tex2html_wrap_inline416.
    3. the SE of the box is given by tex2html_wrap_inline418.
    4. all of the above are true.
    5. none of the above are true.
  65. A probability histogram
    1. represents chance by area.
    2. is the same thing as an empirical histogram.
    3. must be based on a large number of observations.
    4. always follows the normal curve.
    5. satisfies both (a) and (d).
  66. Draws are made at random with replacement from a box. Which of the following statements is true?
    1. The histogram for the contents of the box (when put in standard units) follows the normal curve if the number of draws is large.
    2. The probability histogram for the sum (when put in standard units) follows the normal curve if the number of draws is large.
    3. The probability histogram for the sum (when put in standard units) always follows the normal curve regardless of the number of draws.
    4. The probability histogram for the sum (when put in standard units) follows the normal curve when the number of draws is large only if the contents of the box also follow a normal curve.
    5. Statements (a) and (b) are both true.
  67. Here are two situations: (1) A fair coin is tossed 100 times and the percentage of heads is recorded. (2) A fair coin is tossed 1000 times and the percentage of heads is recorded. Which of the following statements is true?
    1. The chance of getting between 45% and 55% heads is greater in situation (1) than it is in situation (2).
    2. The chance of getting between 45% and 55% heads is greater in situation (2) than it is in situation (1).
    3. The chance of getting less than 45% heads is greater in situation (1) than it is in situation (2).
    4. Both (a) and (c) are true.
    5. Both (b) and (c) are true.
  68. When sampling from a population, if a biased selection procedure is used
    1. the bias can be reduced by increasing the sample size.
    2. the bias is the chance error which varies from individual to individual.
    3. the bias is unimportant if the SE of the sample is small.
    4. both (a) and (c) are true.
    5. none of the above are true.
  69. The Literary Digest poll of the 1936 presidential election
    1. was the first poll to accurately predict the next president of the United States.
    2. suffered from non-response bias.
    3. suffered from selection bias.
    4. suffered from both selection bias and non-response bias.
    5. satisfies none of the above.
  70. In a simple random sample, increasing the sample size by a factor of four
    1. decreases the SE for a percentage by a factor of four.
    2. decreases the SE for the sum by a factor of four.
    3. decreases the SE for a percentage by a factor of two.
    4. decreases the SE for the sum by a factor of two.
    5. both (c) and (d) are true.
  71. When estimating a population percentage from a sample,
    1. a large population needs a larger sample size than a small population to obtain the same level of accuracy of the estimate.
    2. a correction factor is never needed if the sample is a simple random sample.
    3. the accuracy of the estimate does not depend on the sample size.
    4. if the sample is a large part of the population then the estimate is likely to be unreliable.
    5. none of the above are true.
  72. Among all applicants to a certain university one year, the math SAT scores averaged 535, the SD was 100, and the scores followed the normal curve. One applicant scored 710 on the math SAT. What was her percentile rank (among all the applicants)?
    1. Her percentile rank was 80%.
    2. Her percentile rank was 84%.
    3. Her percentile rank was 92%.
    4. Her percentile rank was 96%.
    5. Her percentile rank was 98%.
  73. In an observational study,
    1. the investigators always assign the subjects to the treatment and control groups.
    2. the subjects are randomly assigned to the treatment and control groups.
    3. a placebo must always be used for the control group.
    4. confounding is a potential problem.
    5. both (a) and (c) are true.
  74. When performing repeated measurements using the same process, the bias
    1. generally changes from measurement to measurement.
    2. is a systematic error in all measurements and effects all measurements the same way.
    3. is caused by random measurement errors.
    4. can be readily detected by looking at the measurements themselves.
    5. satisfies both (a) and (b) above.
  75. When studying the relationship between two variables, the correlation coefficient
    1. has the property that a value of -1 indicates the weakest possible association and a value of +1 indicates the strongest possible association.
    2. measures linear association in a scatter plot of the two variables.
    3. always lies between 0 and 1.
    4. is used to determine the slope of the SD line for the variables.
    5. satisfies none of the above.
  76. Homoscedasticity in a regression setup
    1. makes the R.M.S. error very unreliable.
    2. means that the regression effect is due to something other than the spread around the SD line.
    3. means that the scatter diagram has a generally elliptical (football) shape.
    4. means that the scatter diagram does not have an elliptical shape.
    5. both (a) and (d) are true.
  77. A random sample of 10,000 voters is taken and classified according to income status and political party affiliation. The following table summarizes the observations:

  78. tabular226

    To test the null hypothesis that the percentage of voters who are Republicans and who have annual incomes exceeding $20,000 is 10%, against the alternative hypothesis that the percentage is greater than 10% the following test of significance should be used.

    1. A z-test using the normal curve.
    2. A two sample z-test for the difference in percentages.
    3. A t-test using Student's curve with 1 degree of freedom.
    4. tex2html_wrap_inline428-test (chi-squared test) with one degree of freedom.
    5. None of the above tests should be used.
  79. When drawing at random with replacement from a box of tickets, increasing the sample size (number of draws) by a factor of 4
    1. multiplies the SE for the average by a factor of 4.
    2. divides the SE for the average by a factor of 4.
    3. multiplies the SE for the average by a factor of tex2html_wrap_inline430.
    4. divides the SE for the average by a factor of tex2html_wrap_inline430.
    5. has no effect at all on the SE for the average.
  80. Which one of the following statements about the Gauss model for measurement error is not true?
    1. The average of the error box is zero.
    2. The actual measurement is the exact value plus a chance error.
    3. When estimating the exact value, repeated measurements will not improve the overall accuracy of the estimate.
    4. The SD of many repeated measurements can be used to estimate the SD of the error box.
    5. The chance variability is in the measuring process, not the thing being measured.
  81. Which of the following is not needed to compute a confidence interval for the population average?
    1. The sample size.
    2. The confidence level.
    3. The sample average.
    4. The sample SD.
    5. All of the above are needed to compute a confidence interval for the population average.
  82. When performing a test of significance a box model is set up, the null and alternative hypotheses are formulated and
    1. the Student's curve is used when the SD of the box is known.
    2. the Student's curve is used when the histogram of the box is quite different from the normal curve.
    3. the Student's curve is used when the number of measurements (draws from the box) is less than 25 and the histogram of the box is not too different from a normal curve.
    4. the Student's curve is used when the number of measurements (draws from the box) is very large but the histogram is quite different from the normal curve.
    5. both (a) and (d) are true.
  83. When calculating the standard error from a box model which one of the following statements is not true?
    1. SE for sum = tex2html_wrap_inline434x SD of box.
    2. SE for average = tex2html_wrap_inline436.
    3. SE for average = tex2html_wrap_inline438.
    4. SE for percent = tex2html_wrap_inline440x 100%, where SE for count = SE for sum from a 0 - 1 box.
    5. All of the above statements are true.
  84. When performing a test of significance,
    1. the P-value gives the chance that the null hypothesis is true.
    2. the larger the P-value the more evidence there is that the alternative hypothesis is true.
    3. if the P-value is less than 1% then the result is not significant.
    4. the P-value is computed assuming the null hypothesis is true.
    5. both (a) and (b) are true.
  85. When drawing tickets at random with replacement from a box, the SD of the box can be estimated by
    1. using the SE of the sample average to estimate the SD of the box.
    2. using the sample average to estimate the SD of the box.
    3. using the SD of the sample to estimate the SD of the box.
    4. using the SE for the sum of the draws to estimate the SD of the box.
    5. none of the above methods.
  86. Which one of the following statements about histograms is not true?
    1. If a histogram for data has a long right tail then the median for the data is less than the average of the data.
    2. If two lists of data have exactly the same average and the same SD then their histograms must also be the same.
    3. A histogram represents percents by area.
    4. Half of the area of a histogram is to the left of the median.
    5. All of the above statements are true.
  87. A simple random sample of size 25 is drawn (without replacement) from a population of size 100. Which one of the following statements describes the correction factor needed for the SE of the average?
    1. The correction factor is about .25.
    2. The correction factor is about .76.
    3. The correction factor is about .87.
    4. The correction factor is about .99.
    5. No correction factor is needed for the SE of the average.
  88. To test a genetic model, an experimenter sets up a null hypothesis and a one sided alternative hypothesis, collects data and computes the z-statistic. A z-statistic of zero would be expected according to the null hypothesis. The experimenter obtains a z-statistic of 1.65. Which of the following statements is true?
    1. The observed level of significance (P-value) is about 5%.
    2. The chance the model is correct is about 95%.
    3. The chance the model is correct is about 5%.
    4. The chance of getting results this extreme is about 95%.
    5. The chance of getting results this extreme is about 5%.
  89. Which of the following statements is true?
    1. If a histogram for data has a long right tail then the median for the data is greater than the average of the data.
    2. In an observational study confounding is a potential problem.
    3. An outlier has no effect on the average of a variable.
    4. Both (a) and (b) are true.
    5. None of the above are true.
  90. Heteroscedasticity in a regression setup
    1. means that the regression effect is due to something other than the spread around the SD line.
    2. means that the scatter diagram has a generally elliptical (football) shape.
    3. means that the scatter diagram does not have an elliptical shape.
    4. always makes the R.M.S. error very small.
    5. both (b) and (d) are true.
  91. When studying the relationship between two variables, the correlation coefficient
    1. has the property that a value of -1 indicates the weakest possible association and a value of +1 indicates the strongest possible association.
    2. measures the linear association in a scatter plot of the two variables.
    3. is used to determine the slope of the SD line for the variables.
    4. satisfies both (a) and (b) above.
    5. satisfies none of the above.
  92. The binomial formula can be used to calculate
    1. the probability of getting exactly 4 heads in 10 tosses of a coin.
    2. the probability of getting exactly 4 heads before getting a tail when tossing a fair coin.
    3. the probability of having exactly 4 black cards among the top 10 cards in a well shuffled standard deck of cards.
    4. both (a) and (b).
    5. both (a) and (c).
  93. A percentage obtained from a simple random sample
    1. has an expected value equal to the population percentage.
    2. has an SE equal to the population SD.
    3. has an SE equal to the population SD divided by the population size.
    4. satisfies both (a) and (c) above.
    5. satisfies none of the above.
  94. When drawing a sample from a box for which the SD is unknown, a reasonable estimate for the SE of the average of the sample is
    1. the SD of the sample divided by the sample size.
    2. the SD of the sample divided by the square root of the sample size.
    3. the SD of the sample multiplied by the square root of the sample size.
    4. the SD of the sample multiplied by the sample size.
    5. not possible to estimate since there is no reasonable way to estimate the SE of the average without knowing the SD of the box.
  95. In the model for measurement error presented in the text,
    1. when the Gauss model applies the SD of a series of repeated measurements can be used to estimate the SD of the error box.
    2. when the Gauss model applies, the SD of the error box is always known.
    3. when the Gauss model applies, any bias in the measurements can be easily detected.
    4. the Gauss model can be written as: exact value = exact value + bias + chance error.
    5. both (b) and (c) are true.
  96. In Mendel's study of genetics,
    1. his data did not fit his theory very well, just barely supporting his claims.
    2. his data did fit his theory very well, almost too good to be true.
    3. his data fit his theory in a way which is to be expected, neither too well nor too poorly.
    4. he didn't even use data to support his theory.
    5. none of the above statements is true.
  97. When calculating the SE for the sum of draws from a box, the correction factor
    1. is used only when the draws are made with replacement.
    2. is always a number that is less than one.
    3. is used only when the number of draws from the box is very small compared to the number of tickets in the box and the draws are made without replacement.
    4. satisfies all of the above statements.
    5. satisfies none of the above statements.
  98. Los Angeles has about four times as many registered voters as San Diego. A simple random sample of registered voters is taken by each city to estimate the percentage who will vote for school bonds. Assume the percentage who will vote for the school bonds is the same for each city. If a sample of 4,000 voters is taken in Los Angeles and a sample of 1,000 voters is taken in San Diego then the tex2html_wrap_inline452(tex2html_wrap_inline452 means SE for percent) for the Los Angeles estimate
    1. will be about the same as the tex2html_wrap_inline452for the San Diego estimate.
    2. will be about twice as big as the tex2html_wrap_inline452for the San Diego estimate.
    3. will be about four times bigger than the tex2html_wrap_inline452for the San Diego estimate.
    4. will be about half as big as the tex2html_wrap_inline452for the San Diego estimate
    5. cannot be determined unless the percentage of the sample is given.
  99. When performing a test of significance,
    1. rejection of the null hypothesis means that the alternative hypothesis is accepted.
    2. the alternative hypothesis says that an observed difference is real, not just due to chance error.
    3. the null hypothesis says that an observed difference is due just to chance error.
    4. both (b) and (c) are true.
    5. all of the above statements are true.
  100. When performing a test of significance, a test statistic
    1. plays no role in testing hypotheses.
    2. always follows a normal distribution.
    3. is used to measure the difference between the data and what is expected under the null hypothesis.
    4. satisfies both (b) and (c) above.
    5. satisfies none of the above statements.
  101. When performing a test of significance,
    1. the P-value of a test gives the chance that the null hypothesis is true.
    2. the P-value of a test gives the chance, under the null hypothesis, of getting a test statistic as extreme or more extreme as the observed one.
    3. the P-value is always computed assuming the null hypothesis is true.
    4. both (b) and (c) are true.
    5. all of the above statements are true.
  102. A newspaper editor of a large town claims that 85% of the people age 18 or over in the town read newspapers and 15% do not. To test his claim a simple random sample of 20 people (aged 18 or over) is taken. Which of the following tests of significance should be used?
    1. A two sample z-test for the difference in percentages.
    2. tex2html_wrap_inline428-test (chi-squared test) with 19 degrees of freedom.
    3. A t-test using Student's curve with one degree of freedom.
    4. A z-test using the normal curve.
    5. None of the above tests should be used.
  103. A laser altimeter can measure elevation to within a few inches. As part of an experiment, 25 measurements were made on a mountain peak. These averaged out to be 81,411 inches, and their SD was 30 inches. Which of the following statements is true?
    1. The elevation of the mountain peak is estimated as 81,411 inches and is likely to be off by 30 inches or so.
    2. 81,411 tex2html_wrap_inline47812 inches is a 95% confidence interval for the average of the 25 readings.
    3. 81,411 tex2html_wrap_inline47812 inches is a 95% confidence interval for the elevation of the mountain peak.
    4. About 95% of the readings were in the range 81,411 tex2html_wrap_inline47812 inches.
    5. There is a 95% chance that the next reading will be in the range 81,411 tex2html_wrap_inline47812 inches.
  104. The two sample z-statistic is calculated using
    1. the size of the two samples.
    2. the average of the two samples.
    3. the SDs of the two samples.
    4. all of the above.
    5. none of the above.
  105. In a certain town there are one million eligible voters. To study the relationship between sex and participation in the last election a simple random sample of size 10,000 was chosen. The null hypothesis is that sex and voting are independent. A test statistic of 3.90 was obtained. Which of the following statements is true?
    1. The correct P-value is between 10% and 30% based on the tex2html_wrap_inline428table.
    2. The correct P-value is between 1% and 5% based on the tex2html_wrap_inline428table.
    3. The correct P-value is between 5% and 10% based on the Student's t table.
    4. The null hypothesis should be rejected.
    5. Both (b) and (d) are true.


nextupprevious



 

Eugene Klimko

Thur Nov 18:19:05 EDT 1999


 

 
 



| Top | Home | AnswerKey |