832 Subject Index 5-number summary, 132–133 goodness-of-fit, 585–586 graphing, 69 hypothesis testing, 398–399, 412, 421–422, 448–449 for independent samples, 462–463 inferences, 448–449 Kruskal-Wallis test, 674–675 for matched pairs, 474–475 for means, 339–340, 412 measures of center, 97–98 measures of variation, 115 multiple regression equations and, 559–560 nonlinear regression, 567 normal distribution, 267 normal quantile plots, 299 one-way ANOVA, 622 outliers, 132–133 p chart, 716 Poisson probability distribution, 235 prediction intervals, 549 for proportions, 323–324, 398–399, 448–449 rank correlation, 682–683 regression, 539 runs test, 691 sample size, 323–324 sign test, 654–655 standard deviation, 351–352, 421–422 for standard deviations, 485–486 for two means, 462–463 in two-way ANOVA, 632–633 for variance, 485–486 Wilcoxon rank-sum test, 668 Wilcoxon signed-ranks test, 662 z scores, 257 Sort data, 40 Spearman’s rank correlation coefficient, 677, 743. See also Rank correlation Speed dating, 755 Spurious correlation, 516–517 Standard deviation, 106–110 bootstrap sample and, 359 Chebyshev’s theorem, 113 claims testing about, 416–425, 430 count five for, 484 for critical thinking, 224–226 critical values in, 481 defining, 106, 111 F distribution in, 482 formula for, 107–108 hypothesis test for, 481 for independent samples, 461 Levene-Brown-Forsythe test for, 485 notation, 106, 481 population, 345–352 of population, 110 for probability distribution, 208 properties of, 107 P-values in, 481 range rule of thumb for, 108–110 requirements in, 481 resampling for two, 495–496 of a sample, 106–110 software/calculator results, 351–352, 421–422 technology for, 485–486 test statistic for, 481 for two means, 461 Standard deviations F test for, 480–484 inferences from, 485–486 software/calculator results for, 485–486 Standard error of estimate, 544 Standard error of the mean, 285 Standardized score. See z score Standard normal distribution, 246–260 Statistically stable, 703 Statistical methods, 6 Statistical significance, 7 Statistics. See also specific types in data science, 20–22 defining, 4, 14 holistic, 408 inferential, 152–153 origins of, 6 probability in, 144–145 resistant, 89, 103 sampling distributions of, 274 Stemplots, 63 Stigler, Stephen, 616 Stocks, 107 Stratified sampling, 28, 29 Student t distribution, 331–332, 734 in holistic statistics, 727 important properties of, 406 Subgroup, 706 Subjective probabilities, 147 Subjects, 27 Sugging, 8 Surgery, sham, 384 Surveys low response rates, 8 pitfalls, 32 telephone, 23 wording in, 29 Survivorship bias, 5 Systematic sampling, 28, 29 T Tampering, 705 Tank serial numbers, 334 t distribution, 331–332, 734 student, 331–332 t test. See Student t distribution Taxis, 756 Teacher evaluations, 515 Technology. See also Software/calculator results for claims testing, 406–408 confidence interval and, 317 hypothesis-testing results using, 375 for inferences, 490–497 learning habits and, 1–2 projects, 696–697 P-values and, 515, 594 regression equation and, 531 for standard deviation, 485–486 two proportions and, 444 for variances, 485–486 Telephone surveys, 23 Testimony, expert witness, 319 Test of homogeneity, 595 Test of independence, defining, 591 Test of significance. See Hypothesis testing Test statistic. See also Hypothesis testing calculation of, 618 claims testing, 405, 411, 417 in contingency tables, 591 defining, 379 effect of mean on, 618 for goodness-of-fit, 578, 585 in hypothesis testing, 378, 518 Kruskal-Wallis test, 672 for large samples, 687 for matched pairs, 470–471 notation, 481 for one-way ANOVA, 616 P-value and, 615–616 rank correlation, 678 runs test, 687 in sign test, 648, 653–654 for small samples, 687 for standard deviation, 481 for tests of independence, 591 for two dependent samples, 470–471 for two means, 456 for two proportions, 442–443 value of, 672 for variances, 481 Wilcoxon rank-sum test, 665 Wilcoxon signed-ranks test, 658 Time-series graph, 63–64 Tornadoes, 754 Total deviation, 547 Total variation, 547 Tree diagram, 167 Trimmed mean, 103 True positive/negative, 175, 179 True zero, 19 Tukey test, 625 Twins, 471, 472 Two-factor authentication, 372–373, 392–396 Two-tailed test, 379, 410–411 P-values in, 420 Two-way ANOVA, 626–636 balanced design in, 628 column factor, 629 critical thinking, 634–636 Software/calculator results (continued)

RkJQdWJsaXNoZXIy NjM5ODQ=