HairEyeColor {datasets} | R Documentation |
Distribution of hair and eye color and sex in 592 statistics students.
HairEyeColor
A 3-dimensional array resulting from cross-tabulating 592 observations on 3 variables. The variables and their levels are as follows:
No | Name | Levels |
1 | Hair | Black, Brown, Red, Blond |
2 | Eye | Brown, Blue, Hazel, Green |
3 | Sex | Male, Female |
This data set is useful for illustrating various techniques for the analysis of contingency tables, such as the standard chi-squared test or, more generally, log-linear modelling, and graphical methods such as mosaic plots, sieve diagrams or association plots.
Snee, R. D. (1974), Graphical display of two-way contingency tables. The American Statistician, 28, 9–12.
Friendly, M. (1992), Graphical methods for categorical data. SAS User Group International Conference Proceedings, 17, 190–200. http://www.math.yorku.ca/SCS/sugi/sugi17-paper.html
Friendly, M. (1992), Mosaic displays for loglinear models. Proceedings of the Statistical Graphics Section, American Statistical Association, pp. 61–68. http://www.math.yorku.ca/SCS/Papers/asa92.html
chisq.test
,
loglin
,
mosaicplot
require(graphics) ## Full mosaic mosaicplot(HairEyeColor) ## Aggregate over sex: x <- apply(HairEyeColor, c(1, 2), sum) x mosaicplot(x, main = "Relation between hair and eye color")