Fisher's Iris Data

SUMMARY:

The iris data set is an array containing 4 measurements on 50 flowers from each of 3 species of iris. Sepal length and width, and petal length and width, are measured in centimeters. The species are Setosa, Versicolor, and Virginica.

The iris.df data set is the same iris data, stored as a data frame. The first four columns are the measurements, named "Sepal.Length", "Sepal.Width", "Petal.Length" and "Petal.Width". A fifth column, "Species", is a factor that identifies the species: "setosa", "versicolor", "virginica". The iris.df data set matches the iris data set in R.
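
As a rough illustration of how the two layouts relate, the following Python sketch flattens an array-style layout into data-frame-style rows. It is not the actual data: every measurement is a placeholder 0.0. The 50 x 4 x 3 dimension order (flower, measurement, species) is an assumption, since the text above does not state it, and the names array3d and to_rows are hypothetical.

```python
# Sketch only: placeholder values (0.0), NOT the real iris measurements.
MEASUREMENTS = ["Sepal.Length", "Sepal.Width", "Petal.Length", "Petal.Width"]
SPECIES = ["setosa", "versicolor", "virginica"]
N_FLOWERS = 50  # flowers per species

# Assumed array layout: array3d[flower][measurement][species], i.e. 50 x 4 x 3.
array3d = [[[0.0 for _ in SPECIES] for _ in MEASUREMENTS]
           for _ in range(N_FLOWERS)]


def to_rows(arr):
    """Flatten the 3-D layout into one row per flower, species by species."""
    rows = []
    for s, species in enumerate(SPECIES):
        for f in range(len(arr)):
            # Four measurement columns, in the order listed in MEASUREMENTS.
            row = {name: arr[f][m][s] for m, name in enumerate(MEASUREMENTS)}
            # Fifth column identifies the species.
            row["Species"] = species
            rows.append(row)
    return rows


rows = to_rows(array3d)
print(len(rows))      # 3 species x 50 flowers = 150 rows
print(list(rows[0]))  # the five column names
```

In this sketch the rows come out grouped by species (all setosa rows first, then versicolor, then virginica), giving 150 rows of five columns each.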

R. A. Fisher, "The Use of Multiple Measurements in Taxonomic Problems", Annals of Eugenics, 7, Part II, 1936, pp. 179-188. Republished by permission of Cambridge University Press.

The data were collected by Edgar Anderson, "The irises of the Gaspe Peninsula", Bulletin of the American Iris Society, 59, 1935, pp. 2-5.