Air quality data set for May 1973, from Chambers et al. (1983). The full data set consists of daily air quality readings from May 1, 1973 to September 30, 1973; only the values for May are included here. The data set serves as an example of the special treatment of missing values.

data(airmay, package="robustbase")

Format

A data frame with 31 observations on the following 4 variables.

X1

Solar radiation in Langleys in the frequency band 4000-7700 Angstroms from 0800 to 1200 hours at Central Park

X2

Average wind speed (in miles per hour) between 0700 and 1000 hours at La Guardia Airport

X3

Maximum daily temperature (in degrees Fahrenheit) at La Guardia Airport

Y

Mean ozone concentration (in parts per billion) from 1300 to 1500 hours at Roosevelt Island
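
Since the data set is presented as an example for the treatment of missing values, it can help to see where the missing entries are before fitting anything. The following is a minimal sketch, assuming the robustbase package is installed; the name `n_incomplete` is introduced here for illustration only.

```r
# Assumes the robustbase package is installed.
data(airmay, package = "robustbase")

dim(airmay)                # number of rows and columns of the data frame
colSums(is.na(airmay))     # number of missing values in each variable

# Rows that have at least one missing value (hypothetical helper name)
n_incomplete <- sum(!complete.cases(airmay))
n_incomplete
```

The count of incomplete rows should agree with the number of observations that `lm()` reports as deleted due to missingness.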

Source

P. J. Rousseeuw and A. M. Leroy (1987) Robust Regression and Outlier Detection; Wiley, p.86, table 6.

Examples

data(airmay)
summary(lm.airmay <- lm(Y ~ ., data=airmay))
#> 
#> Call:
#> lm(formula = Y ~ ., data = airmay)
#> 
#> Residuals:
#>     Min      1Q  Median      3Q     Max 
#> -25.362 -12.785  -2.170   9.445  55.433 
#> 
#> Coefficients:
#>              Estimate Std. Error t value Pr(>|t|)   
#> (Intercept) -79.99271   46.81655  -1.709  0.10299   
#> X1           -0.01868    0.03628  -0.515  0.61219   
#> X2           -1.99577    1.14092  -1.749  0.09558 . 
#> X3            1.96332    0.66368   2.958  0.00777 **
#> ---
#> Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
#> 
#> Residual standard error: 18.02 on 20 degrees of freedom
#>   (7 observations deleted due to missingness)
#> Multiple R-squared:  0.4612,	Adjusted R-squared:  0.3804 
#> F-statistic: 5.706 on 3 and 20 DF,  p-value: 0.005445
#> 


airmay.x <- data.matrix(airmay[,1:3])
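
The least squares fit above silently drops the incomplete rows. A short sketch of how to inspect that behaviour is given here, assuming robustbase is installed and the session uses R's default `na.action` option (`na.omit`); the names `fit`, `dropped` and `x_complete` are illustrative and not part of the package.

```r
# Assumes the robustbase package is installed and the default
# options(na.action = "na.omit") is in effect.
data(airmay, package = "robustbase")
fit <- lm(Y ~ ., data = airmay)

nobs(fit)                   # observations actually used in the fit
dropped <- na.action(fit)   # row indices removed because of missing values
length(dropped)

# Complete-case version of the predictor matrix built in the example
airmay.x <- data.matrix(airmay[, 1:3])
x_complete <- airmay.x[complete.cases(airmay.x), , drop = FALSE]
anyNA(x_complete)           # FALSE: no missing values remain
```

Note that `x_complete` is filtered on the predictors only, so it may keep rows in which only `Y` is missing; `dropped` instead reflects rows with a missing value in any model variable.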