Exam
EXAM 
The participants who need credits must answer the following questions and send the results as an R script with comments to rachel.marcone@sib.swiss until latest Friday 15th of September 2023.
Birthwt dataset
-
Load the birthwt dataset from the MASS package and describe the variables that are reported in this dataset Explore the variability in this dataset and the relationship between the variables
-
Fit a model to predict birth weight using mother’s age
-
Check the regression assumptions
-
Get the confidence interval and the prediction interval of the regression model
-
Fit a model to predict birth weight using mother’s age, mother’s weight and smoking status Is this new model providing a better fit to the data ?
-
Check the regression assumptions
-
Fit a model to predict the variable birth weight below 2500 g using mother’s age what is the type of the response variable ? Can you use a simple linear regression model ? Why ?
-
Fit a model to find the parameters which explain the probability of birth weight below 2500 g ?
-
Check the regression assumptions
Fossil dataset
-
Load the fossil dataset from the SemiPar package and describe the variables that are reported in this dataset Explore the variability in this dataset and the relationship between the variables.
-
Fit a polynomial model to predict strontium ratio using age Use different degrees of polynomial model which model better fit your data ? Why ?
-
Fit a smoothing spline to predict strontium ratio using age Does it look better than the best polynomial model ? Why ?
-
Fit a local regression model (for instance using LOESS) to predict strontium ratio using age Does it look better than the smoothing spline ? Why ?
-
Load the dragons dataset from Gabriela K Hajduk and Liam Bailey Explore the variability in this dataset and the relationship between the variables
-
Fit a simple linear model to predict test score from body length and check the regression assumptions What could be wrong with this model ?
-
Fit a model that would be more appropriate for this dataset to predict test score from body length Interpret the results of your model Check the regression assumptions