Fig. 1From: A site specific model and analysis of the neutral somatic mutation rate in whole-genome cancer dataWorkflow of the forward model selection procedure. The forward model selection is implemented on 2% of the data to determine the explanatory variables included in the final model. In each iteration of the model selection procedure, data tables are generated to summarize the site-specific annotations. The performance of the models is measured with the deviance loss obtained by cross-validation. The explanatory variable with the best performance is included in the set of variables for the next iteration. Parameter estimation for the final model is based on the remaining 98% of the dataBack to article page