I have a data set that contain 8 predictors and more than 15k observations for each predictor. I need some one can do a model selection process. And chose among models using the validation set approach and cross-validation.
I tried to run the model selection code but it seems it require a lot of memory and didn't work. (or maybe I did wrong)
Please have knowledge about statistical learning and how to use R before bidding. (and maybe 32GB ram and above is needed)
Data set is attached.
150k observations for each predictor