Attached is an Excel document containing three data sets. The first spreadsheet is a training set of 20 subjects (columns) by 20,000 observations (rows). The training subjects are divided into two discrete classes of 10 subjects each. We wish to have a contractor develop a classifier on this training set and validate it against the second spreadsheet (the validation set). Finally, the contractor should apply the classification method to each of the 300 unknown subjects in the third spreadsheet and assign each one to one of the two classes found in the first two spreadsheets.
In your proposal, state the type of classifier that you propose to build. The final deliverables are documentation of the classification method, including any software developed for this contest; a report on the validation accuracy of the classifier; and a class prediction for every unknown subject.
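
For orientation only, the train / validate / predict workflow described above might be sketched as follows. This is a minimal illustration, not a required or preferred method: the nearest-centroid rule is an arbitrary placeholder classifier, the data are synthetic stand-ins generated in the code rather than values from the attached spreadsheets, and every name (make_sheet, fit_centroids, the validation set size, and so on) is hypothetical. It also assumes that the validation spreadsheet comes with known class labels. The one detail taken from the brief is the layout: subjects are columns and observations are rows, so each sheet is transposed before use.

```python
import random


def transpose(rows):
    """Turn an observations-by-subjects table into one feature vector per subject."""
    return [list(col) for col in zip(*rows)]


def fit_centroids(subjects, labels):
    """Return the mean feature vector of each class."""
    centroids = {}
    for label in set(labels):
        members = [s for s, l in zip(subjects, labels) if l == label]
        centroids[label] = [sum(vals) / len(members) for vals in zip(*members)]
    return centroids


def predict(centroids, subject):
    """Assign a subject to the class with the nearest centroid (squared Euclidean)."""
    def dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(subject, centroid))
    return min(centroids, key=lambda label: dist(centroids[label]))


def accuracy(centroids, subjects, labels):
    """Fraction of subjects whose predicted class matches the known label."""
    hits = sum(predict(centroids, s) == l for s, l in zip(subjects, labels))
    return hits / len(labels)


def make_sheet(rng, n_rows, labels, shift=3.0):
    """Synthetic stand-in for one spreadsheet: rows are observations, columns are subjects."""
    return [[rng.gauss(shift * l, 1.0) for l in labels] for _ in range(n_rows)]


rng = random.Random(0)
n_rows = 50  # the real sheets have 20,000 rows; kept small so the sketch runs quickly

# Training sheet: 20 subjects, 10 per class (class names 0 and 1 are placeholders).
train_labels = [0] * 10 + [1] * 10
train_subjects = transpose(make_sheet(rng, n_rows, train_labels))
centroids = fit_centroids(train_subjects, train_labels)

# Validation sheet: the size and the availability of labels are assumptions.
val_labels = [0, 1] * 5
val_subjects = transpose(make_sheet(rng, n_rows, val_labels))
val_acc = accuracy(centroids, val_subjects, val_labels)

# Unknown sheet: 300 subjects. The hidden labels exist only to generate the
# synthetic data and are never shown to the classifier.
hidden_labels = [rng.randint(0, 1) for _ in range(300)]
unknown_subjects = transpose(make_sheet(rng, n_rows, hidden_labels))
predictions = [predict(centroids, s) for s in unknown_subjects]

print("validation accuracy:", val_acc)
print("predicted class counts:", {c: predictions.count(c) for c in sorted(set(predictions))})
```

A real submission would read the three spreadsheets instead of generating data, and with 20,000 observations per subject and only 20 training subjects it would likely need some form of feature selection or regularization, which this sketch omits.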