In Sample Error: The error rate you get on the same data set you used to build your predictor. Sometimes called resubstitution error.
Out of Sample Error: The error rate you get on a new data set. Sometimes called generalization error.
Key ideas
- Out of sample error is what you care about
- In sample error \(<\) out of sample error
- The reason is overfitting
- Matching your algorithm to the data you have