Already have access? Sign in here
Already have access? Sign in here
Ask questions about this lesson and get instant answers.
The goal of machine learning is not to perform well on the data you have. It is to perform well on data you have not yet seen. This distinction is so important that it deserves its own name: generalization.
Consider a fraud detection model trained on historical transaction data. The model's value is not in correctly classifying transactions you already know are fraudulent. Those cases are already resolved. The model's value is in correctly identifying new fraudulent transactions as they occur in the future.
The goal of machine learning is not to perform well on the data you have. It is to perform well on data you have not yet seen. This distinction is so important that it deserves its own name: generalization.