This lesson requires a premium membership to access.
Premium membership includes unlimited access to all courses, quizzes, downloadable resources, and future content updates.
While building any predictive model, it is important to first understand whether it is a classification or a regression problem. Let’s understand the difference between the two:
In a classification problem, we are trying to predict the class of a data point (discreet number of values). The Y variable that we are trying to predict generally comes in categorical form and has a finite number of classes. For example, we can classify a loan as Default or No Default. Or we can classify an image as a cat or a dog. The credit risk problem that we are trying to solve is a classification problem. We call it a binary classification when there are only one of the two classes to predict (Default or No Default - 0 or 1). If we have more than 2 classes, we call it a multi-classification problem. Such models are commonly referred to as "classifiers".