"If it walks like a duck and it quacks like a duckthen it's probably a duck." ,
Image: http://bit.ly/2qJJ0mA |
a) Default - includes the names and attributes of people who we already know have defaulted on their credit card
b)
We then use
Here's how it works:
Image: KDNuggets |
What data are we going to use?
The Default of Credit Card Clients data set comes from the UCI Machine Learning Repository. Here's the descriptionThe case of customers' default payments in Taiwan in 2016. It has 30,000 rows and 24 columns. The variables are as follows:
X1: Amount of the given credit (NT dollar): it includes both the individual consumer credit and his/her family (supplementary) credit.
X2: Gender (1 = male; 2 = female).
X3: Education (1 = graduate school; 2 = university; 3 = high school; 4 = others).
X4: Marital status (1 = married; 2 = single; 3 = others).
X5: Age (year).
X6 - X11: History of past payment. We tracked the past monthly payment records (from April to September, 2005) as follows:
X6 = the repayment status in September, 2005;
X7 = the repayment status in August, 2005;
X11 = the repayment status in April, 2005. The measurement scale for the repayment status is: -1 = pay
X12-X17: Amount of bill statement (NT dollar). X12 = amount of bill statement in September, 2005; X13 = amount of bill statement in August, 2005;
X17 = amount of bill statement in April, 2005.
X18-X23: Amount of previous payment (NT dollar).
X18 = amount paid in September, 2005;
X19 = amount paid in August, 2005;
X23 = amount paid in April, 2005.
Comments
Post a Comment