Credit.Rd
A simulated data set containing information on ten thousand customers. The aim here is to predict which customers will default on their credit card debt.
Credit
A data frame with 10000 observations on the following 4 variables.
ID
Identification
Income
Income in $10,000's
Limit
Credit limit
Rating
Credit rating
Cards
Number of credit cards
Age
Age in years
Education
Number of years of education
Gender
A factor with levels Male
and Female
Student
A factor with levels No
and Yes
indicating whether the individual was a student
Married
A factor with levels No
and Yes
indicating whether the individual was married
Ethnicity
A factor with levels African American
, Asian
, and Caucasian
indicating the individual's ethnicity
Balance
Average credit card balance in $.
Simulated data, with thanks to Albert Kim for pointing out that this was omitted, and supplying the data and man documentation page on Oct 19, 2017
James, G., Witten, D., Hastie, T., and Tibshirani, R. (2013) An Introduction to Statistical Learning with applications in R, www.StatLearning.com, Springer-Verlag, New York
summary(Credit)#> ID Income Limit Rating #> Min. : 1.0 Min. : 10.35 Min. : 855 Min. : 93.0 #> 1st Qu.:100.8 1st Qu.: 21.01 1st Qu.: 3088 1st Qu.:247.2 #> Median :200.5 Median : 33.12 Median : 4622 Median :344.0 #> Mean :200.5 Mean : 45.22 Mean : 4736 Mean :354.9 #> 3rd Qu.:300.2 3rd Qu.: 57.47 3rd Qu.: 5873 3rd Qu.:437.2 #> Max. :400.0 Max. :186.63 Max. :13913 Max. :982.0 #> Cards Age Education Gender Student #> Min. :1.000 Min. :23.00 Min. : 5.00 Male :193 No :360 #> 1st Qu.:2.000 1st Qu.:41.75 1st Qu.:11.00 Female:207 Yes: 40 #> Median :3.000 Median :56.00 Median :14.00 #> Mean :2.958 Mean :55.67 Mean :13.45 #> 3rd Qu.:4.000 3rd Qu.:70.00 3rd Qu.:16.00 #> Max. :9.000 Max. :98.00 Max. :20.00 #> Married Ethnicity Balance #> No :155 African American: 99 Min. : 0.00 #> Yes:245 Asian :102 1st Qu.: 68.75 #> Caucasian :199 Median : 459.50 #> Mean : 520.01 #> 3rd Qu.: 863.00 #> Max. :1999.00#> #> Call: #> lm(formula = Balance ~ Student + Limit, data = Credit) #> #> Coefficients: #> (Intercept) StudentYes Limit #> -334.730 404.404 0.172 #>