جستجوی مقالات مرتبط با کلیدواژه "rule extraction" در نشریات گروه "فناوری اطلاعات"
تکرار جستجوی کلیدواژه «rule extraction» در نشریات گروه «فنی و مهندسی»جستجوی rule extraction در مقالات مجلات علمی
-
Credit scoring is an important topic, and banks collect different data from their loan applicant to make an appropriate and correct decision. Rule bases are of more attention in credit decision making because of their ability to explicitly distinguish between good and bad applicants. The credit scoring datasets are usually imbalanced. This is mainly because the number of good applicants in a portfolio of loan is usually much higher than the number of loans that default. This paper use previous applied rule bases in credit scoring, including RIPPER, OneR, Decision table, PART and C4.5 to study the reliability and results of sampling on its own dataset. A real database of one of an Iranian export development bank is used and, imbalanced data issues are investigated by randomly Oversampling the minority class of defaulters, and three times under sampling of majority of non-defaulters class. The performance criterion chosen to measure the reliability of rule extractors is the area under the receiver operating characteristic curve (AUC), accuracy and number of rules. Friedman’s statistic is used to test for significance differences between techniques and datasets. The results from study show that PART is better and good and bad samples of data affect its results less.Keywords: Credit Scoring, Banking Industry, Rule Extraction, Imbalanced Data, Sampling
-
problem such as support vector machines, neural networks and rule based classifiers. Rule bases are more favorite in credit decision making because of their ability to explicitly distinguish between good and bad applicants in a credit scoring context, imbalanced data sets frequently occur as the number of good loans in a portfolio is usually much higher than the number of loans that default. This paper explores the suitability of RIPPER, One R, Decision table, PART and C4.5 for loan default prediction rule extraction. A real database of one of Iranian banks export loans is used and, class imbalance issues is investigated in its loan database by randomly Oversampling the minority class of defaulters, and three times under sampling of majority of non-defaulters class. The performance criterion chosen to measure this effect is the area under the receiver operating characteristic curve (AUC), accuracy measure andnumber of rules. Friedman‟s statistic is used to test for significance differences between techniques anddatasets. The results from study show that PART is the best classifier in all of balanced and imbalanceddatasets.Keywords: Scoring, Banking Industry, Rule extraction, Imbalanced data, Sampling
نکته
- نتایج بر اساس تاریخ انتشار مرتب شدهاند.
- کلیدواژه مورد نظر شما تنها در فیلد کلیدواژگان مقالات جستجو شدهاست. به منظور حذف نتایج غیر مرتبط، جستجو تنها در مقالات مجلاتی انجام شده که با مجله ماخذ هم موضوع هستند.
- در صورتی که میخواهید جستجو را در همه موضوعات و با شرایط دیگر تکرار کنید به صفحه جستجوی پیشرفته مجلات مراجعه کنید.