DGHNL: A new deep genetic hierarchical network of learners for prediction of credit scoring
Authors:
- Paweł Pławiak,
- Moloud Abdar,
- Joanna Pławiak,
- Vladimir Makarenkov,
- U Rajendra Acharya
Abstract
Credit scoring (CS) is an effective and crucial approach used for risk management in banks and other financial institutions. It provides appropriate guidance on granting loans and reduces risks in the financial area. Hence, companies and banks are trying to use novel automated solutions to deal with CS challenge to protect their own finances and customers. Nowadays, different machine learning (ML) and data mining (DM) algorithms have been used to improve various aspects of CS prediction. In this paper, we introduce a novel methodology, named Deep Genetic Hierarchical Network of Learners (DGHNL). The proposed methodology comprises different types of learners, including Support Vector Machines (SVM), k-Nearest Neighbors (kNN), Probabilistic Neural Networks (PNN), and fuzzy systems. The Statlog German (1000 instances) credit approval dataset available in the UCI machine learning repository is used to test the effectiveness of our model in the CS domain. Our DGHNL model encompasses five kinds of learners, two kinds of data normalization procedures, two extraction of features methods, three kinds of kernel functions, and three kinds of parameter optimizations. Furthermore, the model applies deep learning, ensemble learning, supervised training, layered learning, genetic selection of features (attributes), genetic optimization of learners parameters, and novel genetic layered training (selection of learners) approaches used along with the cross-validation (CV) training-testing method (stratified 10-fold). The novelty of our approach relies on a proper flow and fusion of information (DGHNL structure and its optimization). We show that the proposed DGHNL model with a 29-layer structure is capable to achieve the prediction accuracy of 94.60% (54 errors per 1000 classifications) for the Statlog German credit approval data. It is the best prediction performance for this well-known credit scoring dataset, compared to the existing work in the field.
- Record ID
- CUTc4216f22bcf245f4962f17b241bcf5e8
- Publication categories
- ;
- Author
- Journal series
- Information Sciences, ISSN 0020-0255, e-ISSN 1872-6291
- Issue year
- 2020
- Vol
- 516
- Pages
- 401-418
- Other elements of collation
- schem.; tab.; wykr.; Bibliografia (na s.) - 416-417; Bibliografia (liczba pozycji) - 50; Oznaczenie streszczenia - Abstr.; Numeracja w czasopiśmie - Vol. 516
- Keywords in English
- credit scoring, machine learning, data mining, ensemble learning, deep learning, genetic algorithm, feature extraction and selection
- DOI
- DOI:10.1016/j.ins.2019.12.045 Opening in a new tab
- URL
- https://www.sciencedirect.com/science/article/pii/S0020025519311569?via%3Dihub Opening in a new tab
- Language
- eng (en) English
- License
- Score (nominal)
- 200
- Publication indicators
- Citation count
- 129
- Additional fields
- Indeksowana w: Web of Science, Scopus
- Uniform Resource Identifier
- https://cris.pk.edu.pl/info/article/CUTc4216f22bcf245f4962f17b241bcf5e8/
- URN
urn:pkr-prod:CUTc4216f22bcf245f4962f17b241bcf5e8
* presented citation count is obtained through Internet information analysis, and it is close to the number calculated by the Publish or PerishOpening in a new tab system.