{"id":1630,"date":"2019-10-07T06:10:15","date_gmt":"2019-10-07T03:10:15","guid":{"rendered":"http:\/\/beeeye.com\/?p=1630"},"modified":"2021-04-11T00:04:22","modified_gmt":"2021-04-11T00:04:22","slug":"eyeonrisk-end-to-end-credit-risk-ml-modelling-platform","status":"publish","type":"post","link":"http:\/\/beeeye.com\/eyeonrisk-end-to-end-credit-risk-ml-modelling-platform\/","title":{"rendered":"EyeOnRisk End to End Credit Risk ML Modelling Platform"},"content":{"rendered":"
In previous blog posts, we discussed different aspects of our industry and how they may affect your current strategy for a credit risk modelling solution. We\u2019ve covered the main reasons to make the leap toward AI\/ML based modelling in 2019<\/a>. We\u2019ve also discussed in depth how organizations currently using traditional scoring<\/a> can benefit from machine learning techniques. Finally, we\u2019ve discussed some of the problems faced by banks that haven\u2019t adopted ML yet<\/a>.<\/p>\n In this blog post, we get into the technicalities and present more detail on how the EyeOnRisk credit risk modelling platform can help bridge the gap to a more accurate, structured and streamlined modelling process.<\/p>\n There\u2019s a profound difference between vertical platforms and horizontal platforms. These days, many FIs structure their credit risk solutions in the more traditional way, as a collection of different tools. Each tool may be extremely capable in its own vertical; for the credit risk problem, however, its wide range of capabilities may not be relevant or applicable.<\/p>\n This is why it makes more sense to address the credit risk modelling effort with a holistic solution. Such a solution can cover the entire lifespan of a model, from creation to deployment and re-evaluation. On close examination, such platforms are composed of several built-in, integrated components, which include the following:<\/p>\n When all these parts are integrated into the same platform, new models or improvements to existing ones can be turned around quickly.<\/p>\n Let\u2019s dive into each one of these parts and see what they look like on the EyeOnRisk platform.<\/p>\n The platform allows an unlimited number of data sources to be added through the UI. 
This enables the modeller to progress quickly in the modelling project without requiring too much support from other functions in the organization.<\/p>\n The platform\u2019s UI is straightforward, and you\u2019ll be working on your own data panel after clicking only a few buttons. As the data is gathered, an immediate glimpse of the data itself is provided, along with some statistics that can help determine how relevant the data is.<\/p>\n Many organizations are exploring the inclusion of alternative data<\/a> in their models. While numerous paid data providers are available, the issue of integration is a limiting factor. The EyeOnRisk platform addresses this with easy UI based support for external data sources. The platform can also be readily adapted to add new external data sources and APIs, as required by the customer.<\/p>\n Data acquired from the API is presented to the user as though it were derived from an internal table in the data warehouse.<\/p>\n Experienced credit risk modelling teams will attest that model lift depends heavily on comprehensive data preparation and feature engineering<\/a>. This is also the most time-consuming task when building a credit risk model. The platform facilitates this phase by offering an easy-to-use interface. This enables you to create new data transformations and explore their contribution to the model quickly and easily.<\/p>\n Missing values can be imputed quickly, either manually or automatically:<\/p>\n In many cases, a combination of several raw features will yield a more accurate model than the raw features by themselves. Coming up with good ideas for new features requires years of experience and deep domain knowledge. 
However, trying out an idea shouldn\u2019t be difficult, for instance adding a binning function for the AGE column:<\/p>\n The platform offers state-of-the-art technology that automatically calculates a large number of candidate features which can be integrated into your model. To use this capability, the user can simply launch one of the feature generation search algorithms and let the system recommend some useful features:<\/p>\n When the data is sufficiently clean to start modelling, adding a new machine learning experiment is simple. Through the UI, you can choose the type of algorithm to use, including logistic regression, random forests, boosting, bagging, SVM, decision trees and more. For each algorithm, you can tweak its parameters or select the input parameters.<\/p>\n You can run one or many experiments simultaneously and compare the results when they finish. Each experiment result screen provides all the information needed to comprehensively assess the effectiveness of the experiment. The following screenshot presents the ROC and confusion matrix parts. Additional information (not shown) includes feature importance and selected grid search parameters.<\/p>\n When work on the model has been completed and approved by all parties, the next step is to make it available to the various loan origination systems in the organization. Since the platform stores all the setup information needed for a Flow (as expected from an end-to-end platform), the actual deployment is a much simpler and quicker process. The deployment wizard allows you to choose the Flow to deploy with the exact model you wish to use in production. The API endpoint is then generated and exposed in the internal network:<\/p>\n When the model runs in production, it\u2019s extremely important to carefully monitor the quality of the input data. It\u2019s a common pitfall to use a model with inappropriate input data. 
This results in major inaccuracies and problems with the performance of the model. Here, the platform assists by automatically monitoring the statistical properties of the data fed into the model in production. If a significant change is detected relative to the data used during training, the system raises an alert, allowing early detection of data pipeline problems:<\/p>\n We\u2019ve shown some of the basic steps in creating a robust machine learning credit risk model which can be easily deployed in production. The process doesn\u2019t have to be overly complicated when you use a holistic platform such as EyeOnRisk. To become more familiar with the platform, we invite you to contact us<\/strong><\/a> and schedule a demo.<\/p>\n<\/div><\/div>What makes an End to End Credit Risk Platform?<\/h2>\n
\n
Acquiring Data for Credit Risk Models (Internal, External or Other)<\/h2>\n
Using External, Alternative Data in Credit Risk Models<\/h3>\n
Dealing with Missing Data (Imputation) and Data Wrangling<\/h2>\n
Automatic Data Completion (Imputation)<\/h3>\n
Feature Generation<\/h3>\n
Automatic Feature Generation<\/h3>\n
Modelling Experimentation (using Machine Learning or other methods)<\/h2>\n
Model Deployment with API Support<\/h2>\n
Monitoring Machine Learning Models in Production<\/h2>\n
Conclusion<\/h2>\n