12-05-2012, 11:26 AM
DATA WAREHOUSING & DATA MINING AN IMPARTING EFFICIENCY OF BANKING AND FINANCE SECTORS WITH DATA MINING
Data Warehousing & Data Mining.doc (Size: 305.5 KB / Downloads: 44)
Abstract
Currently, huge electronic data repositories are being maintained by banks and other financial institutions. Valuable bits of information are embedded in these data repositories. The huge size of these data sources make it impossible for a human analyst to come up with interesting information (or patterns) that will help in the decision making process. A number of commercial enterprises have been quick to recognize the value of this concept, as a consequence of which the software market itself for data mining is expected to be in excess of 10 billion USD. This paper is intended for those who would like to get aware of the possible applications of data mining to enhance the performance of some of their core business processes. In this paper discussion is about the broad areas of application, like risk management, portfolio management, trading, customer profiling and customer care, where data mining techniques can be used in banks and other financial institutions to enhance their business performance.
INTRODUCTION:
As knowledge is becoming more and more synonymous to wealth creation and as a strategy plan for competing in the market place can be no better than the information on which it is based, the importance of knowledge and information in today’s business can never be seen as an exogenous factor to the business. Organizations and individuals having access to the right information at the right moment, have greater chances of being successful in the epoch of globalization and cut-throat competition.
Business Intelligence focuses on discovering knowledge from various electronic data repositories, both internal and external, to support better decision making. Data mining techniques become important for this knowledge discovery from databases. In recent years, business intelligence
systems have played pivotal roles in helping organizations to fine tune the business goals such as improving customer retention, market penetration, profitability and efficiency. In most cases, these insights are driven by analyses of historical data. Global competitions, dynamic markets, and rapidly decreasing cycles of technological innovation provide important challenges for the banking and finance industry.
Worldwide just-in-time availability of information allows enterprises to improve their flexibility. In financial institutions considerable developments in information technology have led to huge demand for continuous analysis of resulting data.
Data mining can contribute to solving business problems in banking and finance by finding patterns, causalities, and correlations in business information and market prices that are not immediately apparent to managers because the volume data is too large or is generated too quickly to screen by experts. The managers of the banks may go a step further to find the sequences, episodes and periodicity of the transaction behaviour of their customers which may help them in actually better segmenting, targeting, acquiring, retaining and maintaining a profitable customer base. Business Intelligence and data mining techniques can also help them in identifying various classes of customers and come up with a class based product and/or pricing approach that may garner better revenue management as well.
The broad categories of application of Data Mining and Business Intelligence techniques in
the banking and financial industry vertical may be viewed as follows:
Risk Management
Managing and measurement of risk is at the core of every financial institution. Today’s major challenge in the banking and insurance world is therefore the implementation of risk management systems in order to identify, measure, and control business exposure. Here credit and market risk present the central challenge, one can observe a major change in the area of how to measure and deal with them, based on the advent of advanced database and data mining technology.( Other types of risk is also available in the banking and finance i.e., liquidity risk, operational risk, or concentration risk. )
Today, integrated measurement of different kinds of risk (i.e., market and credit risk) is moving into focus. These all are based on models representing single financial instruments or risk factors, their behaviour, and their interaction with overall market, making this field highly important topic of research.
Financial Market Risk
For single financial instruments, that is, stock indices, interest rates, or currencies, market risk measurement is based on models depending on a set of underlying risk factor, such as interest rates, stock indices, or economic development. One is interested in a functional form between instrument price or risk and underlying risk factors as well as in functional dependency of the risk factors itself.
Today different market risk measurement approaches exist. All of them rely on models representing single instrument, their behaviour and interaction with overall market. Many of this can only be built by using various data mining techniques on the proprietary portfolio data, since data is not publicly available and needs consistent supervision.
Credit Risk
Credit risk assessment is key component in the process of commercial lending. Without it the lender would be unable to make an objective judgement of weather to lend to the prospective borrower, or if how much charge for the loan. Credit risk management can be classified into two basic groups:
Credit scoring/credit rating: Assignment of a customer or a product to risk level. (i.e., credit approval)
Behaviour scoring/credit rating migration analysis. Valuation of a customer‘s or product’s probability of a change in risk level within a given time. (i.e., default rate volatility)
In commercial lending, risk assessment is usually an attempt to quantify the risk of loss to the lender when making a particular lending decision. Here credit risk can quantify by the changes of value of a credit product or of a whole credit customer portfolio, which is based on change in the instrument’s ranting, the default probability, and recovery rate of the instrument in case of default. Further diversification effects influence the result on a portfolio level. Thus a major part of implementation and care of credit risk management system will be a typical data mining problem: the modelling of the credit instrument’s value through the default probabilities, rating migrations, and recovery rates.
Three major approaches exist to model credit risk on the transaction level: accounting analytic approaches, statistical prediction and option theoretic approaches. Since large amount of information about client exist in financial business, an adequate way to build such models is to use their own database and data mining techniques, fitting models to the business needs and the business current credit portfolio.
Portfolio Management
Risk measurement approaches on an aggregated portfolio level quantify the risk of a set of instrument or customer including diversification effects. On the other hand, forecasting models give an induction of the expected return or price of a financial instrument. Both make it possible to manage firm wide portfolio actively in a risk/return efficient manner. The application of modern risk theory is therefore within portfolio theory, an important part of portfolio management.
With the data mining and optimization techniques investors are able to allocate capital across trading activities to maximise profit or minimise risk. This feature supports the ability to generate trade recommendations and portfolio structuring from user supplied profit and risk requirement.
With data mining techniques it is possible to provide extensive scenario analysis capabilities concerning expected asset prices or returns and the risk involved.
With this functionality, what if simulations of varying market conditions e.g. interest rate and exchange rate changes) cab be run to assess impact on the value and/or risk associated with portfolio, business unit counterparty, or trading desk. Various scenario results can be regarded by considering actual market conditions. Profit and loss analyses allow users to access an asset class, region, counterparty, or custom sub portfolio can be benchmarked against common international benchmarks.
Trading
For the last few years a major topic of research has been the building of quantitative trading tools using data mining methods based on past data as input to predict short-term movements of important currencies, interest rates, or equities. The goal of this technique is to spot times when markets are cheap or expensive by identifying the factor that are important in determining market returns. The trading system examines the relationship between relevant information and piece of financial assets, and gives you buy or sell recommendations when they suspect an under or overvaluation. Thus, even if some traders find the data mining approach too mechanical or too risky to be used systematically, they may want to use it selectively as further opinion.
Trading is based on the idea of predicting short term movements in the price/value of a product (currency/equity/interest rate etc.). With a reasonable guesstimate in place one may trade the product if he/she thinks it is going to be overvalued or undervalued in the coming future. Trading traditionally is done based on the instinct of the trader. If he/she thinks the product is not priced properly he/she may sell/buy it. This instinct is usually based on past experience and some analysis based on market conditions. However, the number of factors that even the most expert of traders can account for are limited. Hence, quite often these predictions fail.
The price of a financial asset is influenced by a variety of factors which can be broadly classified as economic, political and market factors. Participants in a market observe the relation between these factors and the price of an asset, account for the current value of these factors and predict the future values to finally arrive at the future value of the asset and trade accordingly. Quite often by the time a trained eye detects these favourable factors, many others may have discovered the opportunity, decreasing the possible revenues otherwise. Also these factors in turn may be related to several other factors making prediction difficult.
Data mining techniques are used to discover hidden knowledge, unknown patterns and new rules from large data sets, which may be useful for a variety of decision making activity. With the increasing economic globalization and improvements in information technology, large amounts of financial data are being generated and stored.
These can be subjected to data mining techniques to discover hidden patterns and obtain predictions for trends in the future and the behavior of the financial markets. With the immediacy offered by data mining, latest data can be mined to obtain crucial information at the earliest. This in turn would result in an improved market place responsiveness and awareness leading to reduced costs and increased revenue.
Advancements made in technology have enabled to create faster and better prediction systems. These systems are based on a combination of data mining techniques and artificial intelligence methods like Case Based Reasoning (CBR) and Neural Networks (NN). A combination of such a forecasting system together with a good trading strategy offers tremendous opportunities for massive returns. The value of a financial asset is dependent on both macroeconomic and microeconomic variables and this data is available in a variety of disparate formats. NN and CBR techniques can be applied extensively for predicting these financial variables. NN are characterized by learning capabilities and the ability to improve performance over time. Also NN can generalize i.e. recognize new objects which may be similar but not exactly identical to previous objects. NN with their ability to derive meaning from imprecise data can be used to detect patterns which are otherwise too complex to be detected by humans. NN act as experts in the area that they have been trained to work in. these can be used to provide predictions for new situations and work in real time. Thus, historic data available about financial markets and the various variables can be used to train NN to simulate the market.
CBR methodology is based on reasoning from past performances. It uses a large repository of data stored as cases which would include all the market variables in this case. When a new case is fed in (in the form of a case containing the concerned variables), the CBR algorithm predicts the performance/result of this case based on the cases it has in its repository. Data mining techniques can be used to detect hidden patterns in these cases which may then be used for further decision making. CBR methods can be used in real time which makes analysis really quick and helps in real time decision making resulting in immediate profits. Thus data mining and business intelligence (CBR and NN) techniques may be used in conjunction in financial markets to predict market behaviour and obtain patterned behaviour to influence decision making.
Customer Profiling and Customer Relationship Management
Banks have many and huge databases containing transactional and other details of its customers. Valuable business information can be extracted from these data stores. But it is unfeasible to support analysis and decision making using traditional query languages; because human analysis breaks down with volume and dimensionality.
Traditional statistical methods do not have the capacity and scale to analyze these data, and hence modern data mining methodologies and tools are increasingly being used for decision making process not only in banking and financial institutions, but across the industries.
Customer profiling is a data mining process that builds customer profiles of different groups from the company’s existing customer database. The information obtained from this process can be used for different purposes, such as understanding business performance, making new marketing initiatives, market segmentation, risk analysis and revising company customer policies. The advantage of data mining is that it can handle large amounts of data and learn inherent structures and patterns in data. It can generate rules and models that are useful in enabling decisions that can be applied to future cases.
Customer Behaviour Modeling (CBM) or customer profiling is a tool to predict the future value of an individual and the risk category to which he belongs to based on his demographic characteristics, life-style and previous behaviour. This helps to focus on customer retention. The two important facts that have important implication in selecting customer profiling methods are:
Profiling information can consist of many variables (or dozens of them).
Majority of them are categorical variables (or non-numeric variables or nominal variables).
Customer profiling is to characterize features of special customer groups. Many data mining techniques search profiles of special customer groups systematically using Artificial Intelligence techniques. They generate accurate profiles based on beam search and incremental learning techniques.
Customer profiling also uses many predictive modeling methods. Predictive modelling techniques applicable can be categorized into two broad approaches. They depend on the type of predicted information or variables, also called target variables. If the type of predicted values is categorical, classification techniques is preferred to be used.
Classification Methods:
In this approach, risk levels are organized into two categories based on past default history. For example, customers with past default history can be classified into "risky" group, whereas the rest are placed as "safe" group. Using this categorization information as target of prediction, Decision Tree and Rule Induction techniques can be used to build models that can predict default risk levels of new loan applications