29-08-2016, 12:19 PM
1451386055-jusreport.pdf (Size: 4.55 MB / Downloads: 7)
ABSTRACT
New requirements are arising in environments where we have higher volumes of data
with high operation rates and also need to store these data for future usages. As the time passes
the amount of data need to store is increase day by day. As there are so many data sources
are available when the user requirements are complex and also related to some Business logic.
Data warehouses are a subject oriented, integrated, time variant and nonvolatile collection of
data in support of managements decision making process. To retrieve data from one or more
data sources and put it into data warehouses can be done using a technology called as ETL
process. As the data integration is an important user needed functionality so there are so many
ETL tools are available in the Business industries. Informatica is an well developed and also
highly efficient ETL tool which used now widely among the Business industries. The SQL
based data sorting RDBMS are not efficient when it handle with large amount of data such
as in case of internet,so to overcome these disadvantage now NOSQL based datastorage is
available. Among them MongoDB is an example for this type,by using MongoDB along with
informatica is 5 times more efficient than if they used alone.
INTRODUCTION
New requirements are arising in environments where we have higher volumes of data
with high operation rates and also need to store these data for future usages. In recent years,
a growing number of companies have adopted various types of nonrelational database for the
data storage.
Data warehouses are a subject oriented, integrated, time variant and nonvolatile collection
of data in support of managements decision making process. A data warehouse can be used
to analyze a particular subject area. The data may be retrieved from single or more databases
using Data extraction, transformation and Loading process and ETL processing tools. After
ETL processing these data may store with in a data warehouses. In Business Intelligence and
analytic the flow of data starting with its acquisition from source systems through transformation,
consolidation, analysis and reporting. As the ETL processing rate are increased day
by day various type of Informatica tools are available, among them Informatica PowerCenter
Express is a most widely used ETL tool. PowerCenter Express is Informaticas market leading
ETL tool, and also if the user is not an expert in data integration then also the user is able to
integrate the data and also get solution for that.
RDBMS are widely used for the most application in which storage and retrieve of data.
RDBMS work best when they handle a limited amount of data. Handling a huge volume of data
like internet was inefficient for it. To overcome this disadvantage NOSQL is used. MongoDB is
a NOSQL database which have ease of use and performance, which store the data as documents.
Informatica + Mongo DB is a powerful combination that increases developer productivity.