At its core, data integration is the process of combining data from different sources so that users and applications see it as coming from a single source. There are many ways to do this, and there are just as many possible applications too. It can be used for everything from research to marketing and corporate mergers.
The process has to be business-oriented, instead of being a purely IT initiative. In other words, it has to be a solution to enhance business processes rather than as a better or cheaper way to store and manage information. Many systems based on this principle are already in use and quite common in the corporate world.
One such example is a CRM system, which integrates information from different departments such as customer service, social media and sales/marketing. This enables the sales and marketing people in targeting an existing customer based on previous purchasing history and other contacts with the company that are listed in the profile. Another case where integrating an information storage system becomes essential is when two companies enter into a merger.
It is possible to accomplish this process on both the application and middleware layers. It may also be done by physical warehousing in a completely new system, or through virtual integration where no new repository is actually created. Let's consider these choices one by one, so that it becomes clear how each one works and where it can be used.
If the application has the built-in logic to extract and combine information stored in different sources, there is no need to create a new centralized database. The same applies for a solution on a middleware layer. In this case, the logic in the middleware will provide every application with whatever information it needs from any and all sources at the back end.
Virtual integration is the simplest method to create an integrating tool which does require creation of a new storage system. Under this method, a set of pre-defined queries will access required information from separate sources. For instance, consider a case where a customer profile needs to be seen. The query extracts records from all the sources based on the main index field, which is usually a customer ID. The extracted information is then presented to the user in a single and unified view.
Warehousing is a completely new system which can siphon and store information from any number of sources. This is mostly done only at an enterprise level, where vast amounts of data coming in from all of a company's departments and locations can be collected, stored and managed in massive data centers. This centralized system can then be used by applications and users to gain enterprise-wide access, reporting and analysis capability.
The choice of a data integration technique and the scope of the process have to be made after careful consideration of a whole lot of issues. The intended usage and the number and type of sources are primary factors, along with the cost. Not to mention the security and backup implications. Other related issues to consider include migrations, synchronization and MDM (master data management).
The process has to be business-oriented, instead of being a purely IT initiative. In other words, it has to be a solution to enhance business processes rather than as a better or cheaper way to store and manage information. Many systems based on this principle are already in use and quite common in the corporate world.
One such example is a CRM system, which integrates information from different departments such as customer service, social media and sales/marketing. This enables the sales and marketing people in targeting an existing customer based on previous purchasing history and other contacts with the company that are listed in the profile. Another case where integrating an information storage system becomes essential is when two companies enter into a merger.
It is possible to accomplish this process on both the application and middleware layers. It may also be done by physical warehousing in a completely new system, or through virtual integration where no new repository is actually created. Let's consider these choices one by one, so that it becomes clear how each one works and where it can be used.
If the application has the built-in logic to extract and combine information stored in different sources, there is no need to create a new centralized database. The same applies for a solution on a middleware layer. In this case, the logic in the middleware will provide every application with whatever information it needs from any and all sources at the back end.
Virtual integration is the simplest method to create an integrating tool which does require creation of a new storage system. Under this method, a set of pre-defined queries will access required information from separate sources. For instance, consider a case where a customer profile needs to be seen. The query extracts records from all the sources based on the main index field, which is usually a customer ID. The extracted information is then presented to the user in a single and unified view.
Warehousing is a completely new system which can siphon and store information from any number of sources. This is mostly done only at an enterprise level, where vast amounts of data coming in from all of a company's departments and locations can be collected, stored and managed in massive data centers. This centralized system can then be used by applications and users to gain enterprise-wide access, reporting and analysis capability.
The choice of a data integration technique and the scope of the process have to be made after careful consideration of a whole lot of issues. The intended usage and the number and type of sources are primary factors, along with the cost. Not to mention the security and backup implications. Other related issues to consider include migrations, synchronization and MDM (master data management).
About the Author:
Peggie K. Lambert loves working and studying the world of data integration. If you are looking to learn about data integration solutions then she recommends you refer to www.liaison.com.