Migrating From “Independent” Data Marts

Article Summary: Independent data marts, often developed in isolation by different teams, pose significant challenges to organizations due to their lack of centralized architecture, leading to redundant data and processing inefficiencies. This article discusses the migration from these independent data marts to a structured data warehouse solution, highlighting the flaws in their current setup and the benefits of adopting a more architected approach for improved data management and business intelligence.

Last Updated: June 10, 2026

A severe disease has spread to epidemic proportions throughout our society. This disease is particularly dangerous because its effects are not readily identifiable at the time of infection. However, if this condition goes untreated it can be debilitating and even terminal. This disease is not hepatitis, but rather “independent” data marts. While this imagery may seem a bit dramatic, unfortunately it reflects the reality in many of today’s companies.

This article will address how to migrate from independent data marts into an architected data warehouse solution. It will address the characteristics of independent data marts, the flaws in their architecture, and the reasons why they exist.

Introduction to Data Marts

Data marts are a crucial component of modern data architecture, providing a targeted subset of data to support specific business needs. They enable organizations to manage and analyze data effectively, making it easier for people to obtain the data they need to make informed decisions. By focusing on specific areas such as sales, marketing, or finance, data marts streamline data access and enhance the efficiency of business operations. This targeted approach allows departments to quickly retrieve and analyze relevant data, leading to more agile and informed decision-making processes.

Definition of a Data Mart

A data mart is a component of a larger data warehouse, typically created to assist a particular business function, such as sales, marketing, or finance. Containing a subset of the data that is kept in the data warehouse, it is built to provide quick and easy access to the data that is most relevant to the specific needs of a department or a business unit. By isolating specific data sets, data marts reduce the complexity and volume of data that users need to sift through, thereby improving query performance and making data retrieval more efficient. This focused approach ensures that each department has access to the most pertinent information available, information that is tailored to each department’s unique data requirements.

Data Mart vs. Data Warehouse vs. Data Lake

To understand the role of independent data marts, it’s essential to differentiate them from other data storage systems such as a data warehouse and a data lake. This is how they break down:

Data Marts: A data mart involves extracting a subject-oriented subset of data from a centralized data warehouse or operational systems. These are tailored for specific business departments like marketing or sales. These marts are less complex and easier to implement compared to an entire data warehouse, often focusing on summarized data and business intelligence tools to serve specific needs efficiently.
Data Warehouses: These systems act as a central data repository, designed to integrate and manage enterprise-wide structured data for full data access and analysis. While offering scalability and comprehensive data integration, they are resource-intensive and require robust planning to avoid data warehouse fails.
Data Lakes: Unlike data marts, data lakes store vast amounts of raw data from external sources and operational databases. Because this data is kept in its raw form and is often unstructured, it allows flexibility for future analyses and big data trends. They are ideal for data engineers and analysts exploring new data models.

Each system has a unique purpose, with dependent data marts excelling in focused analyses, data warehouses supporting strategic insights, and data lakes catering to exploratory research.

Types of Data Marts

There are three main types of data marts: dependent data marts, independent data marts, and hybrid data marts. Each type has its own advantages and disadvantages, and the choice of which to use depends on the specific needs and requirements of the organization.

Dependent Data Marts: Created from an existing data warehouse, they draw data from the central data warehouse, ensuring consistency and integration across the organization. This type of data mart benefits from the robust architecture of the data warehouse, providing reliable and consistent data.
Independent Data Marts: These are standalone systems built independently of a data warehouse that are often created by individual departments to meet their specific needs. While they can be quicker to implement, they often lead to data redundancy and inconsistency across the organization.
Hybrid Data Marts: These combine elements of both dependent and independent data marts. They may draw data from a central data warehouse but also include data from other sources. This approach can offer flexibility and quick access to data while maintaining some level of consistency and integration.

Characteristics of Independent Data Marts

Independent data marts are characterized by several traits. First, each data mart is sourced directly from the operational systems without the structure of a data warehouse to supply the architecture necessary to sustain and grow the data marts. Second, these data marts are typically built independently from one another by autonomous teams. These teams usually apply varying tools, software, hardware, and processes. Possibly the most visually descriptive trait of a company that has constructed independent data marts is that once it maps out a process flow of its data warehousing environment (DWE), the flow will resemble a “spaghetti” chart (see Figure 1). What is most disturbing is the number of companies that have stated that this chart resembles their current DWE architecture.

Figure 1: Independent Data Mart Architecture

As we can see, this architecture is not an architecture at all. Instead, it is a series of “stovepipe” DWE systems. This architecture differs greatly from that of an architected data warehouse (see Figure 2).

Figure 2: Architected Decision Support / Business Intelligence System

The purpose of this article is to discuss independent data marts and the process for migrating from them to an architected solution; however, we will touch briefly on the topic of DWE architecture. We will not go into a detailed discussion of top-down vs. bottom-up approaches (we will save that topic for another article), except to say that the “classic” top-down approach is a more scalable and logical approach for constructing a DWE system.

It is surprising how often the top-down methodology is mistaken for a “galactic” approach. This is a misunderstanding since the top-down approach is best used iteratively and incrementally to build the DWE system. When used in this fashion the cost for building a data warehouse that feeds “dependent” data marts becomes highly comparable to the cost of building independent data marts.

Problems with Independent Data Marts

Redundant Data

As the number of independent data marts grows, the amount of redundant data begins to grow uncontrollably across the enterprise. This redundancy occurs because each of the independent data marts requires its own, typically duplicated copy of the detailed corporate data. Often a great deal of this detailed data is not required in the data marts, which typically provide summarized views.

It would be enlightening if a study were conducted to calculate the costs of maintaining unnecessary redundant data for Fortune 1000 companies. The end total would likely be in the billions of dollars in expenses and lost opportunity.

Redundant Processing

A data warehouse provides the architecture to centralize integration and cleansing activities common to all of the data marts of a company. Without the data warehouse, all of these integration and cleansing processes need to be duplicated for all of the independent data marts. This greatly increases the number of support staff required to maintain the DWE system, creating a particularly disastrous situation for most companies in light of today’s IT staffing shortage.

Separate teams typically will build each of the independent data marts in isolation of one another. As a result, these teams do not leverage the other’s standards, processes, knowledge, and lessons learned. This results in a great deal of rework and re-analysis.

These autonomous teams will commonly select differing tools, software, and hardware. This forces the enterprise to retain skilled employees to support each of these technologies. In addition, a great deal of financial savings is lost, as standardization on these tools does not occur. Often a software, hardware, or tool contract can be negotiated to provide considerable discounts for enterprise licenses, which can have multiple implementation phases to reduce immediate costs. These economies of scale can provide tremendous cost savings to the organization.

Scalability

Independent data marts directly read operational system files and/or tables, which greatly limits the DWE system’s ability to scale. For example, if a company has five independent data marts, it is likely that each data mart would require customer information. Therefore, there would be five separate extracts being pulled from the same customer tables in the operational system of record. Most operational systems have limited batch windows and cannot support this number of extracts. With a data warehouse, only one extract is required from the operational system of record.
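The extract arithmetic in this example can be sketched in a few lines of Python. This is an illustration only, not a measurement: the mart names and the function `extracts_on_source` are hypothetical, and the sketch assumes every mart needs the same customer table.

```python
# Hypothetical marts, each needing customer data from the operational system of record.
MARTS = ["sales", "marketing", "finance", "quality_control", "accounting"]


def extracts_on_source(marts, architecture):
    """Count the extracts run against one operational source table per batch window."""
    if architecture == "independent":
        # Every mart pulls its own copy straight from the operational system.
        return len(marts)
    if architecture == "warehouse":
        # The warehouse pulls once; the marts are then fed from the warehouse.
        return 1 if marts else 0
    raise ValueError(f"unknown architecture: {architecture!r}")


independent = extracts_on_source(MARTS, "independent")
warehouse = extracts_on_source(MARTS, "warehouse")
print(f"independent marts: {independent} extracts; data warehouse: {warehouse} extract")
```

The load on the operational system grows with every new independent mart, while in the warehouse case it stays at one extract no matter how many dependent marts are added.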

Non-Integrated

As previously discussed, each independent data mart is built by autonomous teams, typically working for separate departments. As a result, these data marts are not integrated and none of them contains an enterprise view of the corporation. Therefore, if the CEO asks the IT department to provide him with a “listing of our most profitable customers” each data mart will offer a different answer. Having worked with companies that have experienced this exact situation, I can attest that the CIO is rarely pleased to have to explain why his department cannot answer this seemingly simple question.

One of the chief phenomena facing corporations today is the current merger and acquisition craze. Interestingly, one of the key factors fueling this movement is the desire by many companies to reduce their IT spending. In light of this situation, the costs associated with independent data marts become even more magnified as companies continue to focus on controlling their ever-growing IT costs.

It is important to note that many companies that have built independent data marts are currently in the process of migrating from them. The cost of the migration, in dollars and time, is not trivial.

Why Do Independent Data Marts Exist?

With all of these architectural flaws it would seem surprising that so many companies have built their DWE systems around this architecture. There are several reasons why this aberration has occurred.

Benefits of Migrating to a Dependent Data Mart

Migrating to a dependent data mart can bring several benefits to an organization, including improved data consistency and integrity. By centralizing data management within a data warehouse, dependent data marts ensure that all departments are working with the same, accurate data. This reduces the risk of discrepancies and errors that can arise from using multiple, independent data sources. Additionally, dependent data marts streamline data integration and cleansing processes, leading to more efficient data management. This centralized approach also enhances scalability, allowing the organization to grow and adapt its data infrastructure more easily. Ultimately, migrating to a dependent data mart supports better decision-making by providing reliable, consistent, and timely data to all business units.

Data Warehouse Environments (DWE) Are Complex

When the data warehousing craze spread, most companies were looking to build one of their own. Unfortunately, the task of building a well architected and scalable business intelligence system is complicated and requires sophisticated software, expensive hardware, and a highly skilled and experienced team. Finding data warehouse architects and project leaders that truly understand data warehouse architecture is a daunting challenge, both in the corporate and consulting ranks.

Understanding different data storage systems, such as data marts, data warehouses, and data lakes, is crucial for effective data management and integration. These systems vary in structure, purpose, and data types, making it essential to grasp their differences.

To construct a data warehouse, a corporation must come to terms with its data and the business procedures that the data represent. While this task is challenging, it is a necessary step and one from which the true value of the DWE process is derived.

Independent Data Mart Shortcut

Building independent data marts is initially less expensive than architected data warehousing environments. In addition, independent data marts can be constructed quickly and do not require a company to really understand their data beyond that of individual departments as a data warehouse requires. These points have been used effectively to sell the concept of constructing independent data marts. Unfortunately, it is this lack of thorough analysis and long-term planning that limits the independent data marts from being an effective business intelligence system.

However, understanding business data is crucial for effective data management. Business data supports business operations and decision-making processes, and data marts tailored to specific business units can enhance operational efficiency and data-driven decision-making.

Inappropriate Vendor Messages

Many vendors have developed tools that are effective at building small, departmental independent data marts. These companies in their rush to market with these tools have worked very hard at selling the independent data mart concept (of course it is never worded like this). The reasons are obvious. These companies can significantly reduce their sales cycles because only one department is involved in the software purchasing decision. In addition, their software requires much less sophistication because they merely need to build a standalone data store.

The current vendor buzzword in today’s market is “turnkey” or “integrated solutions.” Everyone seems to offer a “turnkey” or “integrated” DWE solution. Unfortunately, merely purchasing a “turnkey” solution does not alleviate the task of learning and understanding a corporation’s data and its business processes. Integration of data from disparate systems requires a careful analysis and an understanding of business processes and the data that represents them. There is no “magic bullet” or “turnkey” solution that alleviates this task. An “integrated” solution requires that the organization understands all the sources that will contribute to the final result.

Approaches to Migration

There are two general approaches for migration: “Big Bang” and “Iterative”. Table 1 summarizes the advantages and disadvantages of each approach.

When dealing with multiple data marts, organizations often face challenges such as data silos, redundancy, and increased storage requirements. Establishing organization-wide standards in naming conventions and governance policies is crucial to facilitate effective data integration and reporting during migration.

Table 1: “Big Bang” vs. Iterative Approaches to Data Warehousing / BI

Big Bang Approach

As the name implies all of the independent data marts will be reengineered simultaneously into a structured DWE architecture. There are some advantages to this approach. First, it can provide the fastest path for migration. Often, companies will need to change their DWE architecture as quickly as possible because of a need to implement additional DWE projects that promise to lend a high return on investment (ROI) or because there are funds available for the effort currently that might not be available later.

Second, this approach allows for immediate economies of scale rather than slowly attaining them as in the Iterative method. The disadvantages to this approach are that it is labor intensive and requires tremendous coordination. In addition, the “Big Bang” approach is the more complex of the two to implement and thus provides the highest exposure to risk.

This approach is best suited when the independent data mart problem is relatively small and not highly complex. However, when the problem is large the complexity of the migration grows at a tremendous rate.

Iterative Approach

This approach looks to reengineer the independent data marts (one or two data marts at a time) in manageable phases. The advantages to this approach are several. First, it allows a company to manage and reduce the risk involved in a migration effort. This occurs because the migration can be accomplished in a phased manner, thereby increasing the probability of the project’s success.

Second, as each project phase is executed, lessons are learned and leveraged for subsequent phases. This is very valuable since, once the first phase is completed, the follow-up phases usually run much more smoothly.

The major disadvantage to this approach is that it takes longer to fully complete the migration. This approach is most successful when the independent data mart problem is large and too complex to tackle in a “Big Bang” manner.

Initial Planning

Many companies fail in their migration efforts long before they start. The chief reason for this is the lack of initial planning and sponsorship. Obtaining executive sponsorship is one of the most important tasks at the onset of the project. This is critical since typically autonomous teams in different corporate departments have constructed each of the independent data marts. Therefore, having a project champion that has cross-departmental authority is critical for dealing with the political challenges, which are commonplace in these migration efforts.

Understanding business data is crucial for effective migration planning. Business data supports business operations and decision-making processes, and recognizing its importance can enhance operational efficiency and data-driven decision-making.

During the initial planning phases it is important to plan on implementing a metadata repository that can support future DWE development efforts and that will provide a semantic layer between the business users and the DWE system. The data mart migration provides an outstanding opportunity to implement the metadata repository. Before the data mart migration begins it is best to standardize the data naming nomenclature for the DWE system. Implementing standard data naming nomenclature will aid in the DWE system’s maintenance and provide cleaner and more understandable metadata.
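A standard naming nomenclature can be enforced mechanically once it has been agreed upon. The following Python sketch shows one possible shape for that check. The column names, the `STANDARD_NAMES` mapping, and the `standardize` function are all hypothetical and stand in for whatever standard the organization adopts.

```python
# Hypothetical mapping from mart-specific column names to one agreed standard name.
STANDARD_NAMES = {
    "cust_no": "customer_id",
    "customer_number": "customer_id",
    "custid": "customer_id",
    "ord_dt": "order_date",
    "order_dt": "order_date",
}


def standardize(columns):
    """Return (renamed columns, columns that have no agreed standard name yet)."""
    renamed, unmapped = [], []
    for col in columns:
        key = col.strip().lower()
        if key in STANDARD_NAMES:
            renamed.append(STANDARD_NAMES[key])
        else:
            # Keep the name as-is, but flag it for the naming review.
            renamed.append(key)
            unmapped.append(key)
    return renamed, unmapped


finance_cols, finance_gaps = standardize(["CUST_NO", "ORD_DT", "gl_acct"])
print(finance_cols)   # standardized names for the metadata repository
print(finance_gaps)   # names still waiting for a standard
```

Running a check like this against each independent data mart before migration gives a concrete list of naming conflicts to resolve, which in turn keeps the metadata repository cleaner.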

A great deal of research must be conducted on the independent data marts before a migration is possible (Table 2 summarizes these tasks). The most important research activity is to understand the business needs that each independent data mart is meeting. Typically, multiple independent data marts will exist to meet the same or similar business needs. These situations are common and do suggest a path for migration. The results of this research will illustrate the independent data marts that will be the most difficult to migrate.

The independent data mart migration is an excellent time to standardize on hardware and software for the DWE project. For each differing software or hardware platform, a company needs to have trained personnel to support it. Therefore, by limiting the redundant software/hardware, the corporation reduces the support strain on its IT staff. In addition, standardizing allows for software and hardware purchasing economies of scale to be achieved.

Table 2: Independent Data Mart Migration Research Tasks

Golden Rule

The central covenant of any independent data mart migration effort is to “Never deliver less functionality to the business users than what they have today”. Generally, business users do not react well to spending money on infrastructure because they don’t initially see its value. The key business users need to be educated that a bad system architecture leads to a non-scalable and non-flexible system that will eventually need to be rewritten at a very high cost. Therefore, during migration the users must be assured that they will not receive less functionality (information, ease of use, and response time) than what they are currently receiving today.
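The information part of this rule can be checked with a simple set comparison before each cutover. The sketch below is a minimal illustration under stated assumptions: the report names and the `functionality_gaps` function are hypothetical, and it covers only the information delivered, not ease of use or response time.

```python
def functionality_gaps(current, proposed):
    """Return what users have today that the migrated mart would no longer deliver."""
    return sorted(set(current) - set(proposed))


# Hypothetical report inventories for one data mart.
current_reports = {"monthly_revenue", "top_customers", "returns_by_region"}
proposed_reports = {"monthly_revenue", "top_customers", "margin_by_product"}

gaps = functionality_gaps(current_reports, proposed_reports)
if gaps:
    print("Do not cut over yet; missing:", ", ".join(gaps))
else:
    print("No functionality lost.")
```

An empty result is a necessary condition for honoring the rule, not a sufficient one, since usability and performance still have to be verified with the business users.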

Identifying a Migration Path

There are several activities that are necessary to conduct before a migration path will be evident.

One potential solution to some of these challenges is the implementation of a hybrid data mart, which combines dependent and independent data marts to utilize data from both storage types.

Create Your Own Spaghetti Chart

First, diagram out the current DWE architecture. This is critical for identifying which legacy systems are feeding which independent data marts.

Figure 3: Diagram Current DWE Architecture
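Before drawing the chart, it can help to record the architecture as data. The Python sketch below assumes a hypothetical inventory (`MART_SOURCES`) of marts and the legacy systems they read, and inverts it to show which marts each legacy system feeds. The system and mart names are illustrative only.

```python
from collections import defaultdict

# Hypothetical current state: each independent mart and the legacy systems it reads.
MART_SOURCES = {
    "marketing": ["order_entry", "customer_master"],
    "finance": ["order_entry", "general_ledger"],
    "quality_control": ["logistics"],
}


def marts_by_source(mart_sources):
    """Invert the mart -> sources inventory into source -> marts."""
    feeds = defaultdict(list)
    for mart, sources in mart_sources.items():
        for source in sources:
            feeds[source].append(mart)
    return {source: sorted(marts) for source, marts in feeds.items()}


feeds = marts_by_source(MART_SOURCES)
for source, marts in sorted(feeds.items()):
    # Each line is one bundle of arrows in the spaghetti chart.
    print(f"{source} -> {', '.join(marts)}")
```

Any legacy system that feeds more than one mart is a place where a single warehouse extract could replace several independent ones.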

Identify Redundant Data

Often, independent data marts will be sourced from the same legacy systems. By targeting independent data marts with the same source data, multiple independent data marts can often be removed with minimal extra effort. Identifying redundant data often suggests a migration path.

Figure 4 illustrates existing independent data marts for a company. In the schematic both the Finance and Marketing data marts are sourced from the same legacy systems. This suggests that it might be wise to target both of these data marts for initial migration (assuming the Iterative approach is used).

Figure 4: Identifying Redundant Data Sources
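The overlap shown in the schematic can also be found programmatically once the sources of each mart are listed. This Python sketch is a minimal illustration: the source system names and the `shared_sources` function are hypothetical, chosen only to mirror the Finance and Marketing situation described above.

```python
from itertools import combinations

# Hypothetical inventory: the legacy systems each independent mart is sourced from.
MART_SOURCES = {
    "finance": {"old_order_entry", "new_order_entry"},
    "marketing": {"old_order_entry", "new_order_entry"},
    "quality_control": {"logistics"},
}


def shared_sources(mart_sources):
    """Return mart pairs that share legacy sources, largest overlap first."""
    pairs = []
    for a, b in combinations(sorted(mart_sources), 2):
        common = mart_sources[a] & mart_sources[b]
        if common:
            pairs.append((a, b, sorted(common)))
    return sorted(pairs, key=lambda p: len(p[2]), reverse=True)


candidates = shared_sources(MART_SOURCES)
for a, b, common in candidates:
    print(f"{a} + {b}: shared sources {common}")
```

Pairs at the top of the list are natural candidates to migrate together in an early phase, because one set of warehouse extracts retires the redundant extracts of both marts.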

Identify Paths of Least Resistance

Data

It is important to target those independent data marts whose data will most likely be used in future DWE efforts. By targeting these data marts first, it will ease the task of keeping all new DWE development activity in the newly architected environment.

The next step is to identify those data marts whose transformation rules are known and documented. Understand that even the best-documented transformation rules will have gaps. Moreover, even those marts that have been built using ETL (Extraction/Transformation/Load) tools have metadata (documentation) gaps. For example, ETL tools can provide the functionality to call user exits that are hand-coded programs. The processes performed by these user exits will not be captured in the ETL tool’s metadata stores. If documentation does not exist for a mart, then programmers will need to manually analyze each ETL program’s code to extract the transformation rules. Manually analyzing code to extract transformation rules is a very time-consuming and expensive activity.

Political

It will be critical to obtain support from the current independent data mart IT teams and business users. Identify those data mart teams most likely to work cooperatively with the centralized DWE team. Recognize the strengths and weaknesses of those teams that can and will provide the most aid. If a particular data mart team/business users are not willing to assist with the migration effort it is best to work around these teams by delaying the migration of their particular data mart. If this is not an option, then utilize your executive sponsorship to “motivate” this group to provide their support.
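The data and political criteria above can be combined into a rough ordering of migration candidates. The sketch below is only one possible way to do so: the `MartProfile` class, the equal weighting of the three criteria, and the sample marts are all assumptions made for illustration, not a prescribed scoring method.

```python
from dataclasses import dataclass


@dataclass
class MartProfile:
    name: str
    data_reused_in_future: bool   # data likely to be used in future DWE efforts
    rules_documented: bool        # transformation rules known and documented
    team_cooperative: bool        # mart team and business users support the migration

    def score(self) -> int:
        # Assumption: each criterion counts equally; True counts as 1.
        return sum([self.data_reused_in_future, self.rules_documented, self.team_cooperative])


def migration_order(profiles):
    """Order marts from least to most resistance (highest score first)."""
    return [p.name for p in sorted(profiles, key=lambda p: p.score(), reverse=True)]


# Hypothetical assessment results.
profiles = [
    MartProfile("quality_control", True, True, False),
    MartProfile("marketing", True, True, True),
    MartProfile("acquired_marketing", False, False, False),
]
order = migration_order(profiles)
print(order)
```

A real assessment would likely weight the criteria differently and would treat the result as input to a discussion with the executive sponsor rather than as a final decision.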

Understand your team’s strengths and weaknesses

Keep in mind that any team will have its stronger and weaker areas of knowledge and experience. As much as possible, keep your team’s areas of weakness off the critical path. Any mission-critical team weaknesses must be reinforced with internal members from the other data mart teams or from outside vendors. Consider training the entire team in data management to ensure a strong team.

Case Study: Putting the Concepts into Motion

The following case study looks to put the concepts we have discussed into action. This case study illustrates the iterative approach to independent data mart migration since most companies that have independent data marts typically have a pervasive and complex situation.

Background

The XYZ company is a Fortune 500 consumer electronics firm. XYZ recently acquired a smaller company (Acme Electronics) that has a single Marketing data mart; little is known about this data mart. In addition, XYZ is standardizing on a new order entry system in five (5) years, and existing batch windows for the legacy systems have reached their limit. XYZ’s management team is stable, well organized, and fully supports the migration effort. Table 3 lists the DWE specific details and Figure 5 shows the current DWE architecture.

Table 3: Case Study DWE Background

Figure 5: Independent Data Mart Architecture

Phase One Migration

By viewing the data, it is evident that the Marketing and Finance data marts share two common data sources (old and new order entry systems). In addition, the Marketing data mart has a strong end-user community that will be highly supportive of the migration effort. Both the Marketing and Finance data marts’ business users have also agreed to freeze their additional functionality requests for Phase One of the migration. Identifying the business and technical metadata for each data mart was an essential component of this phase.

During this phase, we avoided migrating the Quality Control and the Acme Marketing data marts. This occurred because of the lack of support in the Quality Control mart and all the unknown aspects of the Acme Marketing mart. Figure 6 illustrates the Phase One DWE architecture.

Figure 6: Phase One DWE Architecture

Phase Two Migration

During this phase, the operational logistical system’s data is brought into the data warehouse, and the Quality Control data mart is now sourced directly from the enterprise data warehouse. In addition, the Marketing and Finance teams’ change requests that were frozen during the Phase One implementation are now being developed. Lastly, a new dependent Accounting data mart is now sourced from the data warehouse.

Figure 7: Phase Two DWE Architecture

Phase Three Migration

In this phase we are merging the functionality in the former Acme Electronics Marketing data mart into the existing dependent Marketing data mart. Also, additional data marts are continuing to appear (CEO data mart).

Figure 8: Phase Three DWE Architecture

Conclusion

It is important to understand that the process for migrating from this architecture is a costly proposition that will only get more expensive and difficult as time goes on. Remember, as with any disease, the earlier it is detected and treatment begins, the sooner the patient will become healthy. However, if treatment is delayed, the patient’s condition will worsen and eventually become terminal.

David Marco, PhD

David Marco, PhD is President of EWSolutions and Executive Managing Director of the Global Data Practice. He advises CDOs, CIOs, and executive leadership teams on AI and data governance, decision accountability, and trust in complex, high-stakes environments. David works with organizations to design governance systems that hold under real operational pressure and enable AI outcomes executives can trust.