Welcome to Data Innovations
Delivering successful data warehousing, master data management, data architecture and software integration projects since 2002. These projects have included the following services:
Data Innovations is an official service provider and software reseller for CA, Inc. We offer solutions to integrate CA software into our clients' environments.
Explore our website to learn more about these products, services and solutions.
Data Modeling
Data modeling is a key component of any IT development project. Data modeling identifies and captures data requirements within an entity relationship diagram. Once the logical model is analyzed and normalized, a physical model is derived and used to establish and maintain the new database.
The entire data modeling process is simplified through technology. The CA ERwin Data Modeler software automates this entire process. The software allows the data modeler to create logical models through a graphical interface. Physical models are derived from the logical models in the RDBMS of choice. The software helps to facilitate communication between the data modeler and the database administrator. Utilizing the CA Model Manager software allows the logical and physical models to be shared amongst project members. The software also facilitates change management through user and library security.
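As a rough illustration of deriving a physical model from a logical one, the sketch below turns a logical entity into a CREATE TABLE statement for a chosen RDBMS. It is not how CA ERwin Data Modeler works internally. The type mapping, the naming rule, and the function names (`TYPE_MAP`, `to_physical_name`, `derive_ddl`) are assumptions made for this example only.

```python
# Hypothetical mapping from logical types to physical column types per RDBMS.
TYPE_MAP = {
    "oracle": {"string": "VARCHAR2(255)", "integer": "NUMBER(10)", "date": "DATE"},
    "postgresql": {"string": "VARCHAR(255)", "integer": "INTEGER", "date": "DATE"},
}


def to_physical_name(logical_name):
    """Assumed naming rule: upper case, spaces replaced by underscores."""
    return logical_name.strip().upper().replace(" ", "_")


def derive_ddl(entity, attributes, dbms):
    """Build a CREATE TABLE statement from a logical entity.

    attributes is a list of (logical_name, logical_type, is_key) tuples.
    """
    types = TYPE_MAP[dbms]
    lines = []
    key_columns = []
    for name, logical_type, is_key in attributes:
        column = to_physical_name(name)
        not_null = " NOT NULL" if is_key else ""
        lines.append(f"  {column} {types[logical_type]}{not_null}")
        if is_key:
            key_columns.append(column)
    if key_columns:
        lines.append(f"  PRIMARY KEY ({', '.join(key_columns)})")
    return f"CREATE TABLE {to_physical_name(entity)} (\n" + ",\n".join(lines) + "\n)"


if __name__ == "__main__":
    logical_attributes = [
        ("Order Id", "integer", True),
        ("Order Date", "date", False),
    ]
    print(derive_ddl("Customer Order", logical_attributes, "oracle"))
```

The same logical definition can be passed with "postgresql" to get a different physical model, which is the point of keeping the logical and physical layers separate.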
Enterprise Resource Planning
Many organizations have invested heavily in the acquisition and implementation of Enterprise Resource Planning (ERP) packages to streamline specific business functions such as human resource, supply chain, financial, customer, warehouse, and decision support management. The ERP package itself often replaces two or more homegrown applications. The intention is to leverage the cross-functional capability of the package to replace disparate systems and consolidate the data into a single data model.
However, the data models that support these packages contain thousands of tables and attributes. Understanding where specific data is stored in these packages can be difficult at best. In addition, the names of the tables and attributes are proprietary and often cryptic. Even worse, the relationships between tables and attributes are enforced by the application, not by the data structure itself. This creates serious challenges for performing data modeling and data sourcing activities against these packages.
Data Innovations is now providing solutions to support the reverse engineering and profiling of your ERP software. Our solution utilizes the CA ERwin® Saphir Option to transform ERP metadata into CA ERwin® Data Modeler data models. These models are leveraged to integrate the ERP into your IT environment. The models are ideal for identifying data to be profiled and sourced from your ERP into external business intelligence solutions, such as data warehouses or master data management solutions.
Data Profiling
Data profiling is the analysis of the data itself to infer metadata. The inferred metadata is useful for many different purposes such as data modeling, data quality, data sourcing, enterprise resource planning, data warehousing, master data management, metadata repositories, application development, and business intelligence. Data profiling software is powerful technology when properly deployed and utilized.
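To make "inferring metadata from the data itself" concrete, here is a minimal sketch of profiling a single column. It is an illustration only and does not represent any commercial profiling product. The function name `profile_column` and the set of metrics it returns are assumptions chosen for this example.

```python
from collections import Counter


def _parses_as(convert, text):
    """Return True if convert(text) succeeds, for example int or float."""
    try:
        convert(text)
        return True
    except ValueError:
        return False


def profile_column(values):
    """Infer simple metadata for one column from its raw string values.

    None and blank strings are treated as nulls (an assumption of this sketch).
    """
    non_null = [v for v in values if v is not None and v.strip() != ""]

    if not non_null:
        inferred_type = "unknown"
    elif all(_parses_as(int, v) for v in non_null):
        inferred_type = "integer"
    elif all(_parses_as(float, v) for v in non_null):
        inferred_type = "decimal"
    else:
        inferred_type = "string"

    return {
        "row_count": len(values),
        "null_count": len(values) - len(non_null),
        "distinct_count": len(set(non_null)),
        "inferred_type": inferred_type,
        "max_length": max((len(v) for v in non_null), default=0),
        "most_common": Counter(non_null).most_common(1),
    }


if __name__ == "__main__":
    print(profile_column(["10", "20", "", None, "20"]))
```

Even this small set of metrics (null counts, distinct counts, inferred type, maximum length) is the kind of inferred metadata that can feed data modeling and data sourcing decisions.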
How do organizations benefit from data profiling?
Implementing data profiling eliminates the code-load-explode development methodology for data warehousing or master data management projects that occurs when ETL specifications are created based mainly on the institutional knowledge in the head of the business subject matter expert (SME). The SME-based specifications are utilized to develop the ETL code. Unit testing the code will often cause the code to explode due to unexpected problems with the data. The problems are identified and sent back through the development process to the SME to analyze and correct the ETL specifications.
These types of data problems are usually located incrementally, causing the process to repeat itself several times. Finally, when the code passes unit testing, it is then moved on to the next level of testing, system testing. The code-load-explode process then repeats itself again because more data is utilized during system testing. Problems uncovered during system level testing have to go all the way back to the beginning of the development process. This process is repeated throughout the different levels of testing and often leads to project overrun. The cost to the organization is easily calculated by tracking the development hours for reworking the code.
Even scarier is that the code-load-explode approach can allow bad data to make it into production targets. Correcting data problems in production is expensive, but the cost to the organization may be even higher if erroneous business decisions are being made because of the data.
Can your business afford to have erroneous data in business intelligence applications or transaction systems?
Data profiling eliminates the code-load-explode method because the profiling software allows the SME to review all of the data content and the inferred metadata to get the specifications right the first time. The profiling results provide the detailed information necessary to create accurate ETL specifications. However, this is only one of the many ways that data profiling software can be leveraged by the organization.
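As a small sketch of checking a specification against the actual data before any ETL code is written, the example below tests an SME's assumptions about a column against sample rows and lists every violation in one pass. The rule format (`nullable`, `allowed`) and the function name `check_spec` are hypothetical and exist only for this illustration.

```python
def check_spec(rows, spec):
    """Return (row_index, column, problem) for every value that breaks the spec.

    rows is a list of dicts; spec maps a column name to a rule dict with the
    optional keys "nullable" (bool) and "allowed" (a set of permitted values).
    """
    problems = []
    for index, row in enumerate(rows):
        for column, rule in spec.items():
            value = row.get(column)
            if value is None or value == "":
                if not rule.get("nullable", True):
                    problems.append((index, column, "missing value"))
                continue
            allowed = rule.get("allowed")
            if allowed is not None and value not in allowed:
                problems.append((index, column, f"unexpected value {value!r}"))
    return problems


if __name__ == "__main__":
    # The SME believes status is always present and is either "A" or "I".
    assumed_spec = {"status": {"nullable": False, "allowed": {"A", "I"}}}
    source_rows = [{"status": "A"}, {"status": "X"}, {"status": ""}]
    for problem in check_spec(source_rows, assumed_spec):
        print(problem)
```

Finding all exceptions up front, rather than one at a time during unit and system testing, is what breaks the repeat cycle described above.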
Data Innovations is a leading data profiling service provider and has delivered successful data profiling projects for clients since 2002.
DI consultants have a proven track record for establishing the profiling environment, developing bulletproof data analysis methodologies, and mentoring clients to successful utilization of the technology. For more information on how DI can help your organization leverage data profiling, contact us at solutions@dataprofilers.com.
Ensuring Business Continuity for Virtualization
Virtualization is a growing trend in the IT industry. Organizations are leveraging virtualization in distributed environments to reduce their overall physical server footprint and maximize the utilization of each physical server. Reducing the number of physical servers decreases the organization's carbon emissions and power consumption. This is the right thing to do for the environment and can reduce the organization's power and support costs.
However, introducing virtualization increases the complexity of ensuring that critical applications, such as company email or front-end applications, are always available. Data recoverability through robust and proven solutions is also essential to all virtual solutions.
CA, Inc. (CA) provides software to address business continuity for virtualization. The following solution identifies two CA software products that address high availability and disaster recovery for distributed virtual environments.
| Phone: | 1-888-GET-ER1S (438-3717) |
| Email: | sales@dataprofilers.com |
| | solutions@dataprofilers.com |
Data Innovations, CA and Tech Data Present: Accessing ERP Data through Data Modeling - webcast
Exeros Signs Data Innovations as a New Reseller and Integration Partner - cnbc.com
Data Innovations and CA Present: A Go Green Solutions Seminar - Ensuring Business Continuity for Virtualization - webcast
Data Innovations at CAWorld - caworld.com