After several business initiatives failed because of unreliable data, the retailer made data quality a top priority. As a data-driven organization, the company wanted to ensure accurate data was used for improving its business processes. Optimizing sales, logistics, and merchandising required reliable product, geolocation, and supplier data. At the same time, online customers were demanding more granular and complete product information to make purchasing decisions. Together, these factors led the company to look for a data quality solution.
When selecting a data quality vendor and tool, the company looked for the following:
The retailer took full advantage of the flexibility of the Ataccama ONE platform, starting with ETL and DQ processing use cases on traditional data sources. The DQ team then built a data lake and reused existing configurations to scale their solution with big data processing capabilities leveraging Spark. Later, the team built a Kafka-based pilot to stream real-time data from cash desks in brick-and-mortar shops, and use this data for immediate decision making.
Business development teams and management can reliably use company master data for driving business decisions.
DQ engineers can clearly visualize and quantify data quality issues.
A firmly established data quality culture resulting in good data used for decision making across the company.
The team of 23 data quality engineers perform data quality checks on their data lake for various data-critical projects, such as launching a new product or creating a new report in the data warehouse. In terms of technical implementation, the data is processed with Spark jobs on a Hadoop cluster or a standalone server, depending on data volumes. The results are visualized in Tableau or exported to Excel for further analysis.
Ataccama ONE works as an ETL tool for proving data to the company’s ERP system, with a data volume of 50 GB per day in this particular case. Additionally, DQ engineers use Ataccama ONE for fast ETL prototyping.
The company also leveraged Ataccama ONE as an address validation solution with two use cases:
To build the solution, the client team used an open source, countrywide address lookup, then constructed several data pipelines around it and published them as web services.
RSA implemented Ataccama’s Reference Data Management solution on AWS to centralize data management, avoid data duplication, and propagate a single version of reference data to consuming systems.
The charity implemented Ataccama ONE MDM to master and consolidate enterprise-wide data into a platform that could be shared across the organization as a single version of the truth.