11
Next Generation Data Validation Solution Infosys Data Testing Workbench (IDTW)

Infosys Data Testing Workbench (IDTW)

  • Upload
    others

  • View
    15

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Infosys Data Testing Workbench (IDTW)

Next Generation Data Validation Solution

Infosys Data Testing

Workbench (IDTW)

Page 2: Infosys Data Testing Workbench (IDTW)

Data validation services - Infosys Point of View

The demand now is for interactive analytics with multi-channel support and scalability . There is huge scope for improvement and enhancements in this space by offering a simple end-to-end solution for analytics testing.

3

ETL Jobs

Data Migration

File/RDBMS to RDBMS/file

Data Comparison

Table Analysis

Report Testing

Test Management

Multi-source, omni-channels

Lambda-Arch, Data Lakes, Cloud DBs, IoT

Data Pipelines

Trad

itio

nal

Eme

rgin

g Sc

op

e

Reporting - Counts, Max, Min, Avg, Data Pre-processing SQL Queries

Summary

Data Stores Data Processing

Unstructured

Validation

ALM, DevOps, CI/CD

Pipeline Validation

Data Lakes Health

AI-based Detection

Insights

Schema Rigid

RDBMS

Structured

4Vs Agile/ DevOpsDrivers InteractiveData science Open StackEmerging Tech

AI/ML on data

Open Standards

Collaboration

Schema-less

Image/Video/Social Media

Huge Volume – TBs, ZBsInsights anywhere

Interactive Visualization

Real-time Analysis, Test

Variety, Volume, Veracity, Velocity

Page 3: Infosys Data Testing Workbench (IDTW)

ValidationMulti-source, Omni-

channels

Real-time

analysis, testPipeline Validation

AI / ML on dataInteractive

VisualizationData Lakes Health

Open StandardsEnd-to-end

validation Insights anywhere

Schema Rigid

RDBMS

Structured

ETL Jobs

Data Migration

File/RDBMS to RDBMS/file

Data Comparison

Table Analysis

Report Testing

Test Management

Data stores Data processing

Data Testing in today’s context

10 years ago Today

CSV

XML</>

JSON

With the rapid rise in data stores, streaming apps, Cloud & IoT – the expectation is to automate right from event to access time while having the capability to support data both in motion and at rest. Enterprises in today’s setting need to be enabled with simple, scalable end-to-end validation of streaming systems and multi-channel scenarios with seamless integration with CI/CD QA tools.

Structured Un-structured

4

Page 4: Infosys Data Testing Workbench (IDTW)

Infosys Confidential© 2020 Infosys Ltd.

• Supports data in rest, data in motion• Multi-protocol Support• Data migration Validation – On-prem to

Cloud DBS, AWS, Azure, etc.• Data Quality for all heterogenous files,

RDBMS, Big Data, Cloud Data Stores

End-to-end Automated Validation

Enterprise Features

QA Workflow, Test Governance, Management,

Audit-trail, Security, Reporting, Batch Scheduling Micro-services

PlatformCloud Native

• Single node

• Cluster to N

laptops/desktops

ScalableInstall

OAuth2

LDAP/Custom

Security

IDTW – A comprehensive one-stop solution for this

new era of Data Testing

• Test Orchestration Pipeline – catering to streaming, multi-interface data validation with visual monitoring

-Pre-built automated QA templates• SQL for whole no-SQL world• Interactive Notebook-based Visualizations

5

Ci/CD/QA

SQL

X

Total

XTotal

Reporting/Analysis

Page 5: Infosys Data Testing Workbench (IDTW)

IDTW has helped its users realize benefits on multiple aspects across verticals

10

Effort Savings Reduce effort savings of 40-60% on test execution due to automation

50% savings in over all testing effort

Reduced time to market up to 25%

100% coverage and complete traceability

Vertical Challenge Benefit

Banking

Client has Global Data Warehouse (GDW) & a 3 Terabyte Data Warehouse that connects to 120+ source systems & 100+ downstream systems

» Cycle Time Reduction by 20-22%» 90% reduction in Data Validation

Banking

Client Group integrated data warehouses of 4 banks where the technology landscape was highly complex

» COQ reduced to 15.9% » ZD - 588K AUD Approved

Insurance

Client enhanced their complex agency compensation repository for new metrics calculation

» Cycle Time Reduction by 15%» 70% Effort Savings

Telecom

Project involved testing across various data sources with huge volumes of data » 100% Test Data Coverage

Retail

Client planned to optimize and migrate their BW landscape to HANA Landscape within 6 months

» Cycle Time Reduction by 40%» 60% Effort Savings

Page 6: Infosys Data Testing Workbench (IDTW)

IDTW architecture and deployment in the eco-system

User

In-Mem / Postgres / MySQL

http

jdbc

AWS/Azure/OS

jdbc

DevOps / Test Management

Load balancer

browsers

IDTW App

OA

uth

2

HTTP

HTTP

Data services

RDBMS Files No SQL

Cloud DB s

HTTP

Data sources / SUT

Page 7: Infosys Data Testing Workbench (IDTW)

JSON

XML</>

Txt

CSV

T1

T2

T..N

Store/Access

X Y

HDFS

Analytics

IDTW

CI/CD,ALM stack

• Mismatch of data• Anomalies• Create defect

Consume data from Kafka

• Validate the incoming CSV• Check blanks, card no format• Non-standard Values

Event time Access time

Validate the data from file/db/storage Bnc check, total amount

Data sources read by Flink, for e.g. csv

Put in Kafka

Test pipelines – streaming scenarios

8

Page 8: Infosys Data Testing Workbench (IDTW)

Data Comparison Module

6

Data Comparison between any source Vs. any target

Field Level Comparison

Detailed Anomaly Reporting for easy defect fixing

Execution &

Report Generation

Data

Comparison

Testcase

Details

Source Query

Mapping

SRC & TGT Columns

Target Query

Page 9: Infosys Data Testing Workbench (IDTW)

Data Quality Checks on Big Data

7

• Hierarchical

• File-based

• Semi Structured

Data Source

• Data Integrity Checks

• Data Level Validation

Analysis• Business Rules

• Data Integrity Constraints

• Outliers

Rules/ Constraints

• Configurable Reports

• Detailed Anomalies

Report

Metadata

• Constraints (PK, FK)• column Data Type• Nullability

Statistical

• Column Statistics• Outliers Check• Data Duplication Check

Relationship

• Foreign Key Verification

• Cardinality Verification

Pattern

• Standardization Check • Precision Constraints• Format Verification

Custom Rules

• Domain-specific Business Rules

Page 10: Infosys Data Testing Workbench (IDTW)

Benefits

9

Multi Data

Sources Support

End-to-end

Validation

Emerging Tech

Integration

Wider

Coverage

Support for various data stores – files, DBs, cloud stores, Kafka, streaming, Twitter, AWS, etc.

For a leading Financial Major, IDTW acted as a single automation platform for end-to-end

data validation, all the way from on-premise legacy systems to various AWS cloud data

sources and validated huge amounts of data. 100% automation of E2E and Regression

Test Suites was also achieved.

IDTW enabled Snowflake-based migration on Azure for a Canadian retail giant.

Direct connectivity for automated data validation in legacy DBs, AWS Redshift

and Amazon S3 was also established.

A leading Coffee Major required a single automation tool supporting disparate data

sources. IDTW did this and ensured 100% data validation of all records with high

quality and reliability of migrated data. In addition, it also identified potential bugs

due to exhaustive test coverage.

Effort Savings Reduced effort savings of 40-60% on test execution due to automation

Delivers 50% savings in overall testing effort

Reduced time-to-market up to 25%

100% Coverage and complete Traceability

Page 11: Infosys Data Testing Workbench (IDTW)

© 2019 Infosys Limited, Bengaluru, India. All Rights Reserved. Infosys believes the information in this document is accurate as of its publication date; such information is subject to change without notice.

Infosys acknowledges the proprietary rights of other companies to the trademarks, product names and such other intellectual property rights mentioned in this document. Except as expressly permitted,

neither this documentation nor any part of it may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, printing, photocopying, recording or

otherwise, without the prior permission of Infosys Limited and/ or any named intellectual property rights holders under this document.

THANK YOU