57
Informatica Data Quality Analyst (Version 9.1.0) User Guide

Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

  • Upload
    others

  • View
    12

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Informatica Data Quality Analyst (Version 9.1.0)

User Guide

Page 2: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Informatica Data Quality Analyst User Guide

Version 9.1.0March 2011

Copyright (c) 1998-2011 Informatica. All rights reserved.

This software and documentation contain proprietary information of Informatica Corporation and are provided under a license agreement containing restrictions on use anddisclosure and are also protected by copyright law. Reverse engineering of the software is prohibited. No part of this document may be reproduced or transmitted in any form,by any means (electronic, photocopying, recording or otherwise) without prior consent of Informatica Corporation. This Software may be protected by U.S. and/or internationalPatents and other Patents Pending.

Use, duplication, or disclosure of the Software by the U.S. Government is subject to the restrictions set forth in the applicable software license agreement and as provided inDFARS 227.7202-1(a) and 227.7702-3(a) (1995), DFARS 252.227-7013 © (1)(ii) (OCT 1988), FAR 12.212(a) (1995), FAR 52.227-19, or FAR 52.227-14 (ALT III), asapplicable.

The information in this product or documentation is subject to change without notice. If you find any problems in this product or documentation, please report them to us inwriting.

Informatica, Informatica Platform, Informatica Data Services, PowerCenter, PowerCenterRT, PowerCenter Connect, PowerCenter Data Analyzer, PowerExchange,PowerMart, Metadata Manager, Informatica Data Quality, Informatica Data Explorer, Informatica B2B Data Transformation, Informatica B2B Data Exchange Informatica OnDemand, Informatica Identity Resolution, Informatica Application Information Lifecycle Management, Informatica Complex Event Processing, Ultra Messaging and InformaticaMaster Data Management are trademarks or registered trademarks of Informatica Corporation in the United States and in jurisdictions throughout the world. All other companyand product names may be trade names or trademarks of their respective owners.

Portions of this software and/or documentation are subject to copyright held by third parties, including without limitation: Copyright DataDirect Technologies. All rightsreserved. Copyright © Sun Microsystems. All rights reserved. Copyright © RSA Security Inc. All Rights Reserved. Copyright © Ordinal Technology Corp. All rightsreserved.Copyright © Aandacht c.v. All rights reserved. Copyright Genivia, Inc. All rights reserved. Copyright Isomorphic Software. All rights reserved. Copyright © MetaIntegration Technology, Inc. All rights reserved. Copyright © Intalio. All rights reserved. Copyright © Oracle. All rights reserved. Copyright © Adobe Systems Incorporated. Allrights reserved. Copyright © DataArt, Inc. All rights reserved. Copyright © ComponentSource. All rights reserved. Copyright © Microsoft Corporation. All rights reserved.Copyright © Rogue Wave Software, Inc. All rights reserved. Copyright © Teradata Corporation. All rights reserved. Copyright © Yahoo! Inc. All rights reserved. Copyright ©Glyph & Cog, LLC. All rights reserved. Copyright © Thinkmap, Inc. All rights reserved. Copyright © Clearpace Software Limited. All rights reserved. Copyright © InformationBuilders, Inc. All rights reserved. Copyright © OSS Nokalva, Inc. All rights reserved. Copyright Edifecs, Inc. All rights reserved. Copyright Cleo Communications, Inc. All rightsreserved. Copyright © International Organization for Standardization 1986. All rights reserved. Copyright © ej-technologies GmbH . All rights reserved. Copyright © JaspersoftCorporation. All rights reserved.

This product includes software developed by the Apache Software Foundation (http://www.apache.org/), and other software which is licensed under the Apache License,Version 2.0 (the "License"). You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0. Unless required by applicable law or agreed to in writing,software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See theLicense for the specific language governing permissions and limitations under the License.

This product includes software which was developed by Mozilla (http://www.mozilla.org/), software copyright The JBoss Group, LLC, all rights reserved; software copyright ©1999-2006 by Bruno Lowagie and Paulo Soares and other software which is licensed under the GNU Lesser General Public License Agreement, which may be found at http://www.gnu.org/licenses/lgpl.html. The materials are provided free of charge by Informatica, "as-is", without warranty of any kind, either express or implied, including but notlimited to the implied warranties of merchantability and fitness for a particular purpose.

The product includes ACE(TM) and TAO(TM) software copyrighted by Douglas C. Schmidt and his research group at Washington University, University of California, Irvine,and Vanderbilt University, Copyright © 1993-2006, all rights reserved.

This product includes software developed by the OpenSSL Project for use in the OpenSSL Toolkit (copyright The OpenSSL Project. All Rights Reserved) and redistribution ofthis software is subject to terms available at http://www.openssl.org and http://www.openssl.org/source/license.html.

This product includes Curl software which is Copyright 1996-2007, Daniel Stenberg, <[email protected]>. All Rights Reserved. Permissions and limitations regarding thissoftware are subject to terms available at http://curl.haxx.se/docs/copyright.html. Permission to use, copy, modify, and distribute this software for any purpose with or withoutfee is hereby granted, provided that the above copyright notice and this permission notice appear in all copies.

The product includes software copyright 2001-2005 (©) MetaStuff, Ltd. All Rights Reserved. Permissions and limitations regarding this software are subject to terms availableat http://www.dom4j.org/ license.html.

The product includes software copyright © 2004-2007, The Dojo Foundation. All Rights Reserved. Permissions and limitations regarding this software are subject to termsavailable at http://dojotoolkit.org/license.

This product includes ICU software which is copyright International Business Machines Corporation and others. All rights reserved. Permissions and limitations regarding thissoftware are subject to terms available at http://source.icu-project.org/repos/icu/icu/trunk/license.html.

This product includes software copyright © 1996-2006 Per Bothner. All rights reserved. Your right to use such materials is set forth in the license which may be found at http://www.gnu.org/software/ kawa/Software-License.html.

This product includes OSSP UUID software which is Copyright © 2002 Ralf S. Engelschall, Copyright © 2002 The OSSP Project Copyright © 2002 Cable & WirelessDeutschland. Permissions and limitations regarding this software are subject to terms available at http://www.opensource.org/licenses/mit-license.php.

This product includes software developed by Boost (http://www.boost.org/) or under the Boost software license. Permissions and limitations regarding this software are subjectto terms available at http:/ /www.boost.org/LICENSE_1_0.txt.

This product includes software copyright © 1997-2007 University of Cambridge. Permissions and limitations regarding this software are subject to terms available at http://www.pcre.org/license.txt.

This product includes software copyright © 2007 The Eclipse Foundation. All Rights Reserved. Permissions and limitations regarding this software are subject to termsavailable at http://www.eclipse.org/org/documents/epl-v10.php.

This product includes software licensed under the terms at http://www.tcl.tk/software/tcltk/license.html, http://www.bosrup.com/web/overlib/?License, http://www.stlport.org/doc/license.html, http://www.asm.ow2.org/license.html, http://www.cryptix.org/LICENSE.TXT, http://hsqldb.org/web/hsqlLicense.html, http://httpunit.sourceforge.net/doc/license.html, http://jung.sourceforge.net/license.txt , http://www.gzip.org/zlib/zlib_license.html, http://www.openldap.org/software/release/license.html, http://www.libssh2.org,http://slf4j.org/license.html, http://www.sente.ch/software/OpenSourceLicense.html, http://fusesource.com/downloads/license-agreements/fuse-message-broker-v-5-3-license-agreement; http://antlr.org/license.html; http://aopalliance.sourceforge.net/; http://www.bouncycastle.org/licence.html; http://www.jgraph.com/jgraphdownload.html ; http://www.jcraft.com/jsch/LICENSE.txt. http://jotm.objectweb.org/bsd_license.html; http://www.w3.org/Consortium/Legal/2002/copyright-software-20021231; http://www.slf4j.org/license.html; http://developer.apple.com/library/mac/#samplecode/HelpHook/Listings/HelpHook_java.html; http://www.jcraft.com/jsch/LICENSE.txt; http://nanoxml.sourceforge.net/orig/copyright.html; http://www.json.org/license.html; http://forge.ow2.org/projects/javaservice/; http://www.postgresql.org/about/license.html; http://www.sqlite.org/copyright.html; http://www.tcl.tk/software/tcltk/license.html; http://www.jaxen.org/faq.html; http://www.jdom.org/docs/faq.html; and http://www.slf4j.org/license.html.

Page 3: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

This product includes software licensed under the Academic Free License (http://www.opensource.org/licenses/afl-3.0.php), the Common Development and DistributionLicense (http://www.opensource.org/licenses/cddl1.php ) the Common Public License (http://www.opensource.org/licenses/cpl1.0.php ), the Sun Binary Code LicenseAgreement Supplemental License Terms, the BSD License (http://www.opensource.org/licenses/bsd-license.php), the MIT License (http://www.opensource.org/licenses/mit-license.php) and the Artistic License (http://www.opensource.org/licenses/artistic-license-1.0).

This product includes software copyright © 2003-2006 Joe WaInes, 2006-2007 XStream Committers. All rights reserved. Permissions and limitations regarding this softwareare subject to terms available at http://xstream.codehaus.org/license.html. This product includes software developed by the Indiana University Extreme! Lab. For furtherinformation please visit http://www.extreme.indiana.edu/.

This product contains runtime modules of IBM DB2 Driver for JDBC and SQLJ (c) Copyright IBM Corporation 2006 All rights reserved.

This Software is protected by U.S. Patent Numbers 5,794,246; 6,014,670; 6,016,501; 6,029,178; 6,032,158; 6,035,307; 6,044,374; 6,092,086; 6,208,990; 6,339,775;6,640,226; 6,789,096; 6,820,077; 6,823,373; 6,850,947; 6,895,471; 7,117,215; 7,162,643; 7,254,590; 7,281,001; 7,421,458; 7,496,588; 7,523,121; 7,584,422, 7,720,842;7,721,270; and 7,774,791 , international Patents and other Patents Pending.

DISCLAIMER: Informatica Corporation provides this documentation "as is" without warranty of any kind, either express or implied, including, but not limited to, the impliedwarranties of noninfringement, merchantability, or use for a particular purpose. Informatica Corporation does not warrant that this software or documentation is error free. Theinformation provided in this software or documentation may include technical inaccuracies or typographical errors. The information in this software and documentation issubject to change at any time without notice.

NOTICES

This Informatica product (the "Software") includes certain drivers (the "DataDirect Drivers") from DataDirect Technologies, an operating company of Progress SoftwareCorporation ("DataDirect") which are subject to the following terms and conditions:

1.THE DATADIRECT DRIVERS ARE PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOTLIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NON-INFRINGEMENT.

2. IN NO EVENT WILL DATADIRECT OR ITS THIRD PARTY SUPPLIERS BE LIABLE TO THE END-USER CUSTOMER FOR ANY DIRECT, INDIRECT,INCIDENTAL, SPECIAL, CONSEQUENTIAL OR OTHER DAMAGES ARISING OUT OF THE USE OF THE ODBC DRIVERS, WHETHER OR NOT INFORMED OFTHE POSSIBILITIES OF DAMAGES IN ADVANCE. THESE LIMITATIONS APPLY TO ALL CAUSES OF ACTION, INCLUDING, WITHOUT LIMITATION, BREACHOF CONTRACT, BREACH OF WARRANTY, NEGLIGENCE, STRICT LIABILITY, MISREPRESENTATION AND OTHER TORTS.

Part Number: DQA-USG-91000-0002

Page 4: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Table of Contents

Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ivInformatica Resources. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv

Informatica Customer Portal. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv

Informatica Documentation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv

Informatica Web Site. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv

Informatica How-To Library. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv

Informatica Knowledge Base. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v

Informatica Multimedia Knowledge Base. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v

Informatica Global Customer Support. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v

Chapter 1: Introduction to Informatica Data Quality Analyst. . . . . . . . . . . . . . . . . . . . . . . . . 1Informatica Data Quality Analyst Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

Informatica Analyst. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

Informatica Analyst Navigator. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

Informatica Analyst Views. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

Contents View. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

Properties View. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

Security View. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

Logging In to Informatica Analyst. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

Chapter 2: Projects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5Projects Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

Creating a Project. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

Duplicating a Project. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

Renaming a Project. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Deleting a Project. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Rules and Guidelines for Projects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Folders. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Creating a Folder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

Renaming a Folder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

Duplicating a Folder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

Moving a Folder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

Deleting a Folder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

Viewing a Project or Folder. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

Objects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

Object Properties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

Viewing an Object. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

Duplicating an Object. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

Table of Contents i

Page 5: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Renaming an Object. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

Moving an Object. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

Deleting an Object. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

Metadata Bookmarks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

Creating a Metadata Bookmark. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

Opening a Metadata Bookmark. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

Tags. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14

Creating and Assigning a Tag. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14

Viewing Tags. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14

Search. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

Search Syntax. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

Search Filters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

Search Results. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

Performing a Search. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

Importing Metadata Manager Tables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

Searching Objects Example. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

Security. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

Assigning Permissions on a Project. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

Rules and Guidelines for Security. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

Job Status. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

Monitoring Job Status. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20

Metadata Manager Business Terms. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

Managing Business Terms. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

Chapter 3: Data Objects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22Data Objects Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

Flat Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

Flat File Options. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

Flat File Datatypes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

Datetime Datatypes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

Adding a Flat File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

Rules and Guidelines for Flat Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

Tables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27

Database Connection Properties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27

Deleting a Database Connection. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

Adding a Table. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

Rules and Guidelines for Tables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29

Viewing Data Objects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29

Editing Data Objects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30

Chapter 4: Exception Record Management. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31Exception Record Management Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

Exception Management Process Flow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

ii Table of Contents

Page 6: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Reserved Column Names . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

Exception Management Tasks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

Importing a Database for Exception Management. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

Viewing and Editing Bad Records. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

Updating Bad Record Status. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

Viewing and Filtering Duplicate Record Clusters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

Editing Duplicate Record Clusters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

Consolidating Duplicate Record Clusters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

Viewing the Audit Trail. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

Chapter 5: Reference Tables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37Reference Tables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37

Reference Table Properties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

Create Reference Tables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

Creating a Reference Table Manually. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

Creating a Reference Table from Profile Columns. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

Creating a Reference Table from Column Values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40

Creating a Reference Table from Column Patterns. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41

Importing a Reference Table. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

Reference Table Management. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

Managing Columns. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

Managing Rows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

Finding and Replacing Values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

Exporting a Reference Table. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

Audit Trail Events. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

Viewing Audit Trail Events. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45

Rules and Guidelines for Reference Tables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45

Index. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46

Table of Contents iii

Page 7: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

PrefaceThe Informatica Data Quality Analyst User Guide is written for data quality analysts. It describes how to useInformatica Analyst. This guide assumes that you have an understanding of data quality concepts, flat file andrelational database concepts, and the database engines in your environment.

Informatica Resources

Informatica Customer PortalAs an Informatica customer, you can access the Informatica Customer Portal site at http://mysupport.informatica.com. The site contains product information, user group information, newsletters,access to the Informatica customer support case management system (ATLAS), the Informatica How-To Library,the Informatica Knowledge Base, the Informatica Multimedia Knowledge Base, Informatica ProductDocumentation, and access to the Informatica user community.

Informatica DocumentationThe Informatica Documentation team takes every effort to create accurate, usable documentation. If you havequestions, comments, or ideas about this documentation, contact the Informatica Documentation team throughemail at [email protected]. We will use your feedback to improve our documentation. Let usknow if we can contact you regarding your comments.

The Documentation team updates documentation as needed. To get the latest documentation for your product,navigate to Product Documentation from http://mysupport.informatica.com.

Informatica Web SiteYou can access the Informatica corporate web site at http://www.informatica.com. The site contains informationabout Informatica, its background, upcoming events, and sales offices. You will also find product and partnerinformation. The services area of the site includes important information about technical support, training andeducation, and implementation services.

Informatica How-To LibraryAs an Informatica customer, you can access the Informatica How-To Library at http://mysupport.informatica.com.The How-To Library is a collection of resources to help you learn more about Informatica products and features. Itincludes articles and interactive demonstrations that provide solutions to common problems, compare features andbehaviors, and guide you through performing specific real-world tasks.

iv

Page 8: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Informatica Knowledge BaseAs an Informatica customer, you can access the Informatica Knowledge Base at http://mysupport.informatica.com.Use the Knowledge Base to search for documented solutions to known technical issues about Informaticaproducts. You can also find answers to frequently asked questions, technical white papers, and technical tips. Ifyou have questions, comments, or ideas about the Knowledge Base, contact the Informatica Knowledge Baseteam through email at [email protected].

Informatica Multimedia Knowledge BaseAs an Informatica customer, you can access the Informatica Multimedia Knowledge Base at http://mysupport.informatica.com. The Multimedia Knowledge Base is a collection of instructional multimedia filesthat help you learn about common concepts and guide you through performing specific tasks. If you havequestions, comments, or ideas about the Multimedia Knowledge Base, contact the Informatica Knowledge Baseteam through email at [email protected].

Informatica Global Customer SupportYou can contact a Customer Support Center by telephone or through the Online Support. Online Support requiresa user name and password. You can request a user name and password at http://mysupport.informatica.com.

Use the following telephone numbers to contact Informatica Global Customer Support:

North America / South America Europe / Middle East / Africa Asia / Australia

Toll FreeBrazil: 0800 891 0202Mexico: 001 888 209 8853North America: +1 877 463 2435

Toll FreeFrance: 0805 804632Germany: 0800 5891281Italy: 800 915 985Netherlands: 0800 2300001Portugal: 800 208 360Spain: 900 813 166Switzerland: 0800 463 200United Kingdom: 0800 023 4632 Standard RateBelgium: +31 30 6022 797France: +33 1 4138 9226Germany: +49 1805 702 702Netherlands: +31 306 022 797United Kingdom: +44 1628 511445

Toll FreeAustralia: 1 800 151 830New Zealand: 09 9 128 901 Standard RateIndia: +91 80 4112 5738

Preface v

Page 9: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

vi

Page 10: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

C H A P T E R 1

Introduction to Informatica DataQuality Analyst

This chapter includes the following topics:

¨ Informatica Data Quality Analyst Overview, 1

¨ Informatica Analyst, 1

¨ Logging In to Informatica Analyst, 4

Informatica Data Quality Analyst OverviewInformatica Analyst (the Analyst tool) is a web-based application client that analysts can use to perform dataquality tasks in an enterprise. Use the Analyst tool to collaborate with data quality developers on data qualitysolutions.

The Analyst Service manages the Analyst tool. The Analyst tool uses projects to store folders and objects. TheAnalyst tool stores projects, folders, and objects in the Model repository. The Analyst tool connects to the Modelrepository database to create, update, and delete projects and objects in the Analyst tool.

You can import data objects such as tables and flat files into projects and folders. The Analyst Service managesthe connection to the directory that stores uploaded flat files that you use as flat file sources in the Analyst tool.The Analyst Service also manages the connection to a database that stores reference tables that you create orimport in the Analyst tool.

You can use the data objects to create mapping specifications to define business logic that transforms and movesdata from a source to a target.

Informatica AnalystThe Analyst tool has a web-based interface that you can use to perform data integration and data quality tasks.

The Analyst tool interface has tabs, headers, views, and a Navigator. Use the Navigator to browse projects andperform tasks on projects and folders.

When you log in to the Analyst tool, the Browse: Projects tab appears. The tab displays views and the Navigator.The tab also displays the icons and the Actions menus that you can use to perform tasks in the Navigator and inviews.

1

Page 11: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

When you click the link for an object in a view, the object opens in a tab. After you perform a search, the Analysttool displays the search results in a tab. You can keep multiple tabs open in the Analyst tool interface. You cannavigate between tabs.

The Analyst tool has the following header items:

¨ Log Off. Log out of the Analyst tool.

¨ Manage. Set user preferences to open metadata bookmarks in the Analyst tool or Developer tool and to deletedatabase connections. Monitor the status of Analyst tool jobs for objects such as profiles, scorecards,reference tables, and mapping specifications. Manage Metadata Manager business terms.

¨ Help. Access help for the current tab.

¨ Search. Search for folders in projects. Search for objects in the Model repository and the Metadata Managerrepository.

Informatica Analyst NavigatorUse the Navigator to browse projects and folders and their contents. After you log in to the Analyst tool, theNavigator appears in the left pane.

When you select a project in the Navigator, you can select a view to view the project contents, descriptiveinformation about the project, and permissions on the project.

The Navigator displays the following types of objects:

¨ Projects. Highest object in the Navigator hierarchy. It is the top-level container for all projects that you create.

¨ Folders. Child object of a project. Organize domain objects within a project in folders.

Refresh the Navigator to get the latest version of all objects in the Navigator. Multiple users can add projects andfolders that appear in the Navigator.

The Navigator has an Actions menu that you can use to perform tasks on projects and folders. You can also right-click projects and folders to perform the same tasks.

Use the Navigator to perform the following tasks:

¨ Create projects and folders.

¨ Manage projects and folders.

¨ Refresh the projects and folders that appear in the Navigator.

Informatica Analyst ViewsThe Analyst tool has views for the projects and folders that you select in the Navigator. Objects that open in tabsalso have views. Use the Actions menu or right-click objects to perform tasks related to the view. You can alsoclick icons in the view panels to perform the common tasks related to the view.

The Contents View and the Properties View are the views for the top-level Projects container. The Projectscontainer contains the projects that you create in the Navigator.

After you select a project or folder, the Analyst tool interface displays the following views:

¨ Contents view. Displays project or folder contents and properties for selected objects.

¨ Properties view. Displays project or folder properties.

¨ Security view. Displays user permissions on the project.

2 Chapter 1: Introduction to Informatica Data Quality Analyst

Page 12: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Contents ViewUse the Contents view to view project and folder information. In the Contents view, you can create and add dataobjects and profiles to projects and folders. You can perform project and folder management tasks.

After you select a project or folder in the Navigator, click the Contents view to view project or folder contents.

The Contents view displays project or folder contents in the Contents panel. When you select an object in theContents panel, the Analyst tool displays the object properties in the Properties panel.

You can perform the following tasks in the Contents view:

¨ Open an object.

¨ Duplicate projects, folders, and objects.

¨ Rename projects, folders, and objects.

¨ Move folders and objects.

¨ Delete projects, folders, and objects.

¨ Add a flat file to a project or folder.

¨ Add a relational table to a project or folder.

¨ Create a custom profile.

¨ Create a reference table.

¨ Create bad record or duplicate record tables.

¨ Close all tabs.

Properties ViewUse the Properties view to view descriptive information about the project or folder.

After you select a project or folder in the Navigator, click the Properties view to view the project or folderproperties.

In the Properties view, you can view the project or folder name and description.

Security ViewUse the Security view to view and assign project-level permissions to users.

After you select a project in the Navigator, click the Security view to view user permissions on the project. In theSecurity view, you can assign the read, write, and grant permissions to users. You can also add users and assignpermissions to them.

The Security view displays the following information in the Project-level permissions panel:

¨ User. User name for the user who is assigned permissions on the project.

¨ Security domain. Name of the security domain that the user belongs to. Security domain can be LDAP orNative.

¨ Permission. Permissions assigned to the user. Permissions can include read, write, or grant permission.

Informatica Analyst 3

Page 13: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Logging In to Informatica AnalystUse the Analyst tool URL to log in to the Analyst tool interface. When you log in to the Analyst tool, you mustspecify the user name, password, and the native domain or the LDAP security domain.

1. Start a Microsoft Internet Explorer or Mozilla Firefox browser.

2. In the Address field, enter the URL for the Analyst tool:http[s]://<host name>:<port number>/AnalystTool

3. On the login page, enter your user name and password.

4. Select Native or the name of a specific security domain.

The Security Domain field appears when the Informatica domain contains an LDAP security domain. If you donot know the security domain that your user account belongs to, contact the Informatica domain administrator.

5. Click Login.

The welcome screen appears.

6. Click Close to exit the welcome screen and access the Analyst tool.

4 Chapter 1: Introduction to Informatica Data Quality Analyst

Page 14: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

C H A P T E R 2

ProjectsThis chapter includes the following topics:

¨ Projects Overview, 5

¨ Folders, 7

¨ Viewing a Project or Folder, 9

¨ Objects, 9

¨ Metadata Bookmarks, 13

¨ Tags, 14

¨ Search, 15

¨ Security, 18

¨ Job Status, 19

¨ Metadata Manager Business Terms, 21

Projects OverviewA project is the top-level container that you use to store folders and objects in the Analyst tool. Use projects toorganize and manage the objects that you want to analyze for data quality.

Create a project based on the structure of the data for which you want to analyze data quality. For example, ananalyst needs to assess data quality on multiple systems structured by region in a country. The analyst createsprojects named East and West to correspond with data for East and West regions. The analyst can import dataobjects such as relational tables and flat files in the East and West projects.

You must create or open a project before you can work in the Analyst tool. Use the Navigator to create a project inthe Analyst tool. When you create a project, the Analyst tool stores the project in the Model repository.

You can share a project to share the project contents and collaborate with other users on the project. When youshare a project in the Analyst tool, the project also appears in the Developer tool.

A project can contain folders and objects. You can organize objects in folders.

5

Page 15: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

The following table describes the tasks you can perform on a project:

Task Description

Manage projects Manage and share project contents. You can create,duplicate, rename, and delete a project. You can view projectcontents.

Manage folders Organize project content in folders. You can create, duplicate,rename, move, and rename folders within projects.

Manage objects You can view object contents, duplicate, rename, move, anddelete objects in a project or in a folder within a project.

Search projects You can search for folders or objects in projects. You canview search results and select an object from the results toview its contents.

Assign permissions You can add users to a project. You can assign the read,write, and grant permissions to users on a project to restrict orprovide access to objects within the project.

Creating a ProjectCreate a project to store data objects and object types in the Analyst tool. You can create folders in projects. Useprojects to manage the folders and objects in the project.

1. In the Navigator, select Projects.

2. Click Actions > New Project.

The New Project window appears.

3. Enter a name for the project and an optional description.

4. Click Unshared if you do not want to share the project or Shared if you want to share the project with otherusers. Default is Unshared.

5. Click OK.

The project appears in the Navigator.

Duplicating a ProjectYou can duplicate project contents in another project that you create. Duplicate a project to use the same projectcontents to perform different project tasks. Duplicating a project does not duplicate the user permissions on theproject. The owner of the project gets all permissions by default on the duplicate project.

1. In the Navigator, select the project that you want to duplicate.

2. Click Actions > Duplicate.

The Duplicate window appears.

3. Enter the project name and an optional description.

4. Click OK.

The Analyst tool duplicates the project contents in the project.

6 Chapter 2: Projects

Page 16: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Renaming a ProjectYou can rename a project after you create it. Rename a project according to business usage and namingconvention. You may need to rename a project because the name is incorrect or the project has a different use.

1. In the Navigator, select the project that you want to rename.

2. Click Actions > Rename.

The Rename window appears.

3. Enter a name.

4. Click OK.

The Analyst tool renames the project in the Navigator.

Deleting a ProjectDelete a project when the project and its contents become redundant.

1. In the Navigator, select the project that you want to delete.

2. Click Actions > Delete.

3. In the Delete Project window, click Yes.

The Analyst tool deletes the project from the Navigator.

Rules and Guidelines for ProjectsThis section describes the rules and guidelines for working with projects.

Use the following rules and guidelines when you work with projects:

¨ You cannot move a project in the Navigator.

¨ You can move folders within a project but you cannot move a folder into one of its own child folders in a project.

¨ You cannot duplicate a project in another project with the same name.

¨ You cannot duplicate a folder within a project to another folder in a different project.

FoldersUse folders to organize project contents. You can create a folder to group objects for a particular task in a project.You can create a folder in a project or in another folder.

Create folders to group objects based on business needs. For example, a project requires data analysis for datastored in multiple relational databases across an organization. Each region has a relational database. You cancreate folders named East and West to store the project metadata for each region.

Folders appear under projects in the Navigator. A folder can contain other folders and objects.

You can perform the following tasks on a folder:

¨ Create a folder.

¨ View a folder.

¨ Rename a folder.

Folders 7

Page 17: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

¨ Duplicate a folder.

¨ Move a folder.

¨ Delete a folder.

Creating a FolderCreate a folder to store objects created in the Analyst tool.

1. In the Navigator, select the project or folder where you want to create a folder.

2. Click Actions > New Folder.

The New Folder window appears.

3. Enter the folder name and optional description.

4. Click OK.

The Analyst tool creates the folder in the Navigator.

Renaming a FolderYou can rename a folder after you create it. Rename a folder to change its name according to business usage ornaming convention.

1. In the Navigator, select the project and the folder in the project that you want to rename.

2. Click Actions > Rename

The Rename window appears.

3. Enter the folder name.

4. Click OK.

The Analyst tool renames the folder in the Navigator.

Duplicating a FolderYou can duplicate a folder within a project. Duplicate a folder to organize or enhance the contents of a folder or touse the contents of a folder to perform different tasks.

1. In the Navigator, select the project and the folder in the project that you want to duplicate.

2. Click Actions > Duplicate.

The Duplicate window appears.

3. Navigate to the location where you want to duplicate the folder.

Optionally, enter the location.

4. Enter the folder name.

5. Click OK.

The Analyst tool duplicates the folder in the project in the Navigator.

Moving a FolderYou can move a folder within a project. Move folders to organize project content into a hirearchy of folders.

8 Chapter 2: Projects

Page 18: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

1. In the Navigator, select the folder in the project you want to move.

2. Click Actions > Move.

The Move window appears.

3. Navigate to the location in the project where you want to move the folder.

Optionally, enter the location.

4. Click OK.

The Analyst tool moves the folder in the project in the Navigator.

Deleting a FolderDelete a folder if the folder contents become redundant.

1. In the Navigator, select the folder in the project you want to delete.

2. Click Actions > Delete.

The Delete Folder dialog box appears.

3. Click Yes to delete the folder.

The Analyst tool deletes the folder from the Navigator.

Viewing a Project or FolderYou can view objects in a project or folder. For each object, you can view the object name and object type on theContents view. You can view descriptive information such as project name and description about the project orfolder on the Properties view.

1. To view project or folder contents on the Contents view, select a project or folder in the Navigator and viewthe contents in the Contents panel.

The Analyst tool displays a list of all objects in the project or folder and displays the object name and objecttype for each object.

2. To view descriptive information about the project or folder on the Properties view, select a project or folderfrom the Navigator and view descriptive information in the Properties panel.

The Analyst tool displays the project name or folder name and description for the project or the folder.

ObjectsThe types of objects that you use in the Analyst tool depend on the structure of data for which you want to analyzedata quality. You can use data objects to structure the data and create object types to analyze data quality in aproject.

Data objects can include the relational tables and flat files that you import into the Analyst tool. Logical dataobjects created in Data Object Models in the Developer tool appear as logical data objects in projects shared bythe developer in the Analyst tool. These logical data object can appear as tables or flat files.

Viewing a Project or Folder 9

Page 19: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Object types include objects such as profiles, rules, scorecards, reference tables, bad record tables, and duplicaterecord tables that you create in the Analyst tool.

You can store objects in projects or folders. You can associate objects with tags. You can search for objects byname or by tag.

You can perform the following common tasks on object types and data objects:

¨ View an object.

¨ Duplicate an object.

¨ Rename an object.

¨ Move an object.

¨ Delete an object.

Note: You cannot duplicate, rename, move, or delete a logical data object.

Object PropertiesThe following table describes the objects that you can store in a project and the viewable object properties:

Object Data Object / Object Type Object Properties

Relational Tables Data Object - Name. Name of the table in themodel repository.

- Location. Location of the table inthe project or folder.

- Connection. Name of the databaserelational connection.

- Schema. Name of the databaseschema.

- Table Name. Name of therelational table source.

Flat Files Data Object - Location. Location of the flat file inthe project or folder.

- File Path. File path of the flat fileon a network drive.

- Uploaded. File path of theuploaded flat file.

- File Name. Name of the flat file.

Logical data object Data Object - Name. Name of the table in themodel repository.

- Location. Location of the table inthe project or folder.

- Data Object Model. Name of theData Object Model from which thelogical data object was created.

- Logical Data Object Name. Logicaldata object table name.

Profiles Object Type - Location. Location of the profile inthe project or folder.

- Name. Name of the profile.

Rules Object Type - Location. Location of the rule in theproject or folder.

- Name. Name of the rule.

10 Chapter 2: Projects

Page 20: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Object Data Object / Object Type Object Properties

Scorecards Object Type - Location. Location of the scorecardin the project or folder.

- Name. Name of the scorecard.

Reference Tables Object Type - Name. Name of the table in themodel repository.

- Location. Location of the table inthe project or folder.

- Connection. Name of the databaserelational connection.

- Schema. Name of the databaseschema.

- Table Name. Name of thereference table.

Bad Record Tables Object Type - Name. Name of the table in themodel repository.

- Location. Location of the table inthe project or folder.

- Connection. Name of the databaserelational connection.

- Schema. Name of the databaseschema.

- Table Name. Name of the badrecord table.

Duplicate Record Tables Object Type - Name. Name of the table in themodel repository.

- Location. Location of the table inthe project or folder.

- Connection. Name of the databaserelational connection.

- Schema. Name of the databaseschema.

- Table Name. Name of the duplicaterecord table.

Viewing an ObjectYou can view object properties for each object in a project or folder. You can open the object to preview data in atab. You can preview the contents of data objects and object types to view the structure of data and analyze dataquality results.

1. In the Navigator, select the project or folder that contains the object you want to view.

2. In the Contents panel, select the object you want to view.

The Analyst tool displays the name, type, and location of the object in the project or folder in the Propertiespanel. You can view connection name, Data Object Model name, table name, and schema name for tableobjects. Additionally, you can view the file path for flat file objects.

3. Click Actions > Open.

The Analyst tool opens the object contents for preview in a tab. You can preview column metadata for tablesand flat files and data quality results for other object types.

Objects 11

Page 21: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Duplicating an ObjectYou can duplicate objects within a project or within folders in a project to use objects for different tasks.

1. In the Navigator, select the project or folder that contains the object you want to duplicate.

2. In the Contents panel, select the object you want to duplicate.

3. Click Actions > Duplicate.

The Duplicate window appears.

4. Navigate to the location in the project where you want to duplicate the object.

Optionally, enter the location.

5. Enter the name of the object.

6. Click OK.

The Analyst tool duplicates the object to the location in the project or folder.

Renaming an ObjectRename an object to change its name according to business usage and naming convention.

1. In the Navigator, select the project or folder that contains the object you want to copy.

2. In the Contents panel, select the object you want to rename.

3. Click Actions > Rename.

The Renamewindow appears.

4. Enter the object name.

5. Click OK.

The Analyst tool renames the object with specified name.

Moving an ObjectMove an object within a project to another location in the project to organize project contents. You cannot move anobject to a target folder that is a child folder of the source folder.

1. In the Navigator, select the project that contains the object you want to move.

2. In the Contents panel, select the object you want to move.

3. Click Actions > Move.

The Move window appears.

4. Navigate to the location where you want to move the object to a folder.

Optionally, enter the location where you want to move the object to a folder.

5. Click OK.

The Analyst tool moves the object to specified location in the project or folder.

Deleting an ObjectDelete an object from a project or folder if the object becomes redundant.

1. In the Navigator, select the project or folder that contains the object you want to delete.

12 Chapter 2: Projects

Page 22: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

2. In the Contents panel, select the object you want to delete.

3. Click Actions > Delete.

4. In the Delete Object dialog box, click Yes.

The Analyst tool deletes the object from the project or folder.

Metadata BookmarksA metadata bookmark is a link to an object in a Model repository. Use a metadata bookmark to share an objectwith other Analyst tool users.

Because the Analyst tool is a web-based tool, you can access objects through a link to the object in the theAnalyst tool. Each object you view in the Analyst tool has a unique URL. You can share an object with otherAnalyst tool users by sharing the URL for the object. You can create a metadata bookmark for any object that youcan open in the Analyst tool.

The following example shows a metadata bookmark for a relational table:

http://styx:8080/AnalystTool/com.informatica.at.AnalystTool/index.jsp#p=lewis&i=U:VeP2HpstEd66x8vkMFuKtQ&c=com.informatica.metadata.relational.datasource.RelationalDataSource

To share a metadata bookmark, open the object you want to share in the Analyst tool. Copy the link location in thelocation bar. You can then send the link in an email or add it to a document. You can also bookmark the link inyour browser to access it again.

To access a metadata bookmark, you can click the link or copy and paste the link into the location bar in abrowser. When you access a metadata bookmark, the Analyst tool prompts you to log in if you are not alreadylogged in, and then displays the object.

To access a metadata bookmark, you must have the following permissions:

¨ Permission to use the the Analyst tool.

¨ Permission to access the project that contains the object.

Creating a Metadata BookmarkCreate a metadata bookmark to share an object in Informatica Analyst with other users.

1. In the Analyst tool, open the object that you want to create a metadata bookmark for.

2. Copy the URL in the location bar of the browser.

You can then paste the bookmark into an email or a document and distribute the email or document to otherusers.

Opening a Metadata BookmarkOpen a metadata bookmark to access an object in the Analyst tool.

1. Click an active link for a metadata bookmark or copy and paste the link into the location bar in a browser.

If you are not already logged in, the Analyst tool displays the login page.

2. Log in to the Analyst tool.

The Analyst tool displays the object in a tab.

Metadata Bookmarks 13

Page 23: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

TagsA tag is metadata that defines an object in the Model repository based on business usage. Create tags to groupobjects according to their business usage.

After you create a tag, you can associate the tag with one or more objects. You can remove the associationbetween a tag and an object. You can use a tag to search for objects associated with the tag in the Modelrepository. The Analyst tool displays a glossary of all tags. You can delete redundant tags.

For example, an analyst creates a tag named XYZCorp_CustomerOrders and applies it to tables that containinformation for the customers orders from the XYZ Corporation. The analyst can search by theXYZ_CustomerOrders tag to identify the tables associated with the tag.

Note: Tags associated with an object in the Analyst tool appear as tags for the same objects in the Developer tool.

Creating and Assigning a TagCreate a tag to add metadata that defines an object based on business usage. Assign the tag to an object toassociate the object with this metadata definition.

1. Click Actions > Show Tags.

2. On the Tags panel, click New.

3. Enter a name and an optional descripton.

The Analyst tool adds the tag to the glossary.

4. To assign a tag to an object, select an object in the Navigator and select the tag and click Assign.

Viewing TagsThe Analyst tool displays a glossary of tags. You can view all tags or only those tags that are assigned to objects.

Perform the following actions to view tags:

Task Action

Display tags. Click Actions > Show Tags.The Analyst tool displays aglossary of tags on the Tags panel.

View tags in the glossary by a group. Select the groups of letters, #, or the Other group.

View tags that are assigned to objects. Select Applied Tags from the top drop down menu.

View all tags. Select All Tags from the top drop down menu.

View the description of a tag. Select a tag and view the description in the bottom panel.

Hide tags. Click Actions > Hide Tags.

14 Chapter 2: Projects

Page 24: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

SearchYou can search for objects and folders in the Analyst tool. Search folders to find objects for a particular task suchas profiling data or creating business rules.

You can search for data objects, object types, and folders by name in the Analyst tool. You cannot search forprojects by name.

The Model Repository Service uses a search engine to index the metadata in the Model repository. To correctlyindex the metadata, the search engine uses a search analyzer appropriate for the language of the metadata thatyou are indexing. The Analyst tool uses the search engine to perform searches on objects in the Model repository.

You can search for objects in the Model repository by object name or by a tag. If you have Metadata Manager, youcan search for objects in the Metadata Manager repository by object name or by a Metadata Manager businessterm. You can select a Metadata Manager object from the search results and import it into the Analyst tool.

You can create a search query and filter the search results. You can view search results and select an object fromthe search results and view its contents in another tab.

You can search in different languages. To search in a different language, an administrator must change the searchanalyzer and configure the Model repository to use the search analyzer. You can change the search analyzer inthe Model Repository Service. After you change the search analyzer, you must restart the Model RepositoryService and re-index the search index. For more information about changing the search analyzer, see theInformatica Administrator Guide.

Search SyntaxUse search syntax to create a search query and filter search results.

The following table describes the search syntax you can use in a search:

Search Syntax Description

Keywords Use an exact keyword match in the search.

Cases Use upper case and lower case text in the search.

Wildcards Use wildcard characters in the search.

Logical Operators Use logical operators in the search.

Keyword MatchesUse a keyword match to search for folders and objects that match the keyword.

Enclose a search query in quotation marks (" ") to search for an exact keyword match. The Analyst tool returnsobjects with the name that matches the keyword exactly.

Note: You cannot use wildcards or special characters in a search.

Search 15

Page 25: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

WildcardsUse wildcards to define one or more characters in a search. Use wildcards as a prefix, suffix, or infix in a search.

The following table describes the wildcards you can use in a search:

Wildcard Description

* Represents characters. For example when you search forcustomer*, the Analyst tool can return customer,customer_name, and CustomerID.

? Represents a single character. For example when you searchfor Customer?, the Analyst tool can return Customer1,Customer2, and CustomerA.

OperatorsUse boolean search operators to logically combine search terms. All boolean operators must be upper case.

The following table describes the search operators you can use in a search:

Operator Description

AND Includes both search terms. For example, sales data ANDdata sales.

OR Includes either one of the search terms. For example, salesdata OR sales.

NOT Excludes a search term. For example, sales data NOT datasales. The NOT operator requires two operands.

Search FiltersApply a filter to the search query to refine search results based on an object name, tag, or a Metadata Managerbusiness term.

When you select a filter and apply it to a search query, the Analyst tool returns search results based on the filter.You can perform an advanced search to further refine the search results.

When you search for objects by name or by a tag, the Analyst tool returns search results for objects in the Modelrepository. You can perform an advanced search to search by object name, tag, or object type.

When you search for objects by a Metadata Manager business term, the Analyst tool returns search results forobjects associated with the business term in the Metadata Manager repository. You can select a MetadataManager object from the search results and import the object into the Analyst tool. You can perform an advancedsearch to search by object name, business term, or object type. You must have Metadata Manager to search forobjects by a Metadata Manager business term.

Note: If the tag name is the same as a business term name, the Analyst tool returns search results for objectsassociated with the tag and the business term from the Model repository and the Metadata Manager repository.

16 Chapter 2: Projects

Page 26: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

The following table describes the search filters that you can use to perform a search:

Search Filter Description

Search by Name Search for objects by name.

Search by Tag / Business Term Search for objects associated with a tag in the Modelrepository or objects associated with a business term in theMetadata Manager repository.

Advanced Search Search for objects in the Model repository by object name,tag, or object type. Search for objects in the MetadataManager repository by object name, business term, or objecttype.

Search ResultsView search results to get specific objects or folders in the Analyst tool or objects in the Metadata Managerrepository. You can import the Metadata Manager objects in the search results as data objects in the Analyst tool.

After you perform a search, the Analyst tool displays the search results on the Search Results tab. The Analysttool displays objects that appear in the Model repository on the Projects tab. The Analyst tool displays objectsthat appear in the Metadata Manager repository on the External Tables tab. If you do not have MetadataManager, the Analyst tool does not display any Metadata Manager objects.

Select an object from the search results to view the object properties. You can perform an advanced search tofurther refine the search results.

Performing a SearchPerform a search to search for folders in the Analyst tool or objects in the Model repository and Metadata Managerrepository.

1. In the Search header box, enter a keyword search or select a search filter.

The Analyst tool returns the results of the search in the Search Results tab.

2. To perform an advanced search, select the Projects tab or the External Tables tab.

3. In the Advanced Search panel, select a filter and select the object types.

Importing Metadata Manager TablesAfter you search for Metadata Manager objects by a business term, you can import the objects into the Modelrepository.

Before you perfom this task, verify the following prerequisite:

¨ License to access Metadata Manager.

¨ Perform a search by business term.

1. On the Search Results tab, select the External Tables tab.

2. Select a Metadata Manager table.

3. Right-click the table and select Add Import.

The New Tables window appears. Follow the steps to import the Metadata Manager table into the Analysttool.

Search 17

Page 27: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Searching Objects ExampleAn analyst wants to search for customer financial data related to bank transactions across tables in the Modelrepository and the Metadata Manager Repository. The analyst uses the business term "finance" to search fortables that contain customer financial data. The analyst performs an advanced search to filter search results todisplay tables in the Metadata Manager repository associated with the business term finance.

1. In the Search box, select the Search by Tag / Business Term filter.

2. Enter the business term named "finance".

3. Select the External Tables tab.

4. In the Advanced Search panel, select Tables as the object type.

SecurityManage permissions on projects in the Analyst tool to control access to projects. You can add users to a projectand assign permissions for users on a project.

Even if a user has the privilege to perform certain actions, the user may also require permission to perform theaction on a particular object.

When you create a project, you are the owner of the project by default. The owner has all permissions, which youcannot change. The owner can assign permissions to users.

The following table describes the permissions you can assign for users on a project.

Permission Grants users the ability to

Read Read on project, view projects and objects in projects.

Write Modify projects, create, edit, and delete objects in projects.

Grant Grants users the ability to manage the read, write, and grant permissions on a project.

Assigning Permissions on a ProjectYou can add users to a project and assign permissions on a project to restrict, provide access, or manage theobjects within the project.

1. In the Navigator, select a project, and click the Security tab.

2. In the Project-Level Permissions View on the Contents panel, click the Edit icon to edit user permissions.

The Edit Project-Level Permissions dialog box appears.

3. Select a user from the Users panel.

4. Optionally, click Add to add another user.

The Add Users dialog box appears.

5. Select a user or users that need to have access to the project.

6. Click OK.

7. Select or clear the Read, Write, or Grant permissions in the Permissions panel.

8. Click OK.

18 Chapter 2: Projects

Page 28: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Rules and Guidelines for SecurityRules and guidelines for managing security.

Use the following rules and guidelines to manage security:

¨ A user can have the privilege to create a project but can view only those projects on which the user is assignedpermissions.

¨ A user with the privilege to create a project but not the permission to view a project can create a project with aname that already exists. The Analyst tool displays a message stating that the project already exists.

¨ When you assign the permission to view a project to a user, the user needs to refresh the Navigator to view theproject. An administrator can view all projects.

¨ A user with an active Analyst tool browser session can continue to browse projects after the Read and Writepermissions for the user on the project are removed in the Developer tool.

¨ An unauthorized user can bypass Analyst tool security by copying the Analyst tool URL from the Administratortool to gain full access to the Analyst tool.

Job StatusYou can monitor the status of Analyst tool jobs for objects such as profiles, scorecards, reference tables, andmapping specifications. You can monitor data preview for all objects and monitor drill down operations on profiles.

You can monitor the status of each Analyst tool job on the Job Status tab. After you select a job, you can viewerror or information messages and the general properties for the job on the bottom panel.

You can perform the following tasks to monitor jobs:

¨ Search for a job. Enter a job property as a search filter to search for a job.

¨ Clear search filters for a job.

¨ Refresh a job.

¨ Abort a job.

¨ View Analyst tool logs events for a job.

¨ View the context of a job. View other jobs that started around the same time as the selected job.

¨ Get notifications for new jobs.

The following table descibes the job status properties:

Property Description

Job ID Identifier for the job.

Name Name of the job.

Type Job type. The Analyst tool displays the following job types:- Profile- Scorecard- Preview- Mapping- Reference Table process- Custom

Job Status 19

Page 29: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Property Description

Select Custom to filter by multiple types.

State State of the job. The Analyst tool displays the following job states:- Running- Completed- Failed- Abborted- Unknown- CustomSelect Custom to filter by multiple states.

Started By Name of the user who starts the job.

Start Time Start time of the job. The Analyst tool displays the following start times:- Last 30 minutes- Last 4 hours- Last 1 day- Last 1 week- CustomSelect Custom to enter a date and time range.

Elapsed Time Time the Analyst tool runs the job before it completes. The Analyst tool displays the following option forelapsed time:- CustomSelect Custom to enter a date and time range.

End Time Time when the job ends.The Analyst tool displays the following options for end time:- Last 30 minutes- Last 4 hours- Last 1 day- Last 1 week- CustomSelect Custom to enter a date and time range.

User SecurityDomain

Security domain for the user name. Security domain can be Native or LDAP.

Monitoring Job StatusUse the Actions menu or the icons on the Job Status tab to monitor Analyst tool jobs.

1. On the Analyst tool header, click Manage > Monitor Job Status.

The Analyst tool opens monitoring in the Job Status tab.

2. To search for a job, enter a job status property in the search fields.

3. To view other jobs that started around the same time as the selected job, click Actions > View Context.

The Analyst tool displays nformation about the jobs in the Working view.

4. To refresh the job status, click Actions > Refresh.

5. To clear search filters, click Actions > Clear Search Filters.

6. To abort a job, select a job and click Actions > Abort Selected Job.

7. To view log events for a job in a text file, select the job and click Actions > View Logs for Selected Object.

8. To get the status of new jobs without having to refresh, select New Job Notifications.

20 Chapter 2: Projects

Page 30: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Metadata Manager Business TermsYou can access Metadata Manager and the Metadata Manager Business Glossary from the Analyst tool. You canbrowse the Metadata Manager business glossary to view the business terms in a business glossary or viewbusiness terms grouped by category. You can edit Metadata Manager business terms.

You can search for Metadata Manager objects in the Metadata Manager repository by a Metadata Managerbusiness term. You can select Metadata Manager objects from the search results and import these as data objectsin the Analyst tool.

You can perform Metadata Manager tasks based on the license for Metadata Manager. You cannot add aMetadata Manager business term to the Metadata Manager business glossary.

Managing Business TermsYou can access the Metadata Manager Business Glossary from the Analyst tool to manage Metadata Managerbusiness terms.

1. On the Analyst tool header, click Manage > Manage Terms.

Metadata Manager and the Metadata Manager Business Glossary open in another tab. Metadata Managerbusiness terms appear on the Glossary view in Metadata Manager.

2. To choose a business glossary, select a glossary from the Show list.

3. To view business terms grouped by a category, click Actions > View > Categories.

4. To view all business terms in a business glossary in alphabetic order, click Actions > View > Alphabet.

5. To view all business terms that start with a specific letter, click the letter.

6. To edit a business term, select the business term and click Actions > Edit Properties.

Metadata Manager Business Terms 21

Page 31: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

C H A P T E R 3

Data ObjectsThis chapter includes the following topics:

¨ Data Objects Overview, 22

¨ Flat Files, 23

¨ Tables, 27

¨ Viewing Data Objects, 29

¨ Editing Data Objects, 30

Data Objects OverviewData objects represent the metadata sources from which you want to extract metadata to analyze in an Analysttool project. You can import data objects such as tables and flat files to analyze the structure of the data in aproject.

Data objects appear when you select the project or folder that contains the object in the Analyst tool. Any table orflat file that you add to a project in the Analyst tool also appears in the Developer tool. A table appears under thename of the connection for the table. A flat file appears with the file object name.

Logical data objects created in Data Object Models in the Developer tool appear as logical data objects in projectsshared by the developer in the Analyst tool. These appear as Logical Data Objects in the Analyst tool. You cannotrename, move, or delete logical data objects. You can view logical data objects and create profiles and scorecardsfor logical data objects.

SAP and mainframe data objects imported in the Developer tool appear as SAP and mainframe tables in projectsshared by the developer in the Analyst tool. You can view SAP and mainframe tables and create profiles andscorecards for these tables.

A developer can add parameters to relational table or flat file data objects in the Developer tool. You can view theparameterized data objects in the Analyst tool after the developer shares these data objects with the Analyst tool.You can preview data in the parameterized data objects and profile data for these objects.

Use table and flat file objects to profile source data and perform data analysis tasks. You can add data objects byimporting them into the Analyst tool. You can store data objects in projects and folders in the Navigator.

Before you can import a data object you must access the metadata source to extract the metadata that youanalyze in the data object. The Analyst tool requires a connection to the source relational table to extract metadatafor the table data object. The Analyst tool requires the network path or browse location to locate the source flat fileto extract metadata for the flat file data object.

22

Page 32: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

After you add tables and files, you can create a profile for the source data that the tables and files represent.When you run the profile, the Analyst tool connects to the database table or flat file.

You can perform the following tasks on data objects :

¨ Add data objects. Add tables and flat files by importing tables and flat files into projects and folders in theNavigator.

¨ View data objects. View object properties and column metadata for tables, flat files, and logical data objects.

¨ Duplicate data objects. Duplicate tables and flat files to use them for different tasks.

¨ Rename data objects. Rename tables and flat files according to their business usage and naming convention.

¨ Move data objects. Move tables and flat files in a project or folders in a project.

¨ Delete data objects. Delete tables and flat files when they become redundant.

Note: If you delete a data object that other object types reference, the Analyst tool displays a message thatlists the object types being referenced. Determine the impact of deleting the data object before you choose todelete it.

¨ Edit data objects. Edit tables and flat files to change the name or description while viewing tables and flat files.

Flat FilesA flat file data object contains the metadata for a flat file in the Analyst tool. Use flat files to profile source data.When you add a flat file, the Analyst tool connects to the network path location or the location where you uploadthe source flat file to extract metadata.

You can add flat files in the Analyst tool by importing the flat files into projects or folders. Before you import a flatfile, you can choose to browse a file from your local machine. The Analyst tool uploads a copy of the flat file to adirectory in the Informatica Services installation directory that the Analyst tool can access. Or, you can point theAnalyst tool to a network location. The Analyst tool uses the location you specify to access the source flat file.

You can specify the directory where you upload flat files in the flat file cache when you configure the AnalystService in the Administrator tool. For more information about specifying the flat file cache, see the InformaticaAdministrator Guide.

You can import parameterized flat files into the Analyst tool. A developer can add parameters to flat files in theDeveloper tool, or to flat files in the Analyst tool that are shared with the Developer tool. The developer cannot addparameters to uploaded flat files.

Use the Add Flat File Wizard to import flat files into the Analyst tool. To add a flat file in the Analyst tool, selectthe flat file, configure the file options, and configure the column data types. After you add the flat file, you canpreview the flat file properties and column metadata in the flat file.

Flat File OptionsWhen you import a flat file, you can configure the flat file options for each column in Add Flat File wizard. Theoptions that you configure determine how the wizard reads the data from the source flat file.

Flat Files 23

Page 33: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

The following table describes the flat file options that you configure the in the Add Flat File wizard:

Options Description

Delimiters Character used to separate columns of data. Use the Otherfield to enter a different delimiter. Delimiters must be printablecharacters and must be different from the escape characterand the quote character if selected. You cannot select non-printing multibyte characters as delimiters.

Text Qualifier Quote character that defines the boundaries of text strings.Choose No Quote, Single Quote, or Double Quotes. If youselect a quote character, the wizard ignores delimiters withinpairs of quotes.

Column Names Option to import column names from the first line. Select thisoption if column names appear in the first row. The wizarduses data in the first row in the preview for column names. Ifthe first row contains numeric characters, the wizard usesCOLUMNx as the default column name. If the first rowcontains special characters, the wizard converts the specialcharacters to underscore and uses the valid characters in thecolumn name. The wizard skips the following specialcharacters in a column name: ".+-=~`!%^&*()[]{}'\";:?,< >\\|\t\r\n. Default is not enabled.

Values Option to start value import from a line. Indicates the rownumber in the preview at which the wizard starts readingwhen it imports the file.

Flat File DatatypesWhen you import a flat file, you can configure the datatypes for the data in each column in the Add Flat Filewizard. The datatypes you configure determine how the wizard imports the data from the source flat file.

You can configure the following data types for the data in each column in the Add Flat File wizard:

¨ bigint. You can specify the format in the Numeric Format window. You can use the default or specify anothernumeric format and choose to make this the default numeric format.

¨ datetime. You can specify the format in the Datetime Format window. You can use the default or specifyanother datetime format and choose to make this the default datetime format.

¨ double. You can specify the format in the Numeric Format window. You can use the default or specify anothernumeric format and choose to make this the default numeric format.

¨ int. You can specify the format in the Numeric Format window. You can use the default or specify anothernumeric format and choose to make this the default numeric format.

¨ nstring. You cannot specify a format.

¨ number. You can specify the format in the Numeric Format window. You can use the default or specifyanother numeric format and choose to make this the default numeric format.

¨ string. You cannot specify a format.

Datetime DatatypesWhen you import a flat file, you can configure the datatypes for file columns in the Add Flat File wizard. When youconfigure the datetime dataype, you can specify the format in the Datetime Format window. You can use thedefault or specify another datetime format and choose to make this the default datetime format.

24 Chapter 3: Data Objects

Page 34: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

The following table describes the datetime format strings to specify as part of the date:

Format String Description

AM, a.m., PM, p.m. Meridian indicator. Use any of these format strings to specifyAM and PM hours. AM and PM return the same values as doa.m. and p.m.

DAY Name of day, including up to nine characters (for example,Wednesday). The DAY format string is not case sensitive.

DD Day of month (1-31).

DDD Day of year (001-366, including leap years).

DY Abbreviated three-character name for a day (for example,Wed). The DY format string is not case sensitive.

HH, HH12 Hour of day (1-12).

HH24 Hour of day (0-23), where 0 is 12AM (midnight).

J Modified Julian Day.

MI Minutes (0-59).

MM Month (01-12).

MONTH Name of month, including up to nine characters (for example,August). Case does not matter.

MON Abbreviated three-character name for a month (for example,Aug). Case does not matter.

MS Milliseconds (0-999).

NS Nanoseconds (0-999999999).

RR Four-digit year (for example, 1998, 2034). Use when sourcestrings include two-digit years.

SS Seconds (0-59).

SSSSS Seconds since midnight.

US Microseconds (0-999999).

Y The current year with the last digit of the year replaced withthe string value.

YY The current year with the last two digits of the year replacedwith the string value.

Flat Files 25

Page 35: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Format String Description

YYY The current year with the last three digits of the year replacedwith the string value.

YYYY Four digits of a year. Do not use this format string if you arepassing two-digit years. Use the RR or YY format stringinstead.

Adding a Flat FileUse the Add Flat File Wizard to add a flat file to a project or folder. You can add flat files to projects and foldersto analyze the structure of the data before you perform data quality tasks on the data.

1. In the Navigator, select the project or folder that you want to add the flat file to.

2. Click Actions > New > Flat File.

The Add Flat File wizard appears.

3. Select Browse and Upload and click Browse to select the flat file and click Upload to upload it to themachine on which Informatica Analyst runs. Or, select Enter a Network Path and configure the path and filename of the file.

4. Click Next.

If you chose to upload the file, Informatica Analyst uploads the flat file to an Informatica Services installationdirectory that Informatica Analyst can access.

5. Configure the flat file options, and preview the flat file data.

Note: Select a code page that matches the code page of the data in the file.

6. Optionally, click Show to preview changes to the flat file data.

7. Click Next.

8. Optionally, change the Column Attribute to a column name that describes the data in the source column.

9. Click Next.

10. Configure the name, description, and the location in the Folders panel where you want to add the flat file.

The Flat Files panel displays the flat files that exist in a folder.

11. Click Finish.

The Analyst tool adds the flat file to the project or folder in the Navigator.

Rules and Guidelines for Flat FilesRules and Guidelines for working with flat files.

Use the following rules and guidelines while working with flat files:

¨ Upload small files. Use the option to upload small files to an Informatica Services installation directory on themachine where the Analyst tool runs. The Analyst tool accesses this location to extract flat file metadata thatdoes not change frequently. When you use small files of sizes up to 10MB, the Analyst tool accesses a copy ofthe file in the Informatica Services installation directory. If you modify the original file, you need to upload thefile again.

¨ Upload large files. Use the option to enable the Analyst tool to connect to a network path location for largefiles. The Analyst tool accesses this location to extract flat file metadata that changes frequently. The network

26 Chapter 3: Data Objects

Page 36: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

path location should be a shared directory or file system that the Analyst tool can access. When you use largefile sizes greater than 10MB, the Analyst tool can connect to the flat file in the network path. If you modify theoriginal flat file, you must refresh the flat file in the Analyst tool. Refreshing metadata for a large flat file cantake time.

¨ Blank data rows. The Analyst tool does not import the blank rows above the first data row, blank middle rows,and blank rows after the last data row when importing a flat file.

¨ Previewing data. After a preview, you can change the row number at which the Add Flat File wizard startsreading when it imports the file. This row number corresponds with the preview. If you choose to import columnnames from the first line, refresh the preview to update the row numbers for the preview data.

TablesA table object contains the metadata for a relational database source in the Analyst tool. Use tables to profilesource data. When you add a table, the Analyst tool uses a database connection to connect to the sourcedatabase to extract metadata.

You can add tables in the Analyst tool by importing the tables into projects or folders. Before you import a table,you select or create a database connection, and select the database table that you want to add. You can addmultiple tables from a connection as data objects.

Use the Add Table Wizard to add a table to the project or folder.

You can use the database connection that the staging database uses to select a table. You can create anotherdatabase connection to connect to the source table when you import a table. An administrator creates theconnection to the staging database that stores reference tables for the Analyst tool before configuring the AnalystService. For more information about configuring the staging database, see the Informatica Adminstrator Guide.

Database Connection PropertiesYou can use the database connection that the staging database uses to select a table. When you import a table,you can create another database connection to connect to the source relational table. You can delete redundantdatabase connections.

The following table describes the database connection options that you can configure for a database connection:

Option Description

Name Name of the connection. Connection names cannot havespaces and cannot be longer than 128 characters.

Description Description of the connection.

Database Type Type of relational database. You can select an Oracle,Microsoft SQL Server, or IBM DB2 database.

User Name User name used for authentication when you connect to therelational database.

Password Password for the database user name.

Data Access Connect String Connection string used to access data from the database.IBM DB2: <database name>

Tables 27

Page 37: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Option Description

Microsoft SQL Server: <server name>@<database name>ODBC: <data source name>Oracle: <database name listed in TNSNAMES entry>

Metadata Access Connect String JDBC connection URL used to access metadata from thedatabase.IBM DB2: jdbc:informatica:db2://<hostname>:<port>;DatabaseName=<database name>ODBC: n/aOracle: jdbc:informatica:oracle://<host_name>:<port>;SID=<database name>Microsoft SQL Server: jdbc:informatica:sqlserver://<hostname>:<port>;DatabaseName=<database name>

Code page Code page use to read from a source database or write to atarget database or file.

Deleting a Database ConnectionDelete database connections that become redundant. You must have the Write permission on the databaseconnection to delete the connection.

1. In the Informatica Analyst header, click Manage > Delete Connection.

The Delete Connection window appears.

2. Click Delete.

3. Click Close.

Adding a TableUse the Add Tables Wizard to add a table to a project. Add the tables that you want to profile data for. To add atable, select or create a connection, select the schema and tables, and add the table.

1. In the Navigator, select the project or folder that you want to add the table to.

2. Click Actions > New > Table.

The Add Tables wizard appears.

3. Select a connection.

4. Optionally, click New Connection to create and configure a connection.

In the New Connection window, optionally grant users the Execute permission on the connection. TheExecute permission enables users to preview data in profiles and scorecards and run profiles and scorecardscreated with the connection. Click OK.

5. Click Next.

6. Optionally, unselect Show Default Schema Only to show all schemas associated with the selectedconnection.

7. Select the table that you want to add or enter a table name in the search box and click Go to search by tablename. Click Clear to remove the search results and display all tables.

8. Optionally, click the Properties View to view the properties and column metadata for the table. Or, click thethe Data Preview View to view the columns and data for the table.

28 Chapter 3: Data Objects

Page 38: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

9. Click Next.

The wizard displays the table to add to your folder or project.

10. Click Finish.

Rules and Guidelines for TablesRules and guidelines for working with table data objects.

Use the following rules and guidelines while working with tables:

¨ The Analyst tool displays the first 100 rows by default when you preview the data for a table. The Analyst toolmay not display all the data columns in a wide table.

¨ The Analyst tool can import wide tables with more than 30 columns for profiling data. When you import a widetable, the Analyst tool does not display all the columns in the data preview. The Analyst tool displays the first30 columns in the data preview. However, you can include all the columns in the wide tables and flat files forprofiling.

¨ You can import tables and columns with lowercase and mixed-case characters.

¨ You can import tables that have special characters in the table or column name. When you import a table thathas special characters in the table or column name, the Analyst tool converts the special character to anunderscore character in the table or column name. You can use the following special characters in table orcolumn names: "$.+-=~`!%^&*()[]{}'\";:/?,< >\\|\t\r\n

¨ You can import tables and columns with Microsoft SQL92 or Microsoft SQL99 reserved words such as "concat"into the Analyst tool.

¨ You can use an ODBC connection to import Micorsoft SQL Server, MySQL, Teradata, and Sybase tables in theAnalyst tool. The OBDC connection requires a user name and password.

¨ When you use a Microsoft SQL Server connection to access tables in a Microsoft SQL Server database, theAnalyst tool does not display the synonyms for the tables.

¨ When you preview relational table data from an Oracle, IBM DB2, IBM DB2/ zOS, IBM DB2/iOS, Microsoft SQLServer, and ODBC database, the Analyst tool cannot display the preview if the table, view, schema, synonym,and column names contain mixed case or lower case characters. To preview data in tables that reside in casesensitive databases, set the Support Mixed Case Identifiers attribute to true in the connections for Oracle, IBMDB2, IBM DB2/zOS, IBM DB2/iOS, Microsoft SQL Server, and ODBC databases in the Developer tool orAdministrator tool.

¨ You can view comments for the source database table after you import the table into the Analyst tool. To viewsource table comments, use an additional parameter in the JDBC connection URL used to access metadatafrom the database. In the Metadata Access String option in the database connection properties, useCatalogOptions=1 or CatalogOptions=3. For example, use the following JDBC connection URL for an Oracledatabase connection: Oracle: jdbc:informatica:oracle:// <host_name>:<port>;SID=<databasename>;CatalogOptions=1

Viewing Data ObjectsView data objects to preview data object properties and column metadata.

1. In the Navigator, select the project or folder that contains the table or file data object.

2. Click Actions > Open to open the data object.

Viewing Data Objects 29

Page 39: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

The data object appears in a new tab. The Analyst tool retrieves the first 100 rows for the data object anddisplays it on the Data Preview view.

3. Optionally, select Columns and select a column to include in the preview. Default includes all columns forpreview.

If you choose not to include a column, the Analyst tool refreshes the preview and does not include the columnin the preview.

4. Optionally, click the Properties view to view the table and file properties in the Properties panel.

The Analyst tool displays the table name, description, location, connection name, and database schema namefor the table data object. The Analyst tool displays the file name, location, upload file path, or network path forthe flat file data object.

5. Optionally, view the column metadata for each column in the Columns panel.

You can view column name and datatype for each column in the table or flat file. You can view if the column isnullable and the key for each column in the table. Nullable and key properties are relational databaseproperties.

6. Optionally, click the Refresh button to refresh the metadata for the data object.

Editing Data ObjectsYou can edit the name and description properties of tables and flat files while viewing the tables and flat files.

1. In the Navigator, select the project or folder that contains the table of flat file data object you want to edit.

2. Click Actions > Open to open the data object.

The data object opens in a new tab.

3. Click the Properties view to view the table or flat file properties in the Properties panel.

4. Click Actions > Edit to edit the data object.

The Edit window appears.

5. Enter a name and an optional description.

6. Click OK.

30 Chapter 3: Data Objects

Page 40: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

C H A P T E R 4

Exception Record ManagementThis chapter includes the following topics:

¨ Exception Record Management Overview, 31

¨ Exception Management Tasks, 33

Exception Record Management OverviewAn exception is a record that does not belong in the data set in its current form. The record may contain errors, orit may be an unintended duplicate of another record.

You can use the Analyst tool to review the following exception types:

Bad records

Edit records, delete records, tag them to be reprocessed by a mapping, or profile them to analyze the qualityof changes made to the records.

Duplicate records

Consolidate clusters of similar records to a single master record. You can consolidate or remove duplicaterecords, extract records to form new clusters, and profile duplicate records.

When you use the Analyst tool to import an exception table, you choose to create either a bad record table or aduplicate record table.

You manage exceptions in tables that you import to the Informatica staging database. After you edit data in theAnalyst tool, you can write the data back to the source database.

You cannot use the Analyst tool to manage exceptions in file data.

Exception Management Process FlowTo perform exception management for bad records or duplicate records, use the Developer tool and the Analysttool.

Use the Developer tool to perform the following tasks:

Define an exception mapping

Create a mapping to identify exceptions. Add a data source that you want to analyze for exceptions, and addan Exception transformation.

Mappings that generate duplicate record exceptions require a score input. Use a Match transformation incluster mode to create scores for duplicate record exceptions.

31

Page 41: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Mappings that generate bad record exceptions do not require a score. If no score is present in a bad recordmapping, the Exception transformation writes all records with quality issues to the exception table. You canuse a Decision transformation to create numerical scores for bad record mappings.

Define an exception table

Configure the Exception transformation to connect to a database where you want to store exception records.

Add a data object for good records or automatic consolidation records

Connect the Exception transformation output ports to a data flow that connects to a data object. Exceptiontransformations that generate bad record exceptions write good records to the data object. Exceptiontransformations that generate duplicate record exceptions write automatic consolidation records to the dataobject.

Run the exception mapping

Run the mapping to process exceptions. The Data Integration Service creates an exception table in thestaging database using the name you specify in the Exception transformation. The Exception transformationwrites exception records to this table.

Use the Analyst tool to perform the following tasks:

Import the exception table into the Model repository

Import the exception table into the Model repository as a data quality table. When you import the table,choose to create a bad record table or a duplicate record table based on the type of Exception transformationthat created the table.

Review and edit exceptions

Review the exception table in the Analyst tool. Filter the exception records by quality issue and priority.

Reserved Column NamesWhen you create a bad record or consolidation table, the Analyst tool generates columns for use in its internaltables. Do not import tables that use these names. If an imported table contains a column with the same name asone of the generated columns, the Analyst tool will not process it.

Reserve the following column names for bad record or consolidation tables:

¨ checkStatus

¨ rowIdentifier

¨ acceptChanges

¨ recordGroup

¨ masterRecord

¨ matchScore

¨ any name beginning with DQA_

32 Chapter 4: Exception Record Management

Page 42: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Exception Management TasksYou can perform the following exception management tasks in the Analyst tool:

Import database tables

Configure a connection to a database and import the tables you need.

Manage bad records

Identify problem records and fix data quality issues.

Consolidate duplicate records

Merge groups of duplicate records into a single record.

View the audit trail

Review the changes made in the bad or duplicate record tables before writing the changes to the sourcedatabase.

Importing a Database for Exception ManagementComplete these steps to import a table that contains bad or duplicate records to the staging database:

1. Log in to the Analyst tool.

2. Select Actions > New Data Quality Table.

The Import DQA Table wizard opens.

3. Specify the type of data in the tables: bad records or duplicate records.

4. Select or create a database connection.

5. Select a table from the tables available on the connection.

6. Click Finish.

If you create a database connection in the Import DQA Table wizard, provide the following information:

¨ A name for the connection.

¨ A text description of the database connection. (Optional)

¨ The database type.

¨ A valid username and password to connect to the database.

¨ A connection string for data access.

¨ A connection string for metadata access.

¨ The codepage for the data.

¨ Execute permissions on the connection. Select Grant or Deny.

Viewing and Editing Bad RecordsComplete these steps to view and edit bad records:

1. Log in to the Analyst tool.

2. Select a project.

3. Select a bad records table.

4. Optionally, use the Quality Issue menu to filter the table records.

Exception Management Tasks 33

Page 43: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

5. Optionally, use the Column menu to filter results by column.

You cannot select a Column until you select a Quality Issue.

6. Optionally, filter the results by value.

Use the Filter option to display records that contain the value you specify in a column you select. Leave thefilter string blank to search for NULL values.

7. Click Go to view the records matching the filter criteria.

Press the Tab key to move from one record to the record below it.

8. Select the option at the end of the row to save changes. To discard changes, click the delete button at theend of the record row.

Saving changes to a record is the first step in processing the record in the Analyst tool. After you save changes toa record, you must update the record status to accepted, reprocess, or rejected.

Updating Bad Record StatusFor each record that does not require further editing, perform one of the following actions:

Select one or more records by clicking the check box next to each record. Select all the records in the table byclicking the check box at the top of the first column.

Note: The Analyst tool does not display records that you have taken action on.

¨ Click Accept.

Indicates that the record is acceptable for use.

¨ Click Reject.

Indicates that the record is not acceptable for use.

¨ Click Reprocess.

Selects the record for reprocessing by a data quality mapping. Select this option when you are unsure if therecord is valid. Rerun the mapping with an updated business rule to recheck the record.

Viewing and Filtering Duplicate Record ClustersComplete these steps to view and filter duplicate clusters:

1. Select a staging database table from the Table drop-down menu.

2. Click Go to view the clusters in the selected table.

3. The Record Consolidation tab returns a numbered list, with each cluster represented by a number in the list.

Click a cluster number to open the page for that cluster.

4. On the Duplicate Records tab, click Filter to search the clusters returned for records that contain a givenvalue.

5. Select a column and enter a filter string. Leave the filter string blank to search for NULL values.

6. Click OK.

The Analyst tool returns a list of clusters where at least one record contains the specified data value.

Editing Duplicate Record ClustersEdit clusters to change how the Analyst tool consolidates potential duplicate records.

You can edit clusters in the following ways:

34 Chapter 4: Exception Record Management

Page 44: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

To remove a record from a cluster:

Clear the selection in the Cluster column to remove the record from the cluster. When you delete a recordfrom a cluster, the record assumes a unique cluster ID.

To create a new cluster from records in the current cluster:

Select a subset of records and click the Extract Cluster button. This action creates a new cluster ID for theselected records.

To edit the record:

Select a record field to edit the data in that field.

To select the fields that populate the master record:

Click the selection arrow in a field to add its value to the corresponding field in the Final Record row. Anarrow indicates that the field provides data for the master record.

To specify a master record:

Click a cell in the Master column for a row to select that row as the master record.

Consolidating Duplicate Record ClustersWhen you have processed a cluster, complete this step to consolidate the cluster records to a single record in thestaging database.

u In the cluster you processed, click the Consolidate Cluster button.

The Analyst tool performs the following updates on cluster records:

¨ In the staging database, the Analyst tool updates the master record with the contents of the Final record andsets the status to Updated.

¨ The Analyst tool sets the status of the other selected records to Consolidated.

¨ The Analyst tool sets the status of any cleared record to Reprocess.

Viewing the Audit TrailThe Analyst tool tracks changes to the staging database in an audit trail. Use the audit trail to review the status ofthe records that have passed through the record management and consolidation processes.

Complete these steps to view audit trail records:

1. Select the Audit Trail tab.

2. Set the filter options.

You can filter by time period, staging table, user, and record status.

3. Click Go.

The following table describes record statuses for the audit trail.

Record Status Description

Updated Edited during bad record processing, or selected as theMaster record during consolidation.

Consolidated Consolidated to a master record during consolidation.

Rejected Rejected during bad record processing.

Exception Management Tasks 35

Page 45: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Record Status Description

Accepted Accepted during bad record processing.

Reprocess Marked for reprocessing during bad record processing.

Rematch Removed from a cluster during consolidation.

Extracted Extracted from a cluster into a new cluster duringconsolidation.

36 Chapter 4: Exception Record Management

Page 46: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

C H A P T E R 5

Reference TablesThis chapter includes the following topics:

¨ Reference Tables, 37

¨ Reference Table Properties, 38

¨ Create Reference Tables, 39

¨ Reference Table Management, 42

¨ Audit Trail Events, 44

¨ Rules and Guidelines for Reference Tables, 45

Reference TablesA reference table contains reference data that you can use to standardize source data. Reference data caninclude valid and standard values.

Create reference tables to establish relationships between source data values and the valid and standard values.You can share reference data with a developer for use in Standardizer and Lookup transformations in theDeveloper tool.

For example, during a data quality project, you create a reference table that contains the list of valid values for anaddress column in source data. A developer can use the reference data in the Developer tool to create aStandardizer transformation in a mapplet or mapping and standardize on the valid values for the address.

When you create reference tables in the Analyst tool, a developer can view these tables in the the Developer tool.A developer can open a reference table to view the contents of the reference table and use them in Lookup andStandardizer transformations. A developer can also launch the Analyst tool from the Developer tool to edit thereference table.

To create a reference table, you can create the table manually, create the table from a profile column, or import areference table. You can also create a reference table from the column values and pattern values in a profilecolumn.

After you create a reference table, you can edit the reference table to add column or rows and add or edit standardand valid values. You can also search and replace values in the reference table rows. You create and managereference tables on the Reference Table view. The Analyst tool tracks editing activities in the audit trail log. Youcan view the audit trail events to see the changes made to a reference table on the Audit Trail view. You can viewproperties for the reference table in the Properties view.

37

Page 47: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Reference Table PropertiesWhen you create reference tables manually or from profile columns, configure column properties for each columnyou include in the reference table. When you import a reference table from a flat file, configure the flat fileproperties for the delimited flat file.

You can configure the following column properties for each column in a reference table:

Property Description

Valid Appears when you create a reference table manually orimport it as a flat file. Table record contains a valid value touse in a Lookup or Standardizer transformation in theDeveloper tool.

Name Name of the column.

Data Type Datatype for the column. You can choose one of the followingdatatypes:- bigint- date/time- decimal- double- integer- stringThe values you can configure for precision and scale dependon the datatype you choose.

Precision Precision for the column. Precision is the maximum number ofdigits or the maximum number of characters that the columncan accomodate.

Scale Scale for the column. Scale is the maximum number of digitsthat a column can accommodate to the right of the decimalpoint. Applicable for decimal columns.

Description Description for the column.

You can configure the following flat file properties when you import a reference table from a delimited flat file:

Property Description

Delimiters Character used to separate columns of data. Use the Otherfield to enter a different delimiter. Delimiters must be printablecharacters and must be different from the escape characterand the quote character if selected. Default is comma.

Text Qualifier Quote character that defines the boundaries of text strings.Choose No Quote, Single Quote, or Double Quotes. If youselect a quote character, the wizard ignores delimiters withinpairs of quotes. Default is Double Quotes.

Column Names Use data in the first row for column names. Select this optionif column names appear in the first row.

Values Indicates the row number at which the wizard starts readingwhen it imports the file.

38 Chapter 5: Reference Tables

Page 48: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Create Reference TablesUse the reference table editor, profile results, or a flat file to create reference tables. Create reference tables toshare reference data with developers in the Developer tool.

Use the following methods to create a reference table:

¨ Create a reference table manually. Use the reference table editor to create a reference table, add columns,and configure attributes.

¨ Create a reference table from profile columns. Select a column in a profile and add it to a reference table orcreate a reference table to add the column. Select a column in a profile and select the column values to add toa reference table or create a reference table to add the column values. Select a column in the profile and selectthe pattern values to add to a reference table or create a reference table to add the pattern values.

¨ Import a reference table. Import a reference table from a delimited flat file.

Creating a Reference Table ManuallyUse the New Reference Table Wizard and the reference table editor to create a reference table manually. Youcan use the reference table editor to define the structure, columns, and data for the table.

1. In the Navigator, select the project and folder where you want to create the reference table.

2. Click Actions > New Reference Table.

The New Reference Table Wizard appears.

3. Select the option to Use the reference table editor.

4. Click Next.

5. Enter the table name and optional description and default value.

The Analyst tool uses the default value for any table record that does not contain a value.

6. For each column you want to include in the reference table, click the Add New Column icon and configurethe column properties for each column.

Note: You can reorder the columns or delete columns.

7. Optionally, choose to create a description column for rows in the reference table. Configure the name andprecision for the column.

8. Optionally, enter an audit note.

The audit note appears in the audit trail log.

9. Click Finish.

Creating a Reference Table from Profile ColumnsYou can create a reference table from a profile column. You can add a profile column to an existing referencetable. The New Reference Table Wizard adds the column to the reference table.

1. In the Navigator, select the project or folder that contains the profile with the column that you want to add to areference table.

2. Click the profile name to open it in another tab.

3. In the Column Profiling view, select the column that you want to add to a reference table.

4. Click Actions > Add to Reference Table.

The New Reference Table Wizard appears.

Create Reference Tables 39

Page 49: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

5. Select the option to Create a new reference table.

Optionally, select Add to existing reference table, and click Next. Navigate to the reference table in theproject or folder, preview the reference table data and click Next. Select the column to add and click Finish.

6. Click Next.

7. The column name appears by default as the table name. Optionally enter another table name, a description,and default value.

The Analyst tool uses the default value for any table record that does not contain a value.

8. Click Next.

9. In the Column Attributes panel, configure the column properties for the column.

10. Optionally, choose to create a description column for rows in the reference table.

Enter the name and precision for the column.

11. Preview the column values in the Preview panel.

12. Click Next.

13. The column name appears as the table name by default. Optionally, enter another table name and adescription.

14. In the Save in panel, select the location where you want to create the reference table.

The Reference Tables: panel lists the reference tables in the location you select.

15. Optionally, enter an audit note.

16. Click Finish.

Creating a Reference Table from Column ValuesYou can create a reference table from the column values in a profile column. Select a column in a profile andselect the column values to add to a reference table or create a reference table to add the column values.

1. In the Navigator, select the project or folder that contains the profile with the column that you want to add to areference table.

2. Click the profile name to open it in another tab.

3. In the Column Profiling view, select the column that you want to add to a reference table.

4. In the Values view, select the column values you want to add. Use the CONTROL or SHIFT keys to selectmultiple values.

5. Click Actions > Add to Reference Table.

The New Reference Table Wizard appears.

6. Select the option to Create a new reference table.

Optionally, select Add to existing reference table, and click Next. Navigate to the reference table in theproject or folder, preview the reference table data and click Next. Select the column to add and click Finish.

7. Click Next.

8. The column name appears by default as the table name. Optionally enter another table name, a description,and default value.

The Analyst tool uses the default value for any table record that does not contain a value.

9. Click Next.

10. In the Column Attributes panel, configure the column properties for the column.

11. Optionally, choose to create a description column for rows in the reference table.

40 Chapter 5: Reference Tables

Page 50: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Enter the name and precision for the column.

12. Preview the column values in the Preview panel.

13. Click Next.

14. The column name appears as the table name by default. Optionally, enter another table name and adescription.

15. In the Save in panel, select the location where you want to create the reference table.

The Reference Tables: panel lists the reference tables in the location you select.

16. Optionally, enter an audit note.

17. Click Finish.

Creating a Reference Table from Column PatternsYou can create a reference table from the column patterns in a profile column. Select a column in the profile andselect the pattern values to add to a reference table or create a reference table to add the pattern values.

1. In the Navigator, select the project or folder that contains the profile with the column that you want to add to areference table.

2. Click the profile name to open it in another tab.

3. In the Column Profiling view, select the column that you want to add to a reference table.

4. In the Patterns view, select the column patterns you want to add. Use the CONTROL or SHIFT keys to selectmultiple values

5. Click Actions > Add to Reference Table.

The New Reference Table Wizard appears.

6. Select the option to Create a new reference table.

Optionally, select Add to existing reference table, and click Next. Navigate to the reference table in theproject or folder, preview the reference table data and click Next. Select the column to add and click Finish.

7. Click Next.

8. The column name appears by default as the table name. Optionally enter another table name, a description,and default value.

The Analyst tool uses the default value for any table record that does not contain a value.

9. Click Next.

10. In the Column Attributes panel, configure the column properties for the column.

11. Optionally, choose to create a description column for rows in the reference table.

Enter the name and precision for the column.

12. Preview the column values in the Preview panel.

13. Click Next.

14. The column name appears as the table name by default. Optionally, enter another table name and adescription.

15. In the Save in panel, select the location where you want to create the reference table.

The Reference Tables: panel lists the reference tables in the location you select.

16. Optionally, enter an audit note.

17. Click Finish

Create Reference Tables 41

Page 51: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Importing a Reference TableImport reference table data from a delimited flat file.

1. In the Navigator, select the project or folder where you want to create the reference table.

2. Click Actions > New Reference Table.

The New Reference Table Wizard appears.

3. Select the option to Import a flat file.

4. Click Next.

5. Click Browse to select the flat file.

6. Click Upload to upload the file to a directory in the Informatica Services installation directory that the Analysttool can access.

7. Enter the table name, and optional description and default value.

The Analyst tool uses the default value for any table record that does not contain a value.

8. Select a code page that matches the data in the flat file.

9. Preview the data in the Preview of file panel.

10. Click Next.

11. Configure the flat file properties.

12. In the Preview panel, click Show to update the preview.

13. Click Next.

14. On the Column Attributes panel, configure the column properties for each column.

15. Optionally, choose to create a description column for rows in the reference table. Enter the name andprecision for the column.

16. Click Finish.

Reference Table ManagementYou can perform tasks to manage reference tables. You can find and replace column values, add or removecolumns and rows, edit column values, and export a reference table to a file.

You can perform the following tasks to manage reference tables:

¨ Manage columns. Use the Edit column properties window to add, edit, or delete columns in a referencetable.

¨ Manage rows. Use the Add Rows window to add rows and the Edit Row window to edit rows in a referencetable. Use the Delete icon to delete rows in a reference table.

¨ Find and replace values. You can find and replace values in individual reference table columns. You can finda value in a column and replace it with another value. You can replace all values in columns with another value.

¨ Export a reference table. Export a reference table to a comma-separated values (CSV) file, dictionary file, orExcel file.

Managing ColumnsUse the Edit column properties window to add, edit, or delete columns in a reference table.

42 Chapter 5: Reference Tables

Page 52: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

1. In the Navigator, select the project or folder that contains the reference table that you want to edit.

2. Click the reference table name to open it in a tab. The Reference Table tab appears.

3. Click Actions > Edit Table or click the Edit Table icon.

The Edit column properties window appears.

4. To add a column, click the Add New Column icon in the Column Attributes panel and edit the columnproperties. Or, to edit an existing column, click the property you want to edit.

You cannot edit the datatype, precision, and scale of the column. You can rename the column and change thecolumn description.

5. To delete a column, click the column and click the Delete icon.

6. Optionally, you can enter an audit note on the Audit Note panel. The audit note appears in the audit log forany action you perform in the Edit column properties window.

7. Click OK.

Managing RowsYou can add, edit, or delete rows in a reference table.

1. In the Navigator, select the project or folder containing the reference table that you want to edit.

2. Click the reference table name to open it in a tab. The Reference Table tab appears.

3. To add a row, click Actions > Add Row or click the Add Row icon. In the Add Row window, enter the valuefor each column and enter an optional audit note. Click OK.

4. To edit rows, select the rows and click Actions > Edit or click the Edit icon. In the Edit Rows window, enterthe value for each column, select the columns to apply the changes to, and enter an optional audit note.Optionally, click Previous to edit the previous row and click Next to edit the next row. Click Apply to applythe changes.

The new column values appear in the tab.

5. To delete rows, select the rows you want to delete and click Actions > Delete or click the Delete icon. In theDelete Rows window, enter an optional audit note and click OK.

Finding and Replacing ValuesYou can find and replace values in individual reference table columns.

1. In the Navigator, select the project or folder containing the reference table that you want to find and replacevalues in.

2. Click the reference table name to open it in a tab. The Reference Table tab appears.

3. Click Actions > Find and Replace or click the Find and Replace icon.

The Find and Replace toolbar appears.

4. Enter the search criteria in the Find box. Select all columns or a column that you want to find in the list. Enterthe value you want to replace with, and click one of the following buttons:

Option Description

Next/Previous Scroll through the column values that match the search criteria.

Highlight All Highlight all the column values that match the search criteria.

Reference Table Management 43

Page 53: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Option Description

Replace Replace the currently highlighted column value.

Replace All Replace all occurrences of the search criteria in column values.

Exporting a Reference TableExport a reference table to a comma-seperated values (CSV) file, dictionary file, or Microsoft Excel file.

1. In the Navigator, select the project or folder containing the reference table that you want to view the audit trailfor.

2. Click the reference table name to open it in a tab. The Reference Table tab appears.

3. Click Actions > Export Data.

The Export data to a file window appears.

4. Configure the following options:

Option Description

File Name File name for the exported data.

File Format Format of the exported file. You can select the following formats:

¨ csv. Comma-separated values file.¨ xls. Microsoft Excel file.¨ dic. Dictionary file.

Optionally, select Export field names as first row to export the column names as a header rowin the exported file.

Code Page Code page of the reference data.

5. Click OK.

The options to save or open the file depend on your browser.

Audit Trail EventsUse the Audit Trail view for a reference table to view audit trail log events.

The Analyst tool creates audit trail log events when you make a change to a reference table and enter an audittrail note. Audit trail log events provide information about the reference tables that you manage.

44 Chapter 5: Reference Tables

Page 54: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

You can configure query options on the Audit Trail tab to filter the log events that you view. You can specify filterson the date range, type, user name, and status. The following table describes the options you configure when youview audit trail log events:

Option Description

Date Start and end dates for the log events to search for. Use the calender to choose dates.

Type Type of audit trail events. You can filter and view the following events types:- Data. Events related to data in the reference table. Events include creating, editing, deleting,

and replacing all rows.- Metadata. Events related to reference table metadata. Events include creating reference

tables, adding, deleting, and editing columns, and updating valid columns.

User User who edited the reference table and entered the audit trail comment. The Analyst toolgenerates the list of users from the Analyst tool users configured in the Administrator tool.

Status Status of the audit trail log events. Status corresponds to the action performed in the referencetable editor.

Audit trail log events also include the audit trail comments and the column values that were inserted, updated, ordeleted.

Viewing Audit Trail EventsView audit trail log events to get more information about changes made to a reference table.

1. In the Navigator, select the project or folder that contains the reference table that you want to view the audittrail for.

2. Click the reference table name to open it in a tab. The Reference Table tab appears.

3. Click the Audit Trail view.

4. Configure the filter options.

5. Click Show.

The log events for the specified query options appear.

Rules and Guidelines for Reference TablesUse the following rules and guidelines while working with reference tables:

¨ When you import a reference table from an Oracle, IBM DB2, IBM DB2/ zOS, IBM DB2/iOS, Microsoft SQLServer, and ODBC database, the Analyst tool cannot display the preview if the table, view, schema, synonym,and column names contain mixed case or lower case characters. To preview data in tables that reside in casesensitive databases, set the Support Mixed Case Identifiers attribute to true in the connections for Oracle, IBMDB2, IBM DB2/zOS, IBM DB2/iOS, Microsoft SQL Server, and ODBC databases in the Developer tool orAdministrator tool.

¨ When you create a reference table from inferred column patterns in one format, the Analyst tool populates thereference table with column patterns in a different format. For example, when you create a reference table forthe column pattern X(5), the Analyst tool displays the following format for the column pattern in the referencetable: XXXXX.

Rules and Guidelines for Reference Tables 45

Page 55: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

I N D E X

Aadding a flat file

flat files 26adding a table

tables 28assign permissions

projects 5assigning a tag

tags 14assigning permissions

projects 18

Ccolumn properties

reference tables 38contents view

Informatica Analyst interface 3creating a folder

folders 8creating a metadata bookmark

metadata bookmark 13creating a project

projects 6creating a reference table from column patterns

reference tables 41creating a reference table from column values

reference tables 40creating a reference table from profile columns

reference tables 39creating a reference table manually

reference tables 39creating a tag

tags 14

Ddata objects

deleting an object 22flat files 22renaming an object 22tables 22viewing an object 22

database connection propertiestables 27

datetime datatypesflat files 25

deleting a database connectiontables 28

deleting a folderfolders 9

deleting a Projectprojects 7

deleting an objectobjects 12

duplicating a folderfolders 8

duplicating a projectprojects 6

duplicating an objectobjects 12

Eediting data objects

objects 30exporting a reference table

reference tables 44

Ffinding and replacing valyes

reference tables 43flat file datatypes

flat files 24flat file options

flat files 24flat file properties

reference tables 38flat files

adding a flat file 26data objects 22datetime datatypes 25flat file datatypes 24flat file options 24viewing data objects 29

folderscreating a folder 8deleting a folder 9duplicating a folder 8moving a folder 9renaming a folder 8viewing folders contents 9

Hhiding tags

tags 14

Iimporting a reference table

reference tables 42importing Metadata Manager tables

search results 17

46

Page 56: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

Informatica AnalystInformatica Analyst interface 1Navigator 2

Informatica Analyst interfacecontents view 3Informatica Analyst 1Informatica Analyst views 2log in 4properties view 3security view 3

Informatica Analyst viewsInformatica Analyst interface 2

Jjob status

monitoring job status 20projects 19

Llog in

Informatica Analyst interface 4

Mmanage folders

projects 5manage projects

projects 5manage security

projects 18managing business term

Metadata Manager business term 21managing columns

reference tables 43managing rows

reference tables 43metadata bookmark

creating a metadata bookmark 13opening a metadata bookmark 13

Metadata Manager business termmanaging business term 21projects 21

monitoring job statusjob status 20

moving a folderfolders 9

moving an objectobjects 12

NNavigator

Informatica Analyst 2

Oobjects

deleting an object 12duplicating an object 12editing data objects 30moving an object 12previewing data 11

renaming an object 12viewing object properties 11

opening a metadata bookmarkmetadata bookmark 13

Pperforming a search

search 17previewing data

objects 11projects

assign permissions 5assigning permissions 18creating a Project 6deleting a project 7duplicating a project 6job status 19manage folders 5manage projects 5manage security 18Metadata Manager business term 21renaming a project 7search projects 5viewing project contents 9

properties viewInformatica Analyst interface 3

Rreference tables

column properties 38creating a reference table from column patterns 41creating a reference table from column values 40creating a reference table from profile columns 39creating a reference table manually 39exporting a reference table 44finding and replacing values 43flat file properties 38importing a reference table 42managing columns 43managing rows 43viewing audit trail tables 45

renaming a Folderfolders 8

renaming a Projectprojects 7

renaming an objectobjects 12

Ssearch

performing a search 17search filters 16search results 17search syntax 15searching objects example 18

search filterssearch 16

search projectsprojects 5

search resultsimporting Metadata Manager tables 17search 17

Index 47

Page 57: Informatica Data Quality Analyst 9.1.0 User Guide (English) · 1 Quality Analyst 1

search syntaxsearch 15

searching objects examplesearch 18

security viewInformatica Analyst interface 3

Ttables

adding a table 28data objects 22database connection properties 27deleting a database connection 28viewing data objects 29

tagsassigning a tag 14creating a tag 14hiding tags 14tags overview 14

viewing tags 14tags overview

tags 14

Vviewing audit table events

reference tables 45viewing data objects

flat files 29tables 29

viewing folder contentsfolders 9

viewing object propertiesobjects 11

viewing project contentsprojects 9

viewing tagstags 14

48 Index