Ab Initio Course


Ab Initio Course Overview

Ab Initio Software is an American multinational enterprise software corporation based in Lexington, Massachusetts. The company specializes in high-volume data processing applications and enterprise application integration.

Ab Initio products provide a user-friendly platform for parallel data processing applications in both homogeneous and heterogeneous environments. These applications cover fourth-generation data analysis, batch processing, complex event processing, quantitative and qualitative data processing, and data manipulation. The GUI-based parallel processing software is commonly used to extract, transform, and load (ETL) data.

It is a flexible and extensible tool that adapts at runtime to the formats of the records it receives. New components can be created from any program and incorporated into the tool, which permits integration and reuse of external legacy code and storage engines.

What is Ab Initio software?

Ab Initio is a business intelligence platform comprising six data processing products: the Co>Operating System, the Component Library, the Graphical Development Environment (GDE), the Enterprise Meta>Environment (EME), the Data Profiler, and Conduct>It. It is a powerful GUI-based parallel processing tool for ETL data management and analysis.

Who is eligible for the Ab Initio Course?

Anyone working on a maintenance project where the underlying ETL tool is Ab Initio. Anyone who wants to improve their ETL skill set by learning a tool that is in demand in the market. Anyone with knowledge of databases and/or data warehousing.

What Skills are required to learn this Course?

Knowledge of the different components and their purposes. The ability to build a graph that adheres to the business requirements. An understanding of how to combine different tools for different requirements. A complete understanding of data warehousing and ETL terminology.

What are the Job Opportunities after this Course?

You can pursue roles such as data analyst or ETL developer at companies such as Wells Fargo, Bank of America, DBS, Commonwealth Bank, and more.

What are the Main Topics Covered in this Course?

The main topics covered include an introduction to Ab Initio software, host connection settings, the components of a graph, dataset components, lookup files and lookup functions, validate components, performance tuning, and more.

Course Syllabus

Introduction to Ab Initio software

Ab Initio Architecture

Understanding GDE

Understanding Co-op System

Understanding EME/Application hub

Understanding Host connection settings

Configure host settings

Understanding Ab Initio Jobs/Graphs

Basics of Graphs

Building and executing graphs

Understanding Sandboxes/Projects

Understanding components of a graph

Record formats, DML, Transform functions, Key specifiers

Dataset components/Table components/Program components

Validating the graphs and understanding the error messages

Component Overview-Editors

Record Format Editors

Expression Editors

Transform Editors

Key Specifier Editors

Package Editors

Variable Editors

Component Overview-Dataset components

Input file

Output File

Intermediate file

Lookup files and Lookup Functions

Filter by expression

Dedup sorted

Match sorted

Denormalize sorted

Understanding Partitioning Concepts and Multifiles

Understanding Multifiles

Understanding Parallelism

Data Parallelism

Component Parallelism

Pipeline parallelism

Component Overview-Partition components

Partition by key

Partition by expression

Partition by round robin

Partition by range

Partition by load balance

Component Overview-Departition components

Component Overview-Database Components, DBC file configuration

Understanding database configuration

Input table

Output table

Truncate Table

Update Table

Join with DB

Component Overview-Miscellaneous Components

Redefine format

Gather Logs

Run Program

Component Overview-Validate Components

Check Order

Compare Records

Validate Records

Generate Record

Create Data

Compute Checksum

Understanding Performance Tuning

Component Folding

Additional Graph concepts

Common projects

Dependency analysis/Impact analysis

Generic Graphs/Reusable Graphs

Conditional components

Concept of Phases and Checkpoints

Understanding Plans/Conduct>It

Understanding Continuous Flows

Understanding various command-line options/configuration variables

M commands

Air commands

Configuration variables