Ab Initio Batch #46 – Course Overview

This comprehensive training program blends foundational data concepts with advanced Ab Initio development, SQL mastery, and UNIX scripting. It’s designed to equip learners with end-to-end data engineering skills through progressive modules and hands-on mini projects.

📅 Module Breakdown

Day 1: SDLC Fundamentals

  • Software Development Life Cycle basics

  • Project lifecycle understanding for data workflows

Days 2–5: Data Warehousing Concepts

  • OLTP vs OLAP

  • Star schema vs Snowflake schema

  • Dimensional vs ER modeling

  • Types of Facts and Dimensions

  • Slowly Changing Dimensions (SCD Types 0, 1, 2, 3, 4, and 6)

  • ETL architecture: ODS, Data Mart

  • Data types and normalization (1NF, 2NF, 3NF)
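
As an illustration of the Slowly Changing Dimensions topic above, the following is a minimal Python sketch contrasting Type 1 (overwrite, no history) with Type 2 (close the current row and add a new version). The column names (`customer_id`, `city`, `start_date`, `end_date`, `is_current`) and the sample data are assumptions made for this example, not part of the course material.

```python
from datetime import date

def apply_scd1(dim_rows, key, changes):
    """Type 1: overwrite attributes in place; no history is kept."""
    result = [dict(row) for row in dim_rows]  # copy so the input is untouched
    for row in result:
        if row["customer_id"] == key:
            row.update(changes)
    return result

def apply_scd2(dim_rows, key, changes, effective):
    """Type 2: expire the current row and append a new current version."""
    result = [dict(row) for row in dim_rows]
    current = next(
        (r for r in result if r["customer_id"] == key and r["is_current"]),
        None,
    )
    if current is None:
        return result  # nothing to version
    # Close out the existing version.
    current["is_current"] = False
    current["end_date"] = effective
    # Build the new version from the old one plus the changed attributes.
    new_row = dict(current)
    new_row.update(changes)
    new_row.update({"start_date": effective, "end_date": None, "is_current": True})
    result.append(new_row)
    return result

# Hypothetical dimension table with a single customer.
dim = [{
    "customer_id": 1, "city": "Pune",
    "start_date": date(2020, 1, 1), "end_date": None, "is_current": True,
}]

scd1 = apply_scd1(dim, 1, {"city": "Mumbai"})
scd2 = apply_scd2(dim, 1, {"city": "Mumbai"}, date(2024, 6, 1))

print(len(scd1), [r["city"] for r in scd1])
print(len(scd2), [(r["city"], r["is_current"]) for r in scd2])
```

Type 1 leaves one row per key, while Type 2 grows the table by one row per change, which is why Type 2 dimensions usually carry effective dates and a current-row flag.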

Days 6–11: SQL Mastery

  • SQL command categories: DML, DDL, DCL, TCL

  • Clauses: WHERE, CASE WHEN, EXISTS

  • Constraints and operators (math, relational, comparison, LIKE, IN, BETWEEN, AND/OR)

  • Set operators: UNION, UNION ALL, INTERSECT, MINUS/EXCEPT

  • Joins: natural, inner, left/right/full outer, semi, anti, self

  • Aggregation: GROUP BY, HAVING, ORDER BY

  • Functions: string, date, decimal, window functions

  • Performance tuning: indexing, partitioning, views
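
Several of the SQL topics above (EXISTS, semi and anti joins, set operators, GROUP BY with HAVING) can be tried out with a small sketch like the one below. It uses Python's built-in `sqlite3` module with an in-memory database; the `customers` and `orders` tables and their rows are hypothetical sample data. SQLite spells the set-difference operator `EXCEPT`; `MINUS` is the Oracle spelling.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'Asha'), (2, 'Ravi'), (3, 'Meera');
    INSERT INTO orders VALUES (10, 1, 250.0), (11, 1, 100.0), (12, 2, 75.0);
""")

# Semi join: customers that have at least one order (each listed once).
semi = conn.execute("""
    SELECT c.name FROM customers c
    WHERE EXISTS (SELECT 1 FROM orders o WHERE o.customer_id = c.id)
    ORDER BY c.name
""").fetchall()

# Anti join: customers with no orders at all.
anti = conn.execute("""
    SELECT c.name FROM customers c
    WHERE NOT EXISTS (SELECT 1 FROM orders o WHERE o.customer_id = c.id)
    ORDER BY c.name
""").fetchall()

# Set difference: customer ids that never appear in orders.
no_order_ids = conn.execute("""
    SELECT id FROM customers
    EXCEPT
    SELECT customer_id FROM orders
""").fetchall()

# Aggregation: total per customer, keeping only totals above 100.
totals = conn.execute("""
    SELECT customer_id, SUM(amount) AS total
    FROM orders
    GROUP BY customer_id
    HAVING SUM(amount) > 100
    ORDER BY customer_id
""").fetchall()

print(semi, anti, no_order_ids, totals)
conn.close()
```

The anti join and the `EXCEPT` query answer the same question in two ways, which makes them a convenient pair for comparing query plans when practicing performance tuning.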

Ab Initio Core Modules

  • Architecture: GDE, EME, Co>Op, Conduct>It, Express>It, MetaHub, PDL, Query>It, BRE

  • Scheduling tools: Control-M, Control Center, AutoSys, JIL scripts

  • Component types: Transform, Data, Continuous

  • Advanced topics: Partitioning/Departitioning, WMF, EMF, ICFF (WBC), Web services, JSON, vector flattening

  • Function library: string, date, decimal, regex, lookup, error handling, validation

  • Coding logic: variables (local/global), if-else, loops, lookup chaining, vector functions, temp variables, multistage transforms

  • DML writing styles (5 types)

  • Performance tuning with MFS

  • PDL and metaprogramming

  • Conduct>It planning and looping

Data Governance & Lineage

  • Data dependency tracking

  • Lineage visualization

  • Governance principles

  • Express>It integration

🛠️ Mini Projects & Applied Logic

  • Filter by Expression (FBE) & Reformat (REF): Data cleansing and redirection

  • Lookup Project: Transform logic implementation

  • Join Project: SCD1 and SCD2 modeling

  • Normalization Project: Inventory management schema

  • Lookup + Partitioning: Mini project on performance optimization

🖥️ UNIX Essentials (3–4 Days)

  • Core commands: cd, cp, rm, mv, sort, uniq, paste, pwd

  • Text processing, search, and process listing: grep, egrep, sed, awk, find, ps -elf

  • Shell scripting fundamentals

🚀 Deployment & Automation

  • air and m_ commands

  • Code migration: Dev to Prod

  • Conduct>It continuous components:

    • File watcher → Graphs → Purging/Cleansing/Housing

    • Row-to-column and column-to-row transformations

    • MetaPivot and Normalize for reshaping data

    • Rollup and accumulation for denormalization

    • Row suppression and aggregation logic
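
The reshaping ideas listed above (row-to-column, column-to-row, and rollup-style aggregation) can be sketched in plain Python as follows. This is only a conceptual analog, not Ab Initio DML or the behavior of any specific Ab Initio component; the field names (`account`, `month`, `amount`) and the sample records are assumptions for illustration.

```python
from collections import defaultdict

# Hypothetical input: one record per (account, month).
rows = [
    {"account": "A1", "month": "Jan", "amount": 100},
    {"account": "A1", "month": "Feb", "amount": 150},
    {"account": "A2", "month": "Jan", "amount": 80},
]

def rows_to_columns(records):
    """Row-to-column: one entry per account, with months as columns."""
    wide = defaultdict(dict)
    for r in records:
        wide[r["account"]][r["month"]] = r["amount"]
    return {acct: dict(months) for acct, months in wide.items()}

def columns_to_rows(wide):
    """Column-to-row: expand each month column back into its own record."""
    return [
        {"account": acct, "month": month, "amount": amount}
        for acct, months in wide.items()
        for month, amount in months.items()
    ]

def rollup_total(records):
    """Rollup-style aggregation: accumulate one total per account."""
    totals = defaultdict(int)
    for r in records:
        totals[r["account"]] += r["amount"]
    return dict(totals)

wide = rows_to_columns(rows)
print(wide)
print(columns_to_rows(wide) == rows)  # the round trip restores the input
print(rollup_total(rows))
```

The round trip works here because each (account, month) pair is unique in the sample; with duplicate keys, the pivot step would need an explicit rule for which value wins or how values are combined.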