Pentaho Data Integration For Data Warehouse

July 7, 2023 | Information Technology | adminweb

DESCRIPTION

Pentaho Data Integration (PDI), also known as Kettle, is Pentaho's software for ETL (Extract, Transform, Load) processing. PDI can be used to migrate data, cleanse data, and load large volumes of data from files into databases or the other way around. It provides a graphical user interface with drag-and-drop components that make the work easier for users. The main elements of PDI are the Transformation and the Job. A Transformation is a set of instructions that turns input into the desired output (input, process, output), while a Job is a set of instructions that runs transformations. PDI has three components: Spoon, Pan, and Kitchen. Spoon is the user interface for building Jobs and Transformations. Pan is the command-line tool that executes transformations, which read, modify, and write data. Kitchen is the command-line program that executes jobs.
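
As a rough illustration of how Pan and Kitchen are usually driven from scripts, the sketch below builds the command line for a transformation file (.ktr) or a job file (.kjb). The install directory, file paths, and parameter names are hypothetical. The -file, -level, and -param: options follow PDI's Linux-style command-line syntax and should be checked against the installed version.

```python
from pathlib import PurePosixPath


def build_pdi_command(pdi_home, artifact, params=None, level="Basic"):
    """Build an argument list for pan.sh (.ktr) or kitchen.sh (.kjb).

    pdi_home and artifact are hypothetical paths used for illustration.
    """
    suffix = PurePosixPath(artifact).suffix.lower()
    if suffix == ".ktr":
        script = "pan.sh"        # Pan executes transformations
    elif suffix == ".kjb":
        script = "kitchen.sh"    # Kitchen executes jobs
    else:
        raise ValueError(f"Expected a .ktr or .kjb file, got: {artifact}")

    cmd = [
        str(PurePosixPath(pdi_home) / script),
        f"-file={artifact}",
        f"-level={level}",       # logging level, e.g. Minimal, Basic, Detailed
    ]
    # Named parameters are passed as -param:NAME=value
    for name, value in (params or {}).items():
        cmd.append(f"-param:{name}={value}")
    return cmd


if __name__ == "__main__":
    # Hypothetical job file and parameter name
    print(build_pdi_command("/opt/pdi", "/etl/nightly.kjb", {"RUN_DATE": "2023-07-07"}))
```

The returned list can be handed to subprocess.run(cmd, check=True), so a failed transformation or job surfaces as a non-zero exit code.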

 

OBJECTIVES

After completing this training, participants are expected to be able to:

  1. Understand the role of Kettle as an ETL (Extract, Transform, Load) tool
  2. Understand the basic components of Kettle
  3. Understand how to create parameters and variables in Kettle
  4. Use the basic Kettle components (Step, Hop, Job)

 

COURSE OUTLINE

  1. Introduction to Data Warehouse
  • Data Warehouse
  • Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP)
  • Data Warehouse and OLAP
  • Delivering Solutions with an ETL (Extract, Transform, Load) Tool
  2. Installation and configuration
  • Java Runtime Environment / Java Development Kit
  • Pentaho Data Integration
  • XAMPP package (Apache HTTP Server and MySQL)
  • SQLyog, a GUI-based MySQL client
  • Data and script samples
  3. Short introduction to MySQL
  • MySQL Storage Engines
  • Administering MySQL via phpMyAdmin
  • PHI-Minimart sample database installation
  4. Pentaho Data Integration (Kettle)
  • Introducing Kettle as Pentaho’s ETL Suite
  • Architecture
  • Components
    • Spoon : Graphical UI designer for job / transformation steps
    • Pan : Command-line batch script for transformation execution
    • Kitchen : Command-line batch script for job execution
    • Carte : Cluster server
  • Job / Transformation
  • Step and Hop
  • Row and Meta Data
  • Relation between job and transformation
  5. Getting started with Spoon
  • File system and RDBMS based Repository
  • Spoon Development Environment
  • Database Connections
  • Job and Transformation
    • Creating a job
    • Creating a transformation
    • Calling a transformation from a job
  • Configuring logging
  6. Multidimensional modelling
  • Normalized versus Multidimensional Model
  • Fact and Dimension Tables
  • Star Schema and Snowflake Schema
  • Tasks :
    • Create a Kettle transformation to map the PHI-Minimart transactional sample database to a dimensional model database
    • Create logs for each step
  7. Change Data Capture (CDC)
  • What is CDC?
  • Why CDC is hard and depends heavily on the data source
  • Demonstration of the CDC feature in SQL Server 2008
  • Tasks :
    • Create a Kettle transformation to map the PHI-Minimart transactional sample database to a dimensional model database
    • Create logs for each step
  8. Slowly Changing Dimension (SCD)
  • Slowly Changing Dimensions as a solution to master data history problems
  • SCD Types
  • Using Kettle steps to handle several SCD types with several schemas :
    • Insert / Update
    • Punch Through
  9. Orphan/Late Arrival
  • What is a Late Arrival Dimension?
  • Typical situations where Late Arrival occurs
  • Best practices for handling Late Arrival
  10. OLAP View of multidimensional data (Mondrian/JPivot)
  • Mondrian installation
  • Creating a schema based on our fact and dimension tables
  • Viewing and navigating our cube in a web browser
  11. Data staging
  • What is Data Staging?
  • Background : Physical I/O versus In-Memory Processing
  • Task :
    • Create a transformation to join 3 data sources : text file, Excel spreadsheet, and RDBMS
    • Create a currency staging table to solve the sequential dependence problem
  12. Advanced controls
  • Environment Variables
  • Shared Objects
  • Error Handling
  • Email job results
  • Task :
    • Create a dynamic table dump using variables and looping control
    • Refine existing transformations to use email alerts
  13. Automation
  • Using Windows Task Scheduler to schedule ETL jobs and transformations
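
To give a feel for the Slowly Changing Dimension topic in the outline above, here is a minimal plain-Python sketch of the idea usually called SCD Type 2: when a tracked attribute changes, the current dimension row is closed and a new version is added, so history is preserved. This is not how Kettle implements it. The column names (sk, valid_from, valid_to) and the function name are assumptions made for illustration only.

```python
from datetime import date


def apply_scd2(dimension, incoming, key, tracked, today):
    """Return a new list of dimension rows after an SCD Type 2 style update.

    dimension: existing rows, each with 'sk', 'valid_from', 'valid_to' (hypothetical columns)
    incoming:  source records holding the business key and tracked attributes
    """
    result = [dict(row) for row in dimension]           # copy so the input is untouched
    current = {row[key]: row for row in result if row["valid_to"] is None}
    next_sk = max((row["sk"] for row in result), default=0) + 1

    for record in incoming:
        existing = current.get(record[key])
        if existing is not None and all(existing[c] == record[c] for c in tracked):
            continue                                    # nothing changed, keep current version
        if existing is not None:
            existing["valid_to"] = today                # close the old version
        new_row = {
            "sk": next_sk,                              # new surrogate key
            key: record[key],
            **{c: record[c] for c in tracked},
            "valid_from": today,
            "valid_to": None,                           # None marks the current version
        }
        result.append(new_row)
        current[record[key]] = new_row
        next_sk += 1
    return result


if __name__ == "__main__":
    dim = [{"sk": 1, "customer_id": 10, "city": "Yogyakarta",
            "valid_from": date(2023, 1, 1), "valid_to": None}]
    changes = [{"customer_id": 10, "city": "Bandung"}]
    for row in apply_scd2(dim, changes, "customer_id", ["city"], date(2023, 7, 7)):
        print(row)
```

Running the same input a second time adds no rows, because the incoming values already match the current version of each record.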

 

INVESTMENT AND FACILITIES

Delivery Method | Price & Facilities
Option 1: Online Training

   Online training: Rp 3.900.000 per participant

Minimum 1 participant; dates can be requested

Training runs for 2 half days (08.00 – 12.00 WIB or 13.00 – 17.00 WIB)

Delivered via Zoom or Google Meet

Facilities : training certificate (soft copy and hard copy), certificate delivery to the participant's address, soft copy of the course materials

Price excludes 11% VAT (PPN)

Option 2: Offline Training in Yogyakarta

   Offline training: Rp 6.900.000 per participant

Minimum 1 participant; dates can be requested

Training runs for 2 full days (08.00 – 16.00 WIB)

Venues :

  • Hotel El Royale, Yogyakarta
  • Hotel Malyabhara, Yogyakarta

Facilities : meeting room, training module, training certificate, training kit, lunch, coffee break

Price excludes 11% VAT (PPN)

Option 3: Offline Training outside Yogyakarta (Jakarta, Bandung, Surabaya, etc.)

   Offline training: Rp 7.900.000 per participant

Minimum 2 participants; dates can be requested

Training runs for 2 full days (08.00 – 16.00 WIB)

Venue options :

  • Hotel Grand Tebu, Bandung
  • Hotel Santika Pandegiling, Surabaya
  • Hotel Asyana Kemayoran, Jakarta
  • Hotel Ibis Simpang Lima, Semarang
  • Hotel Ibis, Solo
  • and others

Facilities : meeting room, training module, training certificate, training kit, lunch, coffee break

Price excludes 11% VAT (PPN)

Option 4: Offline Training outside Java (Lombok, Bali, Balikpapan, etc.)

   Offline training: Rp 8.900.000 per participant

Minimum 2 participants; dates can be requested

Training runs for 2 full days (08.00 – 16.00 WIB)

Venue options :

  • Hotel Kimaya Braga, Bandung
  • Hotel Midtown, Surabaya
  • Hotel Asyana Kemayoran, Jakarta
  • Hotel Amaris Kemang, Jakarta
  • Hotel Ibis Simpang Lima, Semarang
  • Hotel Ibis, Solo

Facilities : meeting room, training module, training certificate, training kit, lunch, coffee break

Price excludes 11% VAT (PPN)

 

Instructor

Instructor team

Other in-house training courses are listed on the In House Training page.

For online training titles and information, also visit PT Expertindo's other website at www.e-trainingonline.com
