Talend Big Data Integration Training Course

Talend Open Studio for Big Data is an open-source ETL tool designed for processing big data. It provides a development environment to interact with big data sources and targets, enabling users to run jobs without writing code.

This instructor-led live training (available online or onsite) is targeted at technical professionals who wish to deploy Talend Open Studio for Big Data to streamline the process of reading and crunching through big data.

By the end of this training, participants will be able to:

Install and configure Talend Open Studio for Big Data.
Connect with big data systems such as Cloudera, HortonWorks, MapR, Amazon EMR, and Apache.
Understand and set up the big data components and connectors in Open Studio.
Configure parameters to automatically generate MapReduce code.
Use Open Studio's drag-and-drop interface to execute Hadoop jobs.
Prototype big data pipelines.
Automate big data integration projects.

Format of the Course

Interactive lecture and discussion.
Lots of exercises and practice.
Hands-on implementation in a live-lab environment.

Course Customization Options

To request a customized training for this course, please contact us to arrange.

This course is available as onsite live training in Kenya or online live training.

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Course Outline

Introduction

Overview of "Open Studio for Big Data" Features and Architecture

Setting up Open Studio for Big Data

Navigating the UI

Understanding Big Data Components and Connectors

Connecting to a Hadoop Cluster

Reading and Writing Data

Processing Data with Hive and MapReduce

Analyzing the Results

Improving the Quality of Big Data

Building a Big Data Pipeline

Managing Users, Groups, Roles, and Projects

Deploying Open Studio to Production

Monitoring Open Studio

Troubleshooting

Summary and Conclusion

Requirements

An understanding of relational databases
An understanding of data warehousing
An understanding of ETL (Extract, Transform, Load) concepts

Audience

Business intelligence professionals
Database professionals
SQL Developers
ETL Developers
Solution architects
Data architects
Data warehousing professionals
System administrators and integrators

28 Hours

Need help picking the right course?
southafrica@nobleprog.co.za or +27 (0)10 005 5793

Testimonials (2)

A journey through the Spark world: a very intense course. DSL, spark sql, partitioning vs bucketing for me.

Georgiana Elisabeta

Course - Apache Spark Fundamentals

Hands on exercises. Class should have been 5 days, but the 3 days helped to clear up a lot of questions that I had from working with NiFi already

Talend Big Data Integration Training Course

Course Outline

Requirements

Testimonials (2)

Georgiana Elisabeta

Course - Apache Spark Fundamentals

James - BHG Financial

Course - Apache NiFi for Administrators

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

Talend Big Data Integration Training Course

Course Outline

Requirements

Testimonials (2)

Georgiana Elisabeta

Course - Apache Spark Fundamentals

James - BHG Financial

Course - Apache NiFi for Administrators

Related Courses

Advanced Apache Iceberg

Apache Iceberg Fundamentals

Big Data Analytics with Google Colab and Apache Spark

Apache NiFi for Administrators

PySpark and Machine Learning

Apache Spark Fundamentals

Administration of Apache Spark

Apache Spark in the Cloud

Python and Spark for Big Data (PySpark)

Python, Spark, and Hadoop for Big Data

Stratio: Rocket and Intelligence Modules with PySpark

Talend Administration Center (TAC)

Talend Data Stewardship

Talend Open Studio for ESB

Related Categories

Big Data

Talend

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites