Data Engineering
Gain in-depth skills to administer and scale Apache Airflow—from configuring executors and users to building DAGs, integrating with AWS/Azure, and creating custom plugins.
Master Apache Airflow workflows using Python—from setting up executors and building DAGs to deploying production pipelines, cloud integration, and custom plugins.
- Data Engineering Automation: Ansible, Apache Airflow, and Snowflake (5 days, online)
Master data engineering automation—from provisioning infrastructure with Ansible to orchestrating workflows with Apache Airflow and integrating with Snowflake for modern cloud data warehousing.
- Distributed Task Automation with Python Celery and RabbitMQ (16 hours, online)
Gain hands‑on mastery of distributed task automation using Python, Celery & RabbitMQ. Set up Docker environments, define tasks, and scale workflows with routing, scheduling, and deployment.
- Jupyter Widgets Programming (3 days, online)
Build custom Jupyter widgets with modern ipywidgets 8.x and AnyWidget: Python widget classes, ESM-based front-ends in TypeScript, packaging, and embedding in JupyterLab and Notebook 7. Replaces the legacy CoffeeScript/Backbone-based widget toolchain.
- Distributed Task Automation with Python Faust and Kafka (16 hours, online)
Master distributed task automation with Python Faust and Kafka. Learn to containerize environments, process streaming data, manage state and fault tolerance, monitor systems, and deploy real‑time pipelines.
- Generative AI and LLMs for Python Programmers (5 days, online)
End-to-end generative AI for Python developers in 2026: frontier models (Claude 4.x, GPT-5.x, Gemini 2.x), prompt and context engineering, structured outputs, tool use, RAG, agentic patterns with MCP, evals, observability, and responsible deployment.
- Practical Apache Spark for Data Pipelines (21 hours, online)
Learn to build scalable data pipelines with Apache Spark using Python. Gain hands-on experience with Spark Core, SQL, DataFrames, and real-time processing.
- Task Automation with Python (14 hours, online)
Streamline workflows with Python by automating file ops, subprocesses, logging, CLI args, async and API tasks—plus optional AI and distributed automation modules.