Reinforcement & Decision-Making for Agentic AI (with Python) Training Course

This course explores the principles and implementation of reinforcement learning (RL) and sequential decision-making as they apply to agentic AI systems. Participants will learn how to design, train, and evaluate agents that interact dynamically with their environments to achieve long-term goals through learning and adaptation.

This instructor-led, live training (online or onsite) is aimed at advanced-level engineers and researchers who wish to integrate reinforcement learning and planning algorithms into agentic systems for automation, robotics, and adaptive reasoning.

By the end of this training, participants will be able to:

Understand the mathematical foundations of reinforcement learning and decision-making.
Implement key RL algorithms such as DQN, PPO, and A3C using Python and PyTorch.
Model environments using OpenAI Gym and design custom simulation scenarios.
Train, evaluate, and debug agents for continuous and discrete control tasks.
Apply reinforcement learning techniques to agentic AI use cases in robotics and planning.
Balance exploration, exploitation, and safety constraints in real-world deployment.

Format of the Course

Instructor-led lectures and live coding demonstrations.
Hands-on exercises using open-source frameworks and simulation environments.
Applied project integrating decision-making into an agentic AI system.

Course Customization Options

To request a customized training for this course, please contact us to arrange.

This course is available as onsite live training in United Kingdom or online live training.

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Course Outline

Introduction to Reinforcement Learning and Agentic AI

Decision-making under uncertainty and sequential planning
Key components of RL: agents, environments, states, and rewards
Role of RL in adaptive and agentic AI systems

Markov Decision Processes (MDPs)

Formal definition and properties of MDPs
Value functions, Bellman equations, and dynamic programming
Policy evaluation, improvement, and iteration

Model-Free Reinforcement Learning

Monte Carlo and Temporal-Difference (TD) learning
Q-learning and SARSA
Hands-on: implementing tabular RL methods in Python

Deep Reinforcement Learning

Combining neural networks with RL for function approximation
Deep Q-Networks (DQN) and experience replay
Actor-Critic architectures and policy gradients
Hands-on: training an agent using DQN and PPO with Stable-Baselines3

Exploration Strategies and Reward Shaping

Balancing exploration vs. exploitation (ε-greedy, UCB, entropy methods)
Designing reward functions and avoiding unintended behaviors
Reward shaping and curriculum learning

Advanced Topics in RL and Decision-Making

Multi-agent reinforcement learning and cooperative strategies
Hierarchical reinforcement learning and options framework
Offline RL and imitation learning for safer deployment

Simulation Environments and Evaluation

Using OpenAI Gym and custom environments
Continuous vs. discrete action spaces
Metrics for agent performance, stability, and sample efficiency

Integrating RL into Agentic AI Systems

Combining reasoning and RL in hybrid agent architectures
Integrating reinforcement learning with tool-using agents
Operational considerations for scaling and deployment

Capstone Project

Design and implement a reinforcement learning agent for a simulated task
Analyze training performance and optimize hyperparameters
Demonstrate adaptive behavior and decision-making in an agentic context

Summary and Next Steps

Requirements

Strong proficiency in Python programming
Solid understanding of machine learning and deep learning concepts
Familiarity with linear algebra, probability, and basic optimization methods

Audience

Reinforcement learning engineers and applied AI researchers
Robotics and automation developers
Engineering teams working on adaptive and agentic AI systems

28 Hours

Delivery Options

Private Group Training

Our identity is rooted in delivering exactly what our clients need.

Pre-course call with your trainer
Customisation of the learning experience to achieve your goals -

Bespoke outlines
Practical hands-on exercises containing data / scenarios recognisable to the learners

Training scheduled on a date of your choice
Delivered online, onsite/classroom or hybrid by experts sharing real world experience

Private Group Prices RRP from £7600 online delivery, based on a group of 2 delegates, £2400 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.

Reinforcement & Decision-Making for Agentic AI (with Python) Training Course

Course Outline

Requirements

Delivery Options

Private Group Training

Public Training

Testimonials (3)

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Agentic AI for Enterprise Applications

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Autonomous Decision-Making with Agentic AI

Daniel - Facultatea S.A.I.A.P.M.

Course - Agentic AI in Multi-Agent Systems

Provisional Upcoming Courses (Contact Us For More Information)

Reinforcement & Decision-Making for Agentic AI (with Python)

Reinforcement & Decision-Making for Agentic AI (with Python)

Reinforcement & Decision-Making for Agentic AI (with Python)

Reinforcement & Decision-Making for Agentic AI (with Python)

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

Reinforcement & Decision-Making for Agentic AI (with Python) Training Course

Course Outline

Requirements

Delivery Options

Private Group Training

Public Training

Testimonials (3)

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Agentic AI for Enterprise Applications

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Autonomous Decision-Making with Agentic AI

Daniel - Facultatea S.A.I.A.P.M.

Course - Agentic AI in Multi-Agent Systems

Provisional Upcoming Courses (Contact Us For More Information)

Reinforcement & Decision-Making for Agentic AI (with Python)

Reinforcement & Decision-Making for Agentic AI (with Python)

Reinforcement & Decision-Making for Agentic AI (with Python)

Reinforcement & Decision-Making for Agentic AI (with Python)

Related Courses

Autonomous Decision-Making with Agentic AI

Understanding Agentic AI: Concepts and Capabilities

Agentic AI for Enterprise Applications

Agentic AI in Multi-Agent Systems

Building Agentic AI Systems: From Theory to Practice

Governance and Security Patterns for WrenAI in the Enterprise

Modernizing Legacy BI with WrenAI: Adoption, Migration, and Change Management

Quality and Observability for WrenAI: Evaluation, Prompt Tuning, and Monitoring

Format of the Course

Course Customization Options

Building with the WrenAI API: Applications, Charts, and NL to SQL

WrenAI Cloud Essentials: From Data Sources to Dashboards

WrenAI for Financial Analytics: KPI Modeling and Regulatory-Aware Dashboards

WrenAI OSS Deep Dive: Semantic Modeling, Text to SQL, and Guardrails

WrenAI for Product Teams: Conversational Analytics and Self-Service BI

Deploying WrenAI for SaaS: Embedded GenBI in Customer-Facing Products

Operational Analytics with WrenAI Spreadsheets and Metrics Library

Related Categories

Agentic AI

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites