Glossary
- RL
Reinforcment Learning. A machine learning method to learn policies trough interaction with an environment.
- SARL
Single-Agent Reinforcement Learning. A reinforcement learning setting where a single agent interacts with an environment to learn an optimal policy.
- MARL
Multi-Agent Reinforcement Learning. A reinforcement learning setting where multiple agents interact with an environment to learn optimal policies. In this setting, agents typically share a policy and learn collaboratively.
- SB3
Stable Baselines3. A popular open-source library for RL algorithms, built on top of PyTorch, providing implementations of various RL methods for easy experimentation and benchmarking.
- AFC
Active Flow Control. The use of actuators to manipulate a fluid flow in order to achieve a desired outcome, such as drag reduction or lift enhancement.
- RBC
Rayleigh-Bénard Convection. A type of natural convection that occurs in a fluid layer heated from below and cooled from above, leading to the formation of convection cells.
- TCF
Turbulent Channel Flow. A canonical flow configuration used to study turbulence in wall-bounded flows, characterized by a fluid flowing between two parallel plates.
- PPO
Proximal Policy Optimization. A policy gradient method that improves learning stability by using a clipped surrogate objective, preventing excessive policy updates and ensuring reliable performance.
- SAC
Soft Actor-Critic. An off-policy reinforcement learning algorithm that maximizes both expected reward and entropy, encouraging robust exploration and sample efficiency.