Rafael Fernandes Cunha

Artificial Intelligence Departament, University of Groningen, The Netherlands.

prof_pic.jpg

I am a Lecturer in Artificial Intelligence at the University of Groningen, the Netherlands, where I teach reinforcement learning, among other AI topics, and supervise student research projects. I am also a PhD Candidate with a research focus on multi-agent reinforcement learning.

Over the past three years as a lecturer, I have supervised more than 20 bachelor's and master's thesis projects, several of which have been published at international venues such as NLDL 2025, and ECAI 2025 workshops. These projects span topics from multi-agent coordination to applications in cyber security and large language model post-training.

My PhD research focuses on multi-agent reinforcement learning in decentralized partially observable settings (Dec-POMDPs and POSGs). I analyze the mathematical structure of these problems to develop algorithms with improved convergence and performance guarantees. In previous work, I applied deep RL to optimize switching control in vehicle platoon systems, demonstrating reinforcement learning for dynamical systems control.

During my master's studies in Electrical Engineering, with a focus on control systems at UNICAMP, I acquired experience in the mathematical modeling of dynamical systems and the resolution of convex optimization problems. This groundwork has helped me understand RL problems on the algorithmic level and discern their connection to output feedback control type problems, for which there are established mathematical tools for analysis.

Here’s a broad overview of my current research interests:

  • Deep Reinforcement Learning
  • Multiagent Reinforcement Learning
  • Transfer Learning in RL
  • RL-based Post-training of LLMs with Analytical/Verifiable Rewards

Check out here the list of open projects that you can enroll in for your bachelor’s or master’s thesis.

Selected publications

2025

  1. AAAI
    Optimally solving simultaneous-move dec-POMDPs: The sequential central planning approach
    Johan Peralez, Aurélien Delage, Jacopo Castellini, and 2 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2025
  2. TMLR
    Sparsity-Driven Plasticity in Multi-Task Reinforcement Learning
    Aleksandar Todorov, Juan Cardenas-Cartagena, Rafael F. Cunha, and 2 more authors
    Transactions on Machine Learning Research, 2025
  3. NLDL
    World model agents with change-based intrinsic motivation
    Jeremias Ferrao, and Rafael Cunha
    In 2025 Northern Lights Deep Learning Conference (NLDL), 2025
  4. SPAIML/ECAI WS
    Seed Scheduling in Fuzz Testing as a Markov Decision Process
    Rafael F Cunha, Luca Müller, Thomas Rooijakkers, and 2 more authors
    In 1st International Workshop on Security and Privacy-Preserving AI/ML (SPAIML) at ECAI, 2025

2023

  1. Fuel-Efficient Switching Control for Platooning Systems With Deep Reinforcement Learning
    Tiago R Goncalves, Rafael F Cunha, Vineeth S Varma, and 1 more author
    IEEE Transactions on Intelligent Transportation Systems, 2023

2019

  1. Robust partial sampled-data state feedback control of Markov jump linear systems
    Rafael F Cunha, Gabriela W Gabriel, and José C Geromel
    International Journal of Systems Science, 2019