Carel van Niekerk carelvniekerk

Carel van Niekerk

Hi there 👋

I am a Research Scientist/Engineer specialising in Reinforcement Learning (RL) Post-training, LLM Alignment, and Uncertainty Quantification. Currently, I am a Postdoctoral Researcher at Heinrich Heine University, Düsseldorf. I have over 10 years of experience across academia, recently completing my PhD (magna cum laude) with a focus on uncertainty-aware decision-making. My goal is to build trustworthy, scalable, and agentic AI systems.

🎯 Interests

Reinforcement Learning for LLMs and Agentic LLMs: using intrinsic reward via model confidence for improved performance, calibration and reliability.
Uncertainty in Large Language Models: techniques for estimating and mitigating uncertainty to enhance model trustworthiness and decision-making.
Agentic LLMs: development of autonomous AI agents leveraging LLM capabilities for complex task execution.
Dialogue Systems & NLP: ConvLab 3 toolkit, unified data formats, state tracking, production-ready pipelines.

🛠 Tech Stack

🌍 Languages

📚 Selected Publications

📂 Notable Projects

RLSF-LLM: Self-supervised RL post-training pipelines for LLMs, using intrinsic confidence-based rewards.
ConvLab 3: Unified data format, improved state-tracking and Reinforcement learning tools.

Last updated: 2025-11-10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Carel van Niekerk carelvniekerk

Achievements

Achievements

Highlights

Block or report carelvniekerk