Seminars

Noah De Nicola | Inference Strategies for RL

Abstract Many real-world problems are extremely difficult, combinatorial in nature, and can require complex coordination. This level of complexity can cause even well-trained RL systems to hit a performance ceiling that they are unable to break through with zero-shot inference....

Lood van Niekerk | Machine Learning for Protein Modeling and Design

Abstract Proteins are the machines that perform most of the function in our cells and in the cells of all organisms (replicating DNA, performing reactions, responding to stimuli). A better understanding of proteins can be used for designing medicines, genetically...

Ruan de Kock | Multi-Agent RL

Abstract Multi-Agent Reinforcement Learning (MARL) provides a framework for modelling complex systems in which multiple decision-makers interact within a shared environment. Such settings are ubiquitous in the real world, yet introduce fundamental challenges that do not arise in single-agent learning....

Batsi Ziki | Meta-RL

Abstract Reinforcement Learning (RL) is known to be sample-inefficient, and RL agents often do not generalise to new environments. Meta-Reinforcement Learning (Meta-RL) is one solution to these issues. Meta-RL is about learning how to reinforcement learn; the model learns parts...

Associate Professor Elefelious Belay | Drought Prediction in Ethiopia: A Deep Learning Approach To Reduce Drought Driven Poverty

Abstract This study examines drought dynamics in the arid and semi-arid lowlands of Ethiopia, focusing on selected districts in the Afar and Somali regions using remote sensing and climate-based drought indicators. The results show ongoing environmental stress, limited ecological recovery,...

Daniel Wachira | Assessing AI-Amplified Counterfeit Medicine Threats in Africa

Abstract This study investigates whether frontier AI systems can systematically exploit structural vulnerabilities in African pharmaceutical supply chains, lowering technical barriers for bad actors seeking to produce, distribute, and evade detection of falsified medications. We address three key questions: First,...

Qi Guo | Inter-agent Influence Evaluation

Abstract Persuasion and deception pose distinct risks in multi-agent settings. In the context of misuse, an AI agent instructed to pursue a harmful goal could use persuasion, deception, and other forms of influence to recruit other AI agents — combining...

Omer Kamal Ali Ebead | Embedded Adversarial Agents in Multi-Agent LLM Systems

Abstract LLM agents now hold sensitive user data and interact autonomously at scale — but what happens when one agent is adversarial from the start? Unlike external attacks (prompt injection, jailbreaking), an embedded adversary is a trusted participant that exploits...

Claude Formanek | Offline RL

Abstract Offline Reinforcement Learning (RL) has emerged as a critical paradigm for real-world AI deployment, particularly in domains where active data collection is either prohibitively expensive or physically dangerous. By enabling agents to learn optimal policies from fixed, pre-collected datasets,...

St John Grimbly | Intro to RL

Abstract What does it take to learn purely from the consequences of your actions — no labels, no corrections, just a signal telling you how well you did? Reinforcement learning (RL) answers that question, and it turns out the solutions...

Pramod Kaushik | Strategic Vagueness in LLMs

Abstract This work investigates whether LLMs understand and deploy strategic vagueness, with implications for AI safety. Steven Pinker’s theory of indirect speech posits that humans use ambiguous language as a rational strategy when facing audiences with conflicting interests, enabling coordination...

Akash Kundu | Similarity as a Signal: Do AI Agents Cooperate More When They Know They’re Alike?

Abstract The Nash equilibrium for the Prisoner’s Dilemma is to defect. Always. But here’s a thought: what if you knew the coplayer across from you thought about the world the same way you do? Would you still defect? That’s the...

Dr. Elizabeth Oseku | HASH: Writing the Code for AI in Africa

Abstract This talk will explore the work of the Hub for Artificial Intelligence in Maternal, Sexual and Reproductive Health (HASH) since its establishment in 2021. It will highlight how a vision to create a research hub that advances responsible AI...

Joseph Low & Oscar Duys | Delegating Deliberation to Agents

Abstract Can AI agents learn what you think — and represent you in a discussion you never attended? As multi-agent AI systems become increasingly capable of deliberating on complex issues, a new possibility emerges: delegating your voice in collective decision-making...

Arsene Tayo Abichai | Parallel Direct Asynchronous Stochastic Quasi-Newton (DASQN) Scheme for Multi-core Architecture

Abstract Classic first-order optimization methods, such as gradient descent (GD), have significant limitations: generally linear convergence, sensitivity to the choice of learning rate, and significant oscillations around the minimum, which are particularly pronounced in ill-conditioned problems. To overcome these limitations,...

Mr KAPTCHOUANG Yvan Derrick | Using Parallelization and Compression for Frugal Learning

Speaker: Yvan Dérick KAPTCHOUANG Abstract This talk addresses the challenge of energy efficiency in Graph Neural Networks (GNNs), whose growing complexity leads to high computational and energy costs. We present a generic methodology for designing frugal Graph Convolutional Networks (GCNs)...

Asma Basly | Containerized Robotics: Accelerating Research from Prototype to Production

Speaker: Asma Basly (OORB Studio) Abstract Robotics development today is plagued by extreme fragmentation-dozens of simulation tools, multiple versions of ROS, incompatible CAD workflows, and countless dependency conflicts that vary across operating systems and hardware configurations. Research teams spend more time...

AKAMBA MANI Crescence Catherine – Credit Risk Prediction in Peer-to-Peer Lending Platforms using Graph Features

Speaker Bio My name is AKAMBA MANI Crescence Catherine, and I hold a Master’s degree in Computer Science, specializing in Data Science, from the University of Yaoundé 1 in Cameroon. My thesis focused on predicting credit risk in peer-to-peer lending...

Benjamin Cowley – Attention and learning in high performance cognition

Speaker Bio Benjamin Ultan Cowley is Professor of Learning in Humans and Machines at the Faculty of Educational Sciences, and a Docent of cognitive science. He defended his PhD in Computer Science at the University of Ulster, Northern Ireland, in...

Evgenii Rudakov – Action Atoms for Inferring Control Strategies from Movement

Speaker Bio Evgenii is a doctoral researcher in the HiPerCog group (since 2023), where he combines machine learning with computational modeling to understand human actions in dynamic environments and how learning shapes them. He holds a bachelor’s in Computer Science...

Pablo Flores – Latent Play: Unsupervised Neural Methods for Modeling Player Styles and Learning

Speaker Bio Pablo, generally known as Pipa, is a doctoral researcher on the CLIC program. In Chile, he completed his teaching degree and went on to teach high-school students in the fields of technology and physics. He joined the HiPerCog...

Louis Wei-Yu Feng – AI Safety in the African Context

Speaker: Louis Wei-Yu Feng, University of Cape Town Abstract Existing Large Language Model (LLM) safety benchmarks remain English-centric, severely limiting evaluations for marginalized populations in the Global South. Despite evidence that 85% of women experience online violence, no benchmark systematically...

Bill Jordan Tanekeu – Reinforcement Learning Parallelization applied to medical diagnosis

Speaker Bio Bill Tanekeu is a young Cameroonian graduate, 22 years old. He earned a scientific baccalaureate in 2019 in his hometown of Manjo, on the Cameroonian littoral, before moving to Yaoundé to continue his university studies in computer science....

Chris Emezue – Lanfrica, open science, open access, and AI in Africa

Speaker: Chris Emezue Abstract Our digital world is a rich tapestry of ideas, languages, cultures, and knowledge. However, our access to and understanding of these resources is skewed; some gain significant visibility, while others remain underrepresented and obscure (even when...

Tom Ringstrom – A Unified Theory of Compositionality, Modularity, and Interpretability in Markov Decision Processes

Speaker: Tom Ringstrom Abstract In this talk, Tom presents Option Kernel Bellman Equations (OKBEs) for a new reward-free Markov Decision Process. Rather than a value function, OKBEs directly construct and optimize a predictive map called a state-time option kernel (STOK)...

Baraah Sidahmed – Game-Aware Optimization for Multi-Agent Reinforcement Learning

Speaker Bio A phD candidate at the relational ML group at the CISPA Helmholtz center for information technology. Previously worked on optimizing multi-agent reinforcement learning using ideas from game theory. currently working on a general framework that enables a wide...

Everlyn Chimoto – Improving Quantized Multilingual LLMs

Speaker Bio Everlyn is a PhD student in Natural Language Processing at the University of Cape Town. She specializes in Neural Machine Translation for low-resource languages under Prof. Bruce Bassett’s supervision. Her research focuses on data and model-efficient methods for...

Dr Chinasa T. Okolo – Broadening Perspectives on African Governance in the Era of AI

Speaker Bio Chinasa T. Okolo, Ph.D., is the Founder of Technēcultură, a Fellow at The Brookings Institution, and a recent Computer Science Ph.D. graduate from Cornell University. Her research focuses on AI governance and safety for the Global Majority, datafication...

Dr Daniel Okoh – Efforts at Developing ML/AI-Driven Applications for Space Weather Prediction and Forecasting

Speaker Bio Dr. Daniel Okoh is a Postdoctoral Research Fellow at the Technical University of Kenya under the DARA (Development in Africa with Radio Astronomy) program. He has worked as researcher with the National Space Research and Development Agency (NASRDA)....

Prof. Patrick McSharry – Applied Intelligence: Machine Learning for Societal and Commercial Transformation

Speaker Bio Patrick McSharry is a Visiting Professor at the Department of Electrical and Computer Engineering, Carnegie Mellon University, Research Fellow at the Kigali Collaborative Research Centre (KCRC) and Strategic Advisor to the World Bank funded African Centre of Excellence...

Batsi Ziki – Meta-Learning the Intrinsic Reward Weighting in Curiosity-Driven RL

Speaker Bio Batsi is a Master’s student at the University of Cape Town with interests in curiosity-driven reinforcement learning and meta-reinforcement learning. His research focuses on improving the sample efficiency of reinforcement learning algorithms.See you there!

Alberto Cazzaniga – On image-text communication in vision-language models

Abstract Recent advances in multimodal training allow for integration of images and text within a unified model. Given their black-box nature, little is known on the strategies developed by vision-language models (VLMs) to allow efficient communication between the two modalities....

Homomorphism Counts Rule Everything Around Me – Emily Jin

Speaker: Emily Jin Abstract One of the key challenges in graph machine learning is how to effectively encode the topology of a graph into the model at hand. Standard message-passing GNNs are known to struggle with counting certain patterns (e.g.,...

Dr Tommaso Salvatori – On Predictive Coding Networks in Machine Learning

Speaker Bio Trained as a mathematician, I then did my PhD in machine learning and computational neuroscience at the University of Oxford, where I investigated the performance of biologically plausible algorithms in deep learning tasks. Following this, I pursued a...

Shocklab Seminar

Dr Ahmed El Hady – Functional ultrasound imaging during freely moving behavior

AfriClimate AI: Harnessing Artificial Intelligence for Climate Resilience in Africa – Dr Sabrina Amrouche

Speaker Bio Dr. Sabrina Amrouche is the co-founder of AfriClimate AI, a grassroots initiative leveraging AI to address climate challenges in Africa. She also serves as Head of Data Science at ZYTLYN, where she leads the development of advanced time...

Play-style Identification and Player Modelling for Generating Tailored Advice in Video Games – Branden Ingram

Speaker Bio I am a dedicated academic and researcher, currently serving as a Lecturer at Wits University. Throughout my academic journey, I sought to merge two of my greatest passions: video games and computer science. These passions led me to...

Solving Problems in Psychiatry with Machine Learning – Zach Wolpe

Abstract Machine Learning is playing an increasingly important role in biomedical engineering. In this talk I’ll discuss some of the hard medical problems we’re solving with data – focusing on our machine learning workflow & how we go from research...

Callum Tilbury

Naoya Muramatsu – The Motion Capture System for Wildlife

Abstract Understanding and monitoring wildlife behaviour is crucial in ecology and biomechanics, yet challenging due to the limitations of current methods. To address this issue, we introduce two motion capture system specifically tailored for free-ranging wildlife observation. These systems combine...

Siphelele Danisa – Learning at the Edge of Stability

Speaker Bio I am a Data Scientist at the Bank of Montreal, where my work primarily focuses on modeling volatility in the equity space. Previously, I completed an MSc in Computer Science at the University of Toronto and an MSc...

Narmeen Oozeer – Orbits classification of the CRTBP using deep learning approximations of the Koopman operator

Online link: https://uct-za.zoom.us/j/92750361177?pwd=QzNiRzBJRjRITVlwa2k5SVNkVmx5UT09

Ryan Smith – Novel approaches for understanding the neurocomputational basis of interoception and emotion-cognition interactions

Novel approaches for understanding the neurocomputational basis of interoception and emotion-cognition interactions SAVE THE

Bruce Bassett – Part 2: Is Artificial General Intelligence (AGI) really around the corner and how would it affect science?

TBA

Deep generative modelling aiding spatial statistics – Elizaveta Semenova

Speaker: Elizaveta Semenova, ML Researcher, Oxford & Imperial College London Abstract Disease mapping is an important surveillance tool that enables researchers and public health officials to analyse the spatial distribution of a disease, identify its geographical patterns, and plan interventions....

Is Artificial General Intelligence (AGI) really around the corner and how would it affect science? – Bruce Bassett

Speaker Bio Bruce Bassett has been a Full Professor of Applied Mathematics at the University of Cape Town since 2008 where his research explores both the theory and applications of AI and statistical models. Bruce was formerly head of Data...

Growing the MARL software ecosystem in JAX – InstaDeep MARL Team

Speaker Bio The MARL research team at InstaDeep works on large-scale multi-agent learning with a focus on algorithmic innovation in cooperative systems for industrial applications. The team regularly contributes to the research community through publications at venues such as NeurIPS...

Noah De Nicola

Efficient Representation of Natural Image Patches – Cheng Guo

Speaker Bio I have a Ph.D. in physics and currently work as an AI specialist at Allianz. In my spare time, I enjoy researching to understand how our visual system works, approaching it from first principles.

Felix Chalumeau – RL for Combinatorial Optimization: from Foundations to SOTA

Speaker: Felix Chalumeau, InstaDeep Research Abstract In this talk, we will introduce the challenges of combinatorial optimization and the motivation to tackle them with Deep Learning and Reinforcement Learning. We will walk through some core breakthroughs that happened through the...

Callum Tilbury – Generalisable Agents for Neural Network Optimisation (GANNO)

Speaker: Callum Rhys Tilbury, Junior Research Engineer @ InstaDeep Abstract Optimising deep neural networks is a challenging task due to complex training dynamics, high computational requirements, and long training times. To address this difficulty, we propose the framework of Generalisable...

Divanisha Patel – Reinforcement Learning and its Applications to Real-World Problems

Speaker: Divanisha Patel, PhD Candidate @ Wits | AI Research Engineer @ InstaDeep Abstract This talk will provide an introductory overview of reinforcement learning and its key concepts. We will then focus on how InstaDeep is using reinforcement learning to...

MARL for energy grid control – InstaDeep MARL Team

Speaker Bio We are the MARL research team from InstaDeep’s Cape Town office. We focus on the most recent advantages of MARL with a focus on JAX-based algorithms and environments

Tswelopele – A Proposal for Privacy Guarantees in Model Inference

Speaker: Tswelopele, BSc(Hons) Math UCT | MWR CyberSec Abstract This talk will go through some ideas behind my research proposal for my Masters. The proposal puts forth a contribution towards a systematisation of privacy guarantees for machine-learning model-inference with explicit...

ARGs: The Graph Theory of Evolution – Duncan Robertson

Abstract In this talk, I will introduce the ancestral recombination graph (ARG): a powerful way to encode the ancestry of a species through its DNA. ARGs have enabled us to simulate and study evolution on a massive scale, while also...

Categorical approach to concepts – Tali Beynon

Abstract I’ll outline an idea I had during our Betty’s Bay getaway, a “thought experiment” in how we might mathematically model symbolic concepts using ideas from category theory. Tali Beynon 17 January 2024

Scaling multi-agent reinforcement learning to eleven aside simulated robot soccer – Dries Smit

Abstract Robot soccer, where teams of autonomous agents compete against each other, has long been regarded as a grand challenge in artificial ntelligence. Despite recent successes of learned policies over heuristics and handcrafted rules in other domains, current teams in...

Subword Segmental Machine Translation for South African Languages – Francois Meyer

Abstract Deep learning has advanced the field of machine translation immensely. However, these advances have not been fully realised for all South African languages, because they are low-resourced and lack sufficient training data. Additionally, the Nguni languages of South Africa...

Reintegrating AI: Skills, Symbols, and the Sensorimotor Dilemma – Prof George Konidaris

Abstract AI has never settled on a widely accepted, or even well-formulated, definition of its primary scientific goal: designing a general intelligence. Instead it consists of siloed subfields studying isolated aspects of intelligence, each of which is important but none...

Concurrent and Temporal Composition for Zero-shot Transfer in Reinforcement Learning – Steven James

Abstract While reinforcement learning has achieved recent success in many challenging domains, these methods generally require millions of samples from the environment to learn optimal behaviours, limiting their real-world applicability. A major challenge is thus in designing sample-efficient agents that...

Street view images and the urban environment – measuring characteristics under assumptions of label scarcity – Emily Muller

Abstract Measurements which characterise urban neighbourhoods have often been collected using traditional survey techniques. This approach, while able to directly capture upstream determinants of health, are expensive and usually difficult to scale across entire cities. On the other hand, routinely...

Honours Projects

In this session Batsi and Ruan will share some aspects of their respective research areas. Though this is aimed at the current cohort of UCT honours students taking the RL module, you are invited to attend.

Hiking through the wilderness of neural network loss landscapes – Dr Anna Bosman

Abstract Deep neural network training is a highly non-convex optimisation problem with poorly understood properties. We know that a solution can be found by following the negative gradient to walk down the loss landscape, but we have little guarantees that...

An Introduction to Variational Inference and its Application in Deep Learning – Jacobie Mouton

Abstract Bayesian inference allows us to calculate the posterior distribution of unknown variables given observations, using Bayes’ Theorem. In practice however, it is typically the case that this posterior distribution is intractable to compute exactly. This tutorial introduces variational inference...

Shocklab x InstaDeep x UCT AI Society: Exclusive Film Screening

Presented by InstaDeep and AI Society 19 September 2023

Beyond Python: Why you should consider Julia for your next reinforcement learning project – Sasha Abramowitz

Abstract This talk covers a brief intro to Julia programming language. It then compares it to the other options out there for reinforcement learning (and deep learning in general) in terms of usability and speed. Sasha Abramowitz is a research...

Voice conversion with just nearest neighbours – Matthew Baas

Abstract Voice conversion aims to transform speech into a target voice with just a few example recordings of the target speaker. Recent methods produce convincing conversions, but at the cost of increased complexity – making results difficult to reproduce and...

Partially Automating the Improvement of Learning Agents (PAILA)

Abstract The PAILA project, undertaken during our InstaDeep internship, aims to bolster single-environment Reinforcement Learning (RL) algorithms through cross-environment knowledge sharing. To achieve this, we aimed to use symmetric learning agents (SymLA), a meta-reinforcement learning algorithm introducing backpropagation symmetries that...

Denoising Diffusion Models: Introduction and Applications

Abstract Denoising Diffusion Models are a type of generative modelling which serves backbone of recent advances in image synthesis including Dall-E 2, Midjourney, and Imagen. These models utilise an iterative denoising process during inference to produce high quality samples. In...

Modular Evolutionary Origami Robotics

Abstract Evolutionary robotics lends itself to exploring novel design paradigms in research to assess the efficacy of those designs relative to known paradigms in the space. Origami is one such paradigm that has been relatively under-explored, and has many potential...

Surveying research directions on AI safety – Benjamin Sturgeon

Abstract AI safety is a subject which has often been viewed with skepticism regarding its necessity and plausibility in the AI community. However, as we have progressed towards transformational AI systems the urgency of this research has become apparent.In this...

Efficient Inverse RL – Gokul Swamy

Abstract Interactive learning systems like self-driving cars, recommender systems, and large language model chatbots are becoming increasingly ubiquitous in everyday life. From a machine learning perspective, the key technical challenge underlying such systems is that rather than simple prediction on...

The Impact of Morphological Diversity in Robot Swarms

Abstract In nature, morphological diversity enhances functional diversity, however, there is little swarm (collective) robotics research on the impact of morphological and behavioral (body-brain) diversity that emerges in response to changing environments. This study investigates the impact of increasingly complex...

Molecule Design Based on Multi-objective Optimisation and Graph Transformers

Abstract I will be presenting an empirical exploration of using machine learning and evolutionary algorithms to automate chemical product design. Our study demonstrates how computational design can be controlled via hyper-parameters to generate solutions with desired features and has important...

Simulating the Past, Present and Future Using Agent-Based Models

Abstract Humans are fundamentally social creatures, we live in families, work in teams and our norms of formed from thousands of years of social interaction. What if, along that https://www.youtube.com/watch?v=t3GR91yjOzY Brandon Gower-Winter is a PhD Candi 31 May 2023

Intuitive explanations of the transformer model

Abstract In this talk I want to explain in as clear a way as possible what the key concepts are in a transformer model, explain key terms, and discuss why the transformer is so effective. Watch Benjamin Sturgeon I am...

Supporting RL Evaluation with Multi-Criteria Decision Analysis

Abstract The evaluation of empirical algorithm performances in RL appears a closed topic. However, some (sparse) recent research provides unattended criticisms of key elements of the evaluations which are central to the conclusions of many research papers. This talk discusses...

AI 4 Health in Production – Africa

Abstract I explore the challenges facing production AI for health systems in an African context. Progressively I step through the layers of complexity, one can expect to encounter, providing personal insight for addressing some challenges I have found to be...

A Folk Theorem from Learning in Games

Abstract We introduce a generalisation of smooth fictitious play with bounded m-memory strategies. We use this learning algorithm to prove a Folk theorem from learning in repeated potential games. If a payoff profile is supported by an m-memory pure strategy...

Selective Reincarnation in Multi-Agent Reinforcement Learning

Abstract Claude presents his work on selective reincarnation for MARL. Claude Formanek 5 April 2023

PyTorch and Weights and Biases for ML

Abstract Jeremy give’s an overview of PyTorch and Weights and Biases, emphasising how these are useful for ML in production and in research. Jeremy du Plessis 22 March 2023

Neurips in a nutshell

Abstract Ruan’s highlights and takeaways of NeurIPS 2022. Ruan de Kock 15 February 2023

Visual cortex is optimised for short timescale prediction using spikes

Visual cortex is optimised for short timescale prediction using spikes Abstract A key question in systems neuroscience is to understand what principles underly the sensory processing throughout the brain. Why are certain neurons in V1 selectively tuned to orientated bars?...

Towards Lifelong Reinforcement Learning through Logical Skill Composition

Towards Lifelong Reinforcement Learning through Logical Skill Composition Abstract Reinforcement learning has achieved recent success in a number of difficult, high-dimensional environments. However, these methods generally require millions of samples from the environment to learn optimal behaviours, limiting their real-world...

Harnessing the wisdom of an unreliable crowd for autonomous decision making

Generalisation in ML Abstract In Reinforcement Learning there is often a need for greater sample efficiency when learning an optimal policy, whether due to the complexity of the problem or the difficulty in obtaining data. One family of approaches to...

Offline MARL and how to effectively use WANDB for ML experiments

Offline MARL and how to use WANDB effectivly for ML experiments Abstract Claude gave a talk on his research topic, Offline MARL, and also gave a tutorial on how to use Weights and Biases for ML experiments. SPEAKER Claude Formanek...

Generalisation in a Nutshell

Abstract Ruan de Kock presents an overview of generalisation in RL.

Shocklab seminar playlist

ShockLab

NAVIGATION

Contact