PokéAgent Challenge (NeurIPS 2025)

22 August 2025, by Omkar Kondhalkar

Photo: Arian Urdu

Addressing the NeurIPS 2025 PokéAgent Challenge (Track 2: Speedrunning Pokémon Emerald) by comparing Large Language Model agents (LLMs) and Deep Reinforcement Learning (RL). The core problem is automating the speedrun of the game without hard-coded solutions, a task demanding long-term planning and efficient exploration. The game is an ideal testbed for long time horizons and complex strategic resource planning, where immediate actions lack immediate rewards. The research will evaluate LLMs (local and API-based) alongside RL agents trained with methods like PPO (Proximal Policy Optimization). The goal is to analyze the respective strengths and limitations of LLM agents and RL in dynamic, goal-oriented environments. The turn-based nature of battles and lack of penalty for waiting in the game world is particularly suited to the current computational constraints of LLM agents.

Latest articles

AI-Driven Cryptocurrency Price Prediction

Photo: base.camp

14.01.2026|basecamp-projects

AI-Driven Cryptocurrency Price Prediction: Evaluating Transfer Learning

Extreme volatility and hidden market dependencies make cryptocurrency price forecasting a persistent challenge. To address this, the work investigates deep learning models for high-frequency price prediction, comparing Multilayer Perceptrons, LSTM networks, and Time-Series Transformers. A central...

Initial prototype of the toilet-cleaning robot. The 3D-printed LeRobot SO-101 arm is mounted on a TurtleBot 4 mobile platform

Photo: base.camp

14.01.2026|basecamp-projects

Mobile Manipulation Robot for Toilet Cleaning

Household cleaning remains one of the most physically demanding and least automated domestic tasks. Responding to this gap, the work presents an intelligent toilet-cleaning robot built on a mobile platform with a dual-arm manipulation system. Instead of relying on task-specific hardware, the robot...