18–20 Feb 2025
Lamarr/RC Trust Dortmund
Europe/Berlin timezone

Concrete vs Abstract Planning for Large Language Models

19 Feb 2025, 12:20
20m
JvF25/3-303 - Conference Room (Lamarr/RC Trust Dortmund)

Joseph-von-Fraunhofer-Straße 25 44227 Dortmund

Speaker

Florian Mai

Description

Large language models are strong heuristic reasoners, but their planning abilities remain poor. We introduce a method that lets language models learn to plan from unlabeled data: a planner model predicts many steps ahead, and the language model is conditioned on the predicted plans. A crucial parameter in this framework is the level of abstraction of the generated plans. Some tasks arguably benefit more from high-level planning (e.g., creative writing), while others require planning in the concrete language space (e.g., mathematical reasoning). In this talk, we explore both ends of the spectrum and ask whether the right granularity can be learned from data.
