Stability-Guided Exploration for Diverse Motion Generation

Eckart Cobo-Briesewitz, Tilman Burghoff, Denis Shcherba, Armand Jordana, Marc Toussaint

The image shows the visual abstract of the paper. It visualizes how StaGE (the method) generates interaction. A large blue sheet shows the manifold of stable configurations, while lines spawing from one point, leaving the manifold, only to end up on it in the end symbolize how StaGE works: The interactions start and end in stability, but are allowed to move through unstable states in between.

Scaling up datasets is highly effective in improving the performance of deep learning models, including in the field of robot learning. However, data collection still proves to be a bottleneck. Approaches relying on collecting human demonstrations are labor-intensive and inherently limited: they tend to be narrow, task-specific, and fail to adequately explore the full space of feasible states. Synthetic data generation could remedy this, but current techniques mostly rely on local trajectory optimization and fail to find diverse solutions. In this work, we propose a novel method capable of finding diverse long-horizon manipulations through black-box simulation. We achieve this by combining an RRT-style search with sampling-based MPC, together with a novel sampling scheme that guides the exploration toward stable configurations. Specifically, we sample from a manifold of stable states while growing a search tree directly through simulation, without restricting the planner to purely stable motions. We demonstrate the method’s ability to discover diverse manipulation strategies, including pushing, grasping, pivoting, throwing, and tool use, across different robot morphologies, without task-specific guidance.

Read the full paper here

Accepted at ICRA Xplore Workshop 2026 (Oral)

arXiv · Code · Video · Bibtex