AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO
Paper
•
2502.14669
•
Published
•
11
Great job guys, reasoning bringing so many potential!
we also have similiar idea! but only applied for maze
https://huggingface.co./homebrewltd/AlphaMaze-v0.2-1.5B