POMA-3D: The Point Map Way to 3D Scene Understanding - Embodied Localization Demo

Enter agent's situation text and choose Top-K; the most relevant views will turn red.