Language-conditioned manipulation with NVIDIA GR00T N1.5.
Speak the task ('pick up the red block, put it in the bowl'); a vision-language-action policy plans the grasp, executes the trajectory, and reports success. Runs on Jetson Thor-class (e.g., T4000/T5000) compute with TensorRT-optimized inference. Reference implementation: SIGRobotics' Matcha Bot, embodied-AI hackathon winner.