Module 4: Perception & Spatial Intelligence
Sensors, SLAM & 3D Scene Understanding
Duration: 7 hours · Level: Intermediate · Lessons: 4
G1 must understand its world in real-time: where objects are, what they are, and how to interact with them. Build the full perception stack from sensor fusion to semantic scene graphs.
Prerequisites
Learning outcomes
By the end of this module you will be able to:
- Design a complete sensor suite for a humanoid robot
- Implement real-time SLAM and 3D scene understanding
- Apply foundation models for open-vocabulary object detection
Lessons in this module
- 4.1 — Sensor Suite Design for Humanoids · 45 min
- 4.2 — Real-Time SLAM for Indoor Navigation · 55 min
- 4.3 — 3D Gaussian Splatting for Robot Scene Understanding · 50 min
- 4.4 — Foundation Models for Open-Vocabulary Perception · 55 min
👉 Start here: 4.1 — Sensor Suite Design for Humanoids