Module 4: Perception & Spatial Intelligence

Sensors, SLAM & 3D Scene Understanding

Duration: 7 hours · Level: Intermediate · Lessons: 4

G1 must understand its world in real time: where objects are, what they are, and how to interact with them. In this module you build the full perception stack, from sensor fusion to semantic scene graphs.

Prerequisites

Learning outcomes

By the end of this module you will be able to:

  • Design a complete sensor suite for a humanoid robot
  • Implement real-time SLAM and 3D scene understanding
  • Apply foundation models for open-vocabulary object detection

Lessons in this module

  1. 4.1 — Sensor Suite Design for Humanoids · 45 min
  2. 4.2 — Real-Time SLAM for Indoor Navigation · 55 min
  3. 4.3 — 3D Gaussian Splatting for Robot Scene Understanding · 50 min
  4. 4.4 — Foundation Models for Open-Vocabulary Perception · 55 min

👉 Start here: 4.1 — Sensor Suite Design for Humanoids