Figure 02
Figure AI

Figure 02 is the second-generation humanoid robot developed by Figure AI, a California-based robotics company. Designed for commercial viability and real-world applications, it represents a significant step forward in autonomous humanoid technology over its predecessor, Figure 01.

Description

Figure 02 is Figure AI's second-generation humanoid robot, unveiled in August 2024. It stands 168 cm tall, weighs 70 kg, and has a matte-black exterior with cabling fully integrated into its limbs, which improves durability, reliability, and suitability for industrial environments. The design is a ground-up rework: custom electric motors in each joint, a 2.25 kWh battery pack integrated into the torso for better balance and up to 5 hours of runtime, and dual NVIDIA RTX GPU-based modules that provide approximately 3x the on-device AI inference compute of its predecessor, Figure 01.

At the core of Figure 02's intelligence is Helix, a generalist Vision-Language-Action (VLA) model that unifies perception, language understanding, and control. Helix uses a dual-system architecture. System 2 (S2) is a 7B-parameter open-source Vision-Language Model that runs at 7-9 Hz and performs semantic reasoning over monocular images, robot state, and natural language prompts. System 1 (S1) is an 80M-parameter transformer that runs at 200 Hz and handles high-rate visuomotor control across a 35-DoF upper-body action space covering the wrists, fingers, torso, and head. Trained end-to-end on ~500 hours of teleoperated data with auto-labeled natural language instructions, Helix generalizes zero-shot to thousands of novel household and industrial objects and supports precise manipulation, multi-robot collaboration, and real-time adaptation without task-specific fine-tuning. All inference runs onboard, split across the two embedded GPUs in a model-parallel fashion for low-latency, closed-loop operation.

Complementing Helix, Figure's partnership with OpenAI provides speech-to-speech capabilities, allowing natural voice conversations through onboard microphones and speakers. The vision system uses six RGB cameras for perception, obstacle avoidance, and hand-eye coordination. The fourth-generation hands have 16 degrees of freedom and a human-like structure with 10 fingers in total. The robot can lift payloads of up to 25 kg and walks at up to 3 km/h.

Figure 02 also has a real-world deployment record. In an 11-month pilot at BMW Group Plant Spartanburg starting in late 2024, a fleet of Figure 02 robots autonomously loaded sheet-metal parts onto welding fixtures, achieving >99% placement accuracy within a 5 mm tolerance and 37-second load times. Over 1,250 operational hours, worked in 10-hour shifts, the robots handled 90,000+ parts, contributed to the production of 30,000+ BMW X3 vehicles, and logged 1.2 million steps (~200 miles). Hardware failures were few and occurred primarily in the forearm subsystem, where thermal and cabling issues were the main causes. This data informed the redesign for Figure 03, and Figure 02 was retired in November 2025. The deployment validated adaptive locomotion, field calibration, and AI-driven precision in dynamic manufacturing settings, marking a first for humanoid robots on production lines. Figure 02's combination of robust hardware and end-to-end AI positions it as a bridge to scalable commercial humanoids.
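
The pilot figures quoted above can be cross-checked with some simple arithmetic. The sketch below is illustrative only: it treats the reported values as exact even though several are stated as lower bounds ("90,000+"), and it assumes each loaded part corresponds to one 37-second load cycle, which the source does not state. All constant and function names are hypothetical.

```python
# Reported pilot figures (treated as exact for illustration).
OPERATING_HOURS = 1250
PARTS_LOADED = 90_000
LOAD_TIME_S = 37            # reported load time per cycle
STEPS = 1_200_000
DISTANCE_MILES = 200        # reported as an approximation
METERS_PER_MILE = 1609.344


def pilot_summary() -> dict:
    """Derive throughput and stride estimates from the reported totals."""
    parts_per_hour = PARTS_LOADED / OPERATING_HOURS
    # Assumption: one 37-second load cycle per part.
    loading_hours = PARTS_LOADED * LOAD_TIME_S / 3600
    loading_share = loading_hours / OPERATING_HOURS
    step_length_m = DISTANCE_MILES * METERS_PER_MILE / STEPS
    return {
        "parts_per_hour": parts_per_hour,
        "loading_hours": loading_hours,
        "loading_share": loading_share,
        "step_length_m": step_length_m,
    }


if __name__ == "__main__":
    for name, value in pilot_summary().items():
        print(f"{name}: {value:.3f}")
```

Under these assumptions the totals work out to 72 parts per operating hour, about 925 hours spent in load cycles (roughly 74% of the operating time), and an average step length of about 0.27 m.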

Key Features

Helix VLA Model

Generalist Vision-Language-Action model enabling zero-shot manipulation of novel objects via natural language, with 200 Hz control and multi-robot collaboration, running fully onboard.
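
To make the two-rate design concrete, here is a minimal sketch of how a slow reasoning loop (S2, 7-9 Hz) can feed a fast control loop (S1, 200 Hz). It is a single-threaded simulation, not Figure's implementation: the `Latent` type, the `s2_reason` and `s1_act` placeholders, and the choice of 8 Hz as a midpoint for S2 are all assumptions made for illustration. The real system is described as running the two models on separate GPUs.

```python
from dataclasses import dataclass
from typing import List, Tuple

S1_HZ = 200      # fast visuomotor policy rate (from the description)
S2_HZ = 8        # assumed midpoint of the stated 7-9 Hz range
ACTION_DIM = 35  # upper-body action space size (from the description)


@dataclass
class Latent:
    """Hypothetical stand-in for the semantic output S2 passes to S1."""
    step: int  # fast-loop tick at which S2 produced this latent


def s2_reason(tick: int) -> Latent:
    # Placeholder for the 7B VLM: it only records when it ran.
    return Latent(step=tick)


def s1_act(latent: Latent, tick: int) -> List[float]:
    # Placeholder for the 80M policy: a zero action of the right size.
    return [0.0] * ACTION_DIM


def run(seconds: float) -> Tuple[int, int, int]:
    """Return (actions emitted, S2 updates, worst latent staleness in ticks)."""
    ticks = int(seconds * S1_HZ)
    s2_period = S1_HZ // S2_HZ  # fast ticks between S2 updates
    latent = s2_reason(0)
    s2_updates, actions, max_staleness = 1, 0, 0
    for tick in range(ticks):
        if tick > 0 and tick % s2_period == 0:
            latent = s2_reason(tick)  # slow loop refreshes the latent
            s2_updates += 1
        s1_act(latent, tick)          # fast loop always uses the newest latent
        actions += 1
        max_staleness = max(max_staleness, tick - latent.step)
    return actions, s2_updates, max_staleness


if __name__ == "__main__":
    print(run(1.0))
```

In this simulation one second of operation yields 200 control actions from only 8 reasoning updates, and the controller never acts on a latent more than 24 ticks (120 ms) old. The point of the split is that the fast loop keeps reacting at full rate even though the large model updates far less often.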

Speech-to-Speech Interaction

Powered by custom models developed with OpenAI; supports real-time conversations with people and voice-commanded tasks through integrated microphones and speakers.

Advanced Hands

Human-scale hands with 16 DoF and 10 fingers (five per hand), a 25 kg payload capacity, and the dexterity needed for industrial pick-and-place and household chores.

Integrated Torso Battery

2.25 kWh pack provides up to 5 hours of runtime, lowers the center of mass for stability, and holds 50% more capacity than its predecessor's.
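
As a rough sanity check on these battery figures, the sketch below derives the average power draw implied by a 2.25 kWh pack lasting 5 hours, and the predecessor capacity implied by "50% more". Both results are inferences rather than published specifications: the calculation assumes the full pack capacity is usable and that "50% more" means exactly 1.5x. The function names are hypothetical.

```python
BATTERY_KWH = 2.25   # pack capacity (from the text)
RUNTIME_H = 5.0      # runtime per charge (from the text)


def average_power_w(capacity_kwh: float, runtime_h: float) -> float:
    """Average draw in watts if the whole pack is used over the runtime."""
    return capacity_kwh * 1000.0 / runtime_h


def implied_predecessor_kwh(capacity_kwh: float, increase: float = 0.50) -> float:
    """Capacity the predecessor would need for the stated increase to hold."""
    return capacity_kwh / (1.0 + increase)


if __name__ == "__main__":
    print(average_power_w(BATTERY_KWH, RUNTIME_H))   # average watts
    print(implied_predecessor_kwh(BATTERY_KWH))      # implied kWh for Figure 01
```

Under these assumptions the average draw is 450 W and the implied predecessor capacity is 1.5 kWh. Real consumption will vary with the task, so the 450 W figure is best read as an average over a full discharge.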

Onboard Compute

Dual NVIDIA RTX GPU-based modules deliver about 3x the on-device inference compute of Figure 01, supporting autonomous perception, reasoning, and action without cloud dependency.

Vision System

Six RGB cameras enable robust environmental perception, obstacle avoidance, and visual reasoning via onboard VLM.

Specifications

Availability: Prototype
Nationality: US
Manufacturer: Figure AI
Website: https://www.figure.ai/
Verified: Not verified
Main market: Industry, logistics, warehousing
Safe with humans: Yes
Height [cm]: 168
Weight [kg]: 70
Degrees of freedom, overall: 28
Degrees of freedom, hands: 16
Number of fingers: 10
Strength / payload [kg]: 25
Max speed [km/h]: 3
Walking speed [km/h]: 3
Runtime per charge [hours]: 5
Battery [kWh]: 2.25
Manipulation performance: 3
Navigation performance: 3
CPU/GPU: On-robot compute with dual NVIDIA RTX GPU-based modules; ~3x on-device inference vs Figure 01
Cameras: 6 RGB cameras
AI models: Helix VLA (S1: 80M parameters, 200 Hz; S2: 7B VLM, 7-9 Hz); OpenAI speech-to-speech
LLM integration: OpenAI speech-to-speech voice; onboard VLM/VLA ("Helix") for perception and reasoning
Vision processing: Onboard VLM for perception/reasoning from raw pixels
Motor tech: Custom electric motors integrated with joint drivetrains; the system is fully electric
Sensors: Onboard microphones, speakers; force/torque in fingers
Materials: Matte-black polymer exterior with integrated cabling; CNC-machined components
Thermal management: Challenges in forearm (addressed in successor); low-power embedded GPUs

Frequently Asked Questions

What is the primary AI model used in Figure 02?

Helix, a Vision-Language-Action (VLA) model with System 1 (80M parameters, 200 Hz control) and System 2 (7B VLM, 7-9 Hz reasoning), trained on ~500 hours of teleoperated data for zero-shot generalization to novel tasks and objects using natural language prompts.

What was Figure 02's real-world deployment at BMW?

An 11-month pilot at BMW Group Plant Spartanburg in which robots loaded sheet-metal parts onto welding fixtures with >99% placement accuracy, handling 90,000+ parts over 1,250 hours and contributing to 30,000+ X3 vehicles before the robot's retirement in November 2025.

What are the key hardware specs of Figure 02?

168 cm height, 70 kg weight, 28 DoF overall (16 in hands), 25 kg payload, 3 km/h speed, 2.25 kWh torso battery (up to 5 hours of runtime), six RGB cameras, dual NVIDIA RTX GPU-based modules, custom electric motors, and a matte-black exterior with integrated cabling.

How does Figure 02 handle voice interaction?

Via OpenAI-trained speech-to-speech models integrated with onboard mics/speakers, enabling fluid human conversations, command following, and real-time task adjustment without external processing.

Is Figure 02 still in production?

No, it was retired in November 2025 following Figure 03's release; deployment data accelerated Figure 03's design, focusing on manufacturability and reliability improvements.

What tasks can Figure 02 perform autonomously?

Industrial pick-and-place (e.g., BMW sheet-metal), household chores like laundry folding/dishwasher loading, multi-robot collaboration for grocery storage, precise manipulation of novel objects via Helix AI.
