Sensors & Cameras¶

Read simulation state through MuJoCo sensors, and capture the scene through cameras that output photoreal RGB, depth, or segmentation masks.

Sensors¶

A sensor reads a quantity from the simulation each step. URLab imports every MuJoCo sensor type from MJCF as a component under SensorsRoot, and you can add more in the Blueprint editor (set the sensor's Target to choose what it measures). See Articulations.

Rather than memorise individual classes, think in categories:

Joint and tendon state. Position, velocity, and force/torque for joints, tendons, and their limits.
Actuator state. Commanded position, velocity, and force per actuator.
Body and frame state. Position, orientation, axes, linear/angular velocity, and linear/angular acceleration of a body, site, or geom frame; subtree center of mass, linear velocity, and angular momentum.
Inertial measurement. Accelerometer, gyroscope, velocimeter, magnetometer (the building blocks of an IMU).
Contact and touch. Touch sensors, contact sensors, force and torque at a site, and tactile arrays.
Geometry queries. Range finders, geom-to-geom distance, normals, and whether a point lies inside a site.
Energy and clock. Kinetic and potential energy, simulation time.
Custom. User and plugin sensors for values your own code computes.

Reading sensors¶

Each sensor returns either a single scalar or a short vector. Read them by name from the articulation:

float Touch = Robot->GetSensorScalar("fingertip_touch"); // scalar sensors
TArray<float> Force = Robot->GetSensorReading("wrist_force"); // vector sensors
float Angle = Robot->GetJointAngle("elbow"); // joint-position shortcut

Use GetSensorScalar for scalar readings (touch, joint position, clock) and GetSensorReading for vector readings (force, accelerometer, gyro). Both are Blueprint-callable. Live readouts also appear in the Simulate Dashboard, and sensor values stream to Python over the bridge (see Python).

Cameras¶

A UMjCamera renders the scene from a fixed viewpoint into a render target you can preview in the dashboard or stream to Python. Imported cameras keep their MuJoCo intrinsics: where the MJCF used focal, focalpixel, or sensorsize, the importer derives the field of view so the Unreal camera matches MuJoCo. See Importing Robots.

Capture modes¶

Each camera has a CaptureMode property (EMjCameraMode), set per-camera at design time in the Details panel. It decides what the render target contains:

Mode	Output
`Real` (default)	Photoreal RGB. Matches the viewport, respects post-process.
`Depth`	Linear scene depth, in centimetres.
`SemanticSegmentation`	A color tint per actor class.
`InstanceSegmentation`	A unique color tint per body.

One camera produces one stream. To capture RGB plus depth plus masks from the same viewpoint, place several cameras at the same transform with different modes. Segmentation modes do not swap materials on the real meshes, so an RGB camera and a segmentation camera can stream the same frame at once without contaminating each other.

For Depth, also set DepthNearCm (default 10) and DepthFarCm (default 10000); the dashboard uses these to normalise the grayscale preview.

Note

CaptureMode is read when streaming starts. To change the mode of a camera that is already running, toggle streaming off and on:

Cam->SetStreamingEnabled(false);
Cam->CaptureMode = EMjCameraMode::Depth;
Cam->SetStreamingEnabled(true);

Previewing and streaming¶

The dashboard's camera panel shows every camera on the selected articulation, each rendering its configured mode live, with no extra setup. See Simulate Dashboard.

For headless or full-framerate capture, stream frames to Python over the bridge. Each camera can broadcast over ZMQ or shared memory; the consumer interprets the bytes per the camera's mode (RGBA for Real, float32 centimetres for Depth, BGRA tint for the segmentation modes). See Python Policies and the bridge protocol.

Warning

Camera capture modes are independent of the key-6 viewport segmentation overlay in Debug Visualization. The hotkey overlay swaps real materials and therefore appears in any capture; the per-camera modes are the right tool for clean, persistent RGB, depth, or mask streams. Note that the segmentation tints are lit-shaded rather than flat, so they are good for visual debugging but not pixel-exact ground truth.