Convert

Convert TrossenMCAP episodes to LeRobot V2 format (Parquet + MP4) using the trossen_mcap_to_lerobot_v2 tool that ships with the SDK. The resulting dataset is usable directly with the LeRobot framework.

What You Need 

A completed build. The conversion tool is always built as part of the standard SDK build (see Installation and Setup).
At least one .mcap episode recorded with the SDK. If you have not recorded anything yet, follow Record.
ffmpeg on PATH for video encoding (installed in the system dependencies step).

Building the Tool 

The tool is produced by the standard build:

cd build
cmake ..
make trossen_mcap_to_lerobot_v2

The resulting binary lives at build/scripts/trossen_mcap_to_lerobot_v2.

Usage 

The tool operates in two modes, distinguished by the first argument:

# Single-episode mode (first argument is an .mcap file)
./build/scripts/trossen_mcap_to_lerobot_v2 <path/to/episode_000000.mcap>

# Batch mode (first argument is a directory containing .mcap files)
./build/scripts/trossen_mcap_to_lerobot_v2 <path/to/dataset_folder/>

Examples per robot:

./build/scripts/trossen_mcap_to_lerobot_v2 \
    ~/.trossen_sdk/solo_dataset/

./build/scripts/trossen_mcap_to_lerobot_v2 \
    ~/.trossen_sdk/stationary_dataset/

./build/scripts/trossen_mcap_to_lerobot_v2 \
    ~/.trossen_sdk/mobile_dataset/

Note

Episode numbers are extracted from the filename (episode_NNNNNN.mcap, six-digit zero-padded). Batch mode processes all matching files in episode-index order.

Configuring the Conversion 

All conversion settings live in a single JSON config file. By default, the tool loads scripts/trossen_mcap_to_lerobot_v2/config.json from the repository root. Point it at a different config with --config <path>.

{
  "lerobot_v2_backend": {
    "type":                  "lerobot_v2_backend",
    "root":                  "~/.cache/huggingface/lerobot",   // Output directory
    "repository_id":         "TrossenRoboticsCommunity",       // Top-level folder name
    "dataset_id":            "mcap_converted_dataset",         // Sub-folder for this dataset
    "robot_name":            "trossen_solo_ai",                // Stored in meta/info.json
    "task_name":             "perform a generic task",         // Default task description
    "fps":                   30.0,                             // Frame rate written to info.json
    "encode_videos":         true,                             // Set false to skip MP4 encoding
    "encoder_threads":       2,                                // FFmpeg threads per camera
    "max_image_queue":       10,                               // Back-pressure bound
    "png_compression_level": 5,                                // Intermediate PNG quality
    "chunk_size":            2,                                // Episodes per chunk-NNN folder
    "episode_index":         0,                                // Starting episode index
    "overwrite_existing":    false                             // Refuse to clobber existing files
  }
}

Key fields to update for each new dataset:

Field	Meaning
`root`	Directory where the dataset tree is written
`repository_id` / `dataset_id`	Identify the output dataset; produce `<root>/<repository_id>/<dataset_id>/`
`robot_name`	Written into `meta/info.json` so consumers can distinguish robot types
`task_name`	Default task description written to `meta/tasks.jsonl`
`fps`	Frame rate stored in `info.json`. Should match the camera poll rate used at recording time.
`encode_videos`	Disable to produce Parquet only (useful for joint-state-only pipelines)

CLI Overrides 

Any JSON key can be overridden at runtime with --set KEY=VALUE. This is useful for batch scripting without editing the config file:

./build/scripts/trossen_mcap_to_lerobot_v2 \
    ~/.trossen_sdk/solo_dataset/ \
    --set lerobot_v2_backend.robot_name=trossen_solo_ai \
    --set lerobot_v2_backend.dataset_id=solo_v1

./build/scripts/trossen_mcap_to_lerobot_v2 \
    ~/.trossen_sdk/stationary_dataset/ \
    --set lerobot_v2_backend.robot_name=trossen_stationary_ai \
    --set lerobot_v2_backend.dataset_id=stationary_v1

./build/scripts/trossen_mcap_to_lerobot_v2 \
    ~/.trossen_sdk/mobile_dataset/ \
    --set lerobot_v2_backend.robot_name=trossen_mobile_ai \
    --set lerobot_v2_backend.dataset_id=mobile_v1

Inspect the merged config (JSON plus overrides) before running:

./build/scripts/trossen_mcap_to_lerobot_v2 --dump-config

Output Layout 

The tool writes a LeRobot V2 dataset rooted at the repository ID and dataset ID from your config:

<root>/
└── <repository_id>/
    └── <dataset_id>/
        ├── meta/
        │   ├── info.json              dataset metadata and feature descriptions
        │   ├── episodes.jsonl         one entry per episode
        │   ├── episodes_stats.jsonl   per-episode min/max/mean/std statistics
        │   └── tasks.jsonl            task descriptions
        ├── data/
        │   └── chunk-000/
        │       ├── episode_000000.parquet
        │       └── ...
        └── videos/
            └── chunk-000/
                └── observation.images.<camera_id>/
                    ├── episode_000000.mp4
                    └── ...

info.json is created on the first episode and updated after each subsequent conversion, so running the tool incrementally as you record is safe.

Parquet Schema 

Each episode produces one .parquet file (SNAPPY compression) with the following columns:

Column	Type	Description
`timestamp`	`float32`	Seconds from first frame (`0.0` at start)
`observation.state`	`list<float64>`	Concatenated follower joint observations
`action`	`list<float64>`	Concatenated leader joint actions
`episode_index`	`int64`	Zero-based episode number
`frame_index`	`int64`	Zero-based frame within episode

Dataset Metadata (info.json)

{
  "codebase_version": "v2.1",
  "robot_type": "<robot_name>",
  "fps": 30,
  "features": {
    "observation.state": { "dtype": "float32", "shape": [<N>], "names": ["joint_0", ...] },
    "action":            { "dtype": "float32", "shape": [<N>], "names": ["joint_0", ...] },
    "observation.images.<camera_id>": {
      "dtype": "video",
      "shape": [<H>, <W>, 3],
      "video_info": { "video.fps": 30.0, "video.codec": "av1", "video.pix_fmt": "yuv420p" }
    },
    "timestamp":     { "dtype": "float32", "shape": [1] },
    "frame_index":   { "dtype": "int64",   "shape": [1] },
    "episode_index": { "dtype": "int64",   "shape": [1] }
  },
  "total_episodes": <N>,
  "total_frames":   <N>
}

Episode Statistics (episodes_stats.jsonl)

One JSON line per episode, appended after each conversion:

{
  "episode_index": 0,
  "stats": {
    "observation.state": { "min": [...], "max": [...], "mean": [...], "std": [...] },
    "action":            { "min": [...], "max": [...], "mean": [...], "std": [...] },
    "observation.images.<camera_id>": {
      "mean": [[[<R>, <G>, <B>]]], "std": [[[<R>, <G>, <B>]]]
    }
  }
}

Statistics are computed directly from the converted frame data, so they always reflect the actual exported dataset.

Convert

What You Need 

Building the Tool 

Usage 

Configuring the Conversion 

CLI Overrides 

Output Layout 

Parquet Schema 

Dataset Metadata (info.json)

Episode Statistics (episodes_stats.jsonl)

Troubleshooting 

Missing Arrow/Parquet libraries 

FFmpeg not found or codec error 

Episode numbering mismatch 

What’s Next 