Model Training with LeRobot CLI

This guide walks you through the training imitation learning models for the AI Worker, based on datasets collected via the LeRobot CLI.

INFO

You can train the policy either on your local PC or an NVIDIA Jetson AGX Orin device.

After preparing your dataset, you can proceed to train the policy model.

Training on NVIDIA Jetson AGX Orin

1. Enter the Docker Container

Open a terminal on the Jetson device and enter the Docker container:

cd ai_worker && ./docker/container.sh enter

2. Navigate to the LeRobot Directory

bash

cd /root/ros2_ws/src/physical_ai_tools/lerobot

3. Train the Policy

Execute the following command to start training:

bash

python lerobot/scripts/train.py \
  --dataset.repo_id=${HF_USER}/ffw_test \
  --policy.type=act \
  --output_dir=outputs/train/act_ffw_test \
  --policy.device=cuda \
  --log_freq=100 \
  --save_freq=1000

👉 Key Training Parameters

Parameter	Description
`--dataset.repo_id`	The Hugging Face dataset ID you created in the data collection step
`--policy.type`	Model architecture to use (e.g., 'act' for Action Chunking Transformer)
`--output_dir`	Where to save model checkpoints and logs
`--policy.device`	Device to use for training ('cuda' for GPU, 'cpu' for CPU)
`--log_freq`	How often to log training statistics (in iterations)
`--save_freq`	How often to save model checkpoints (in iterations)

👉 Expected Training Output

During training, you will see output like this:

INFO 2025-05-28 12:12:40 ts/train.py:232 step:200 smpl:2K ep:3 epch:0.16 loss:7.490 grdn:154.502 lr:1.0e-05 updt_s:0.047 data_s:0.002
INFO 2025-05-28 12:12:48 ts/train.py:232 step:400 smpl:3K ep:7 epch:0.33 loss:3.128 grdn:85.109 lr:1.0e-05 updt_s:0.041 data_s:0.000
INFO 2025-05-28 12:12:57 ts/train.py:232 step:600 smpl:5K ep:10 epch:0.49 loss:2.615 grdn:74.954 lr:1.0e-05 updt_s:0.041 data_s:0.000
INFO 2025-05-28 12:13:05 ts/train.py:232 step:800 smpl:6K ep:13 epch:0.65 loss:2.331 grdn:68.764 lr:1.0e-05 updt_s:0.042 data_s:0.000
INFO 2025-05-28 12:13:14 ts/train.py:232 step:1K smpl:8K ep:16 epch:0.81 loss:2.075 grdn:64.323 lr:1.0e-05 updt_s:0.042 data_s:0.000
INFO 2025-05-28 12:13:22 ts/train.py:232 step:1K smpl:10K ep:20 epch:0.98 loss:1.903 grdn:61.364 lr:1.0e-05 updt_s:0.042 data_s:0.000
INFO 2025-05-28 12:13:31 ts/train.py:232 step:1K smpl:11K ep:23 epch:1.14 loss:1.716 grdn:57.887 lr:1.0e-05 updt_s:0.042 data_s:0.001
INFO 2025-05-28 12:13:40 ts/train.py:232 step:2K smpl:13K ep:26 epch:1.30 loss:1.558 grdn:54.819 lr:1.0e-05 updt_s:0.041 data_s:0.000
INFO 2025-05-28 12:13:48 ts/train.py:232 step:2K smpl:14K ep:29 epch:1.47 loss:1.454 grdn:53.859 lr:1.0e-05 updt_s:0.042 data_s:0.000
...

Training on Your PC

1. Set Up the LeRobot Framework

First, follow the LeRobot installation instructions to set up the framework locally.

2. Transfer Your Dataset to Your Local Machine

Use scp to copy the dataset directory from the Robot PC to your local machine:

bash

scp -r ~/ai_worker/docker/huggingface/lerobot/${HF_USER}/ffw_test/ <USER>@<IP>:/home/.cache/huggingface/lerobot/${HF_USER}/

INFO

Replace ${HF_USER} with your Hugging Face username.
Replace ffw_test with the actual dataset repository ID.
<USER> and <IP> refer to your local machine’s SSH credentials.

3. Train the Policy

Once the dataset has been transferred, you can train a policy using the following command:

bash

python lerobot/scripts/train.py \
  --dataset.repo_id=${HF_USER}/ffw_test \
  --policy.type=act \
  --output_dir=outputs/train/act_ffw_test \
  --policy.device=cuda \
  --log_freq=100 \
  --save_freq=1000

Training time depends on your hardware and dataset size, but typically ranges from several hours to a full day.

(Optional) Uploading Checkpoints to Hugging Face

To upload the latest trained checkpoint to the Hugging Face Hub:

bash

huggingface-cli upload ${HF_USER}/act_ffw_test \
  outputs/train/act_ffw_test/checkpoints/last/pretrained_model

This makes your model accessible from anywhere and simplifies deployment.

Dataset Preparation

Model Training

Model Inference

Overview

Model Training with LeRobot CLI

Training on NVIDIA Jetson AGX Orin

1. Enter the Docker Container

2. Navigate to the LeRobot Directory

3. Train the Policy

Training on Your PC

1. Set Up the LeRobot Framework

2. Transfer Your Dataset to Your Local Machine

3. Train the Policy

(Optional) Uploading Checkpoints to Hugging Face

Model Training with LeRobot CLI ​

Training on NVIDIA Jetson AGX Orin ​

1. Enter the Docker Container ​

2. Navigate to the LeRobot Directory ​

3. Train the Policy ​

Training on Your PC ​

1. Set Up the LeRobot Framework ​

2. Transfer Your Dataset to Your Local Machine ​

3. Train the Policy ​

(Optional) Uploading Checkpoints to Hugging Face ​

Model Training with LeRobot CLI

Training on NVIDIA Jetson AGX Orin

1. Enter the Docker Container

2. Navigate to the LeRobot Directory

3. Train the Policy

Training on Your PC

1. Set Up the LeRobot Framework

2. Transfer Your Dataset to Your Local Machine

3. Train the Policy

(Optional) Uploading Checkpoints to Hugging Face