Skip to content

Data Tools

The Data Tools page provides useful tools for editing and managing datasets/models as follows:

Web UI
ToolDescription
Hugging Face Upload & DownloadUpload your datasets/models to Hugging Face or download existing ones
Merging DatasetsCombine multiple datasets into a single dataset
Removing EpisodesRemove unwanted episodes from your dataset

Hugging Face Upload & Download

You can upload collected datasets or trained models to Hugging Face. You can also download datasets or models from Hugging Face.

1. Upload

First, select the Hugging Face user id and data type for upload

Web UI

INFO

If there is no registered User ID, click the Change button and enter your Hugging Face token to register a User ID.

Set the toggle button to upload.

Web UI

Select the path of the dataset or model you want to upload in the right panel.

Web UI
Web UI
Web UI

Enter the repository name to be created on Hugging Face.

Web UI

Click the Upload button. The upload progress will be displayed.

Web UI

INFO

You can cancel the upload by clicking Cancel during the upload process. However, the repository will still be created on Hugging Face.

After canceling, clicking the Upload button again will resume the upload from where it left off.

2. Download

First, select the Hugging Face user id and data type for download

Web UI

Set the toggle button to download.

Web UI

Enter the repository to download.

Web UI

Click the Download button.

  • Dataset: Progress bar is displayed Web UI

  • Model: Spinner is displayed Web UI

INFO

You can cancel the download by clicking Cancel during the download process. However, the folder created in the local path will remain.

INFO

The download paths are as follows (based on the Docker container’s file system):

  • Dataset: /root/.cache/huggingface/lerobot/
  • Model: /root/ros2_ws/src/physical_ai_tools/lerobot/outputs/train/

TIP

⚠️ Network recommendation

Policy models are typically large in size. When downloading a policy model, a wired Ethernet connection is highly recommended to ensure stable and fast download speeds.

Merging Datasets

You can merge multiple datasets into a single dataset.

1. Enter Datasets to Merge

In the Merge Datasets > Enter Datasets to Merge section:

Web UI

Click the folder button to open the file explorer and select your desired dataset. (You can also type the path directly in the text input field)

Web UI
Web UI

Click the + Add Dataset button to add a new dataset input field and select your dataset.

Web UI

Repeat the above steps to add as many datasets as needed.

2. Enter Output Path

In the Enter Output Path section, select the path where the merged dataset result will be saved.

Web UI

Click the folder button to open the file explorer and select your desired folder. (You can also type the path directly in the text input field)

Web UI
Web UI

Enter the name for the merged result dataset folder.

Web UI

Finally, click the Merge button to merge the datasets.

Web UI

Removing Episodes

You can remove episodes that were incorrectly recorded during the data collection process.

1. Select Dataset to Edit

In the Delete Episodes section, click the folder button to open the file explorer and select your desired dataset. (You can also type the path directly in the text input field)

Web UI
Web UI

2. Specify Episodes to Remove

Enter the episode numbers you want to remove in the input field. You can remove multiple episodes at once by separating the numbers with commas.

Web UI

INFO

After deletion, the remaining episode numbers and metadata will be automatically reorganized and reindexed.

Finally, click the Delete button to remove the specified episodes from the dataset.

Web UI

AI Worker and AI Manipulator released under the Apache-2.0 license.