Tim Engelbracht1, René Zurbrügg1, Marc Pollefeys1,2, Hermann Blum1,3, Zuria Bauer1
1ETH Zurich 2Microsoft 3Uni Bonn
Spot-Light is a library and framework built on top of the Spot-Compose codebase to interact with Boston Dynamics' Spot robot. It enables processing point clouds, performing robotic tasks, and updating scene graphs.
The dataset used in the paper is available at Roboflow.
Heads up: this setup is a bit involved, since we will explain not only some example code, but also the entire setup, including acquiring the point clouds, aligning them, setting up the Docker tools and scene graphs, and so on and so forth. So bear with me here. In case you run into issues, don't hesitate to open an issue or just send me an email :)
- Set the environment variable SPOTLIGHT to the path of the Spot-Light folder. Just to make sure we're all on the same page ... I mean path ;) Example:
export SPOTLIGHT=<spotlights-repository-on-your-computer>
- Create a virtual environment:
python3 -m venv venv
source venv/bin/activate
- Install required packages:
pip install -r requirements.txt
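If you want to verify the setup before moving on, a tiny sanity check like the following (a hypothetical helper, not part of the repo) catches a missing SPOTLIGHT variable or an incomplete checkout early:

```python
# Hypothetical sanity check (not part of the repository): verify that
# SPOTLIGHT is set and that the expected top-level folders exist.
import os
from pathlib import Path

spotlight = os.environ.get("SPOTLIGHT")
assert spotlight, "SPOTLIGHT is not set -- see the export above"

root = Path(spotlight)
for sub in ("configs", "source", "data"):
    status = "ok" if (root / sub).is_dir() else "MISSING"
    print(f"{status:8} {root / sub}")
```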
We need two point clouds for scene understanding: a low-resolution scan captured during Spot's autowalk and a high-resolution scan captured with a 3D lidar scanner (e.g. an iPad).
First, update the configuration file (configs/config.yaml) with the names of the specific low- and high-resolution scans for the current session (e.g. 02-02-25-room1):
pre_scanned_graphs:
  low_res: '<low_res_name>'   # name of the low-resolution scan (acquired during autowalk with Spot)
  high_res: '<high_res_name>' # name of the high-resolution scan (acquired through a 3D lidar scan with e.g. an iPad)
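For reference, here is a minimal sketch (hypothetical, assuming PyYAML) of reading those two names back, so you can confirm the config parses the way the pipeline expects:

```python
# Hypothetical helper: read the two scan names back from configs/config.yaml.
import os
from pathlib import Path
import yaml

cfg_path = Path(os.environ["SPOTLIGHT"]) / "configs" / "config.yaml"
with open(cfg_path) as f:
    cfg = yaml.safe_load(f)

print("low_res :", cfg["pre_scanned_graphs"]["low_res"])
print("high_res:", cfg["pre_scanned_graphs"]["high_res"])
```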
- Position Spot in front of the AprilTag and start the autowalk (control Spot to walk around the office; the point cloud is captured in the meantime)
- Zip the resulting data and unzip it into $SPOTLIGHT/data/autowalk/
- The point cloud should then be at: $SPOTLIGHT/data/autowalk/<low_res_name>.walk/point_cloud.ply
- Use the 3D Scanner App (iOS) to capture the point cloud
- Ensure the fiducial is visible during the scan
- After the scan, export the following two files:
- All Data as a zip file.
- Point Cloud/PLY file with "High Density" enabled and "Z axis up" disabled.
- Unzip the "All Data" into $SPOTLIGHT/data/prescans/ and copy the PLY file into this folder as well.
Important: Both the unzipped folder and the PLY file must use the same name as specified in config.yaml under pre_scanned_graphs.high_res (<high_res_name>). For example, if your config specifies high_res: "office_scan", name your files:
- Folder: $SPOTLIGHT/data/prescans/office_scan/
- PLY file: $SPOTLIGHT/data/prescans/office_scan/office_scan.ply
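Before running the alignment script below, you can sanity-check the whole layout with a short pre-flight script like this (hypothetical, assuming PyYAML and the paths described above):

```python
# Hypothetical pre-flight check: verify both scans sit where the setup
# scripts expect them, using the names from configs/config.yaml.
import os
from pathlib import Path
import yaml

root = Path(os.environ["SPOTLIGHT"])
with open(root / "configs" / "config.yaml") as f:
    graphs = yaml.safe_load(f)["pre_scanned_graphs"]

low, high = graphs["low_res"], graphs["high_res"]
expected = [
    root / "data" / "autowalk" / f"{low}.walk" / "point_cloud.ply",
    root / "data" / "prescans" / high / f"{high}.ply",
]
for path in expected:
    status = "ok" if path.exists() else "MISSING"
    print(f"{status:8} {path}")
```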
# Make it executable (only need to do this once)
chmod +x setup_scripts/align_pointclouds.sh
# Run the script
./setup_scripts/align_pointclouds.sh
The script will:
- Read the scan names from config.yaml
- Create the required directories if needed
- Copy the low-resolution point cloud to $SPOTLIGHT/data/point_clouds/<low_res_name>.ply
- Copy the high-resolution point cloud to $SPOTLIGHT/data/prescans/<high_res_name>/pcd.ply (the location expected by full_align.py)
- Run the point cloud alignment (full_align.py)
- Save the aligned results to $SPOTLIGHT/data/aligned_point_clouds/
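To eyeball the alignment result, a quick viewer sketch along these lines works (assuming open3d is installed; the exact file layout under aligned_point_clouds/ depends on the script, so the snippet just grabs the first .ply it finds):

```python
# Hypothetical visual check of the alignment output using Open3D.
import os
from pathlib import Path
import open3d as o3d

aligned = Path(os.environ["SPOTLIGHT"]) / "data" / "aligned_point_clouds"
ply = next(aligned.rglob("*.ply"), None)  # first .ply found; adjust as needed
assert ply is not None, f"no .ply found under {aligned}"

pcd = o3d.io.read_point_cloud(str(ply))
print(f"{ply}: {len(pcd.points)} points")
o3d.visualization.draw_geometries([pcd])
```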
# Make the script executable
chmod +x setup_scripts/setup_mask3d.sh
# Run the script
./setup_scripts/setup_mask3d.sh
The script will:
- Set up the Mask3D repository and download the checkpoint we use
- Pull and run the Mask3D Docker container
- Rename the high-resolution point cloud (pcd.ply) to mesh.ply (required by Mask3D)
- Process mesh.ply into a semantically instance-segmented mesh (mesh_labeled.ply)
- Copy certain files to the locations the repo expects for further processing
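A short post-run check (hypothetical; the exact output location may differ in your setup, so adjust the folder if needed) to confirm the renamed input and the labeled mesh exist:

```python
# Hypothetical post-run check for the Mask3D outputs.
import os
from pathlib import Path
import yaml

root = Path(os.environ["SPOTLIGHT"])
with open(root / "configs" / "config.yaml") as f:
    high = yaml.safe_load(f)["pre_scanned_graphs"]["high_res"]

prescan = root / "data" / "prescans" / high
for name in ("mesh.ply", "mesh_labeled.ply"):
    path = prescan / name
    status = "ok" if path.exists() else "MISSING"
    print(f"{status:8} {path}")
```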
- YoloDrawer: Required for drawer interactions in the scene graph.
- OpenMask: Required for search_all_drawers.py.
- GraspNet: Required for gs_grasp.py and OpenMask feature extraction.
Refer to the Spot-Compose repository documentation for Docker download and setup.
NOTE: If you plan on using the GraspNet Docker container, make sure to run it first and the other containers afterwards! Otherwise the GraspNet container won't work; we haven't figured out why.
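If you want to double-check what is up and in which order things came up, a small sketch using the Docker SDK for Python (an assumption on our side; install it with pip install docker) lists the running containers:

```python
# Hypothetical helper: list running containers to confirm the GraspNet
# container was started before the others.
import docker  # Docker SDK for Python

client = docker.from_env()
for container in client.containers.list():
    print(container.name, container.status, container.image.tags)
```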
Just to make sure we don't run into pathing/import issues:
export PYTHONPATH=$SPOTLIGHT:$SPOTLIGHT/source:$PYTHONPATH
In case you are using the OpenMask functionalities:
python3 $SPOTLIGHT/source/utils/openmask_interface.py
Create a hidden .environment.yaml file to store sensitive configurations:
spot:
  wifi-network: <password>
  spot_admin_console_username: <username>
  spot_admin_console_password: <password>
  wifi_default_address: 192.168.50.1
  wifi_password: <password>
  nuc1_user_password: <password>
api:
  openai:
    key: <api key>
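A minimal sketch (hypothetical, assuming PyYAML and that the file sits at the repository root) of how code can pull secrets out of this file instead of hard-coding them:

```python
# Hypothetical helper: load secrets from the hidden .environment.yaml.
# The keys below match the template above.
import os
from pathlib import Path
import yaml

env_path = Path(os.environ["SPOTLIGHT"]) / ".environment.yaml"
with open(env_path) as f:
    secrets = yaml.safe_load(f)

openai_key = secrets["api"]["openai"]["key"]
spot_address = secrets["spot"]["wifi_default_address"]
print("Spot address:", spot_address)
```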
This is an overview of the workstation networking. Again, this information can also be found in the Spot-Compose repository.
On the workstation, run $SPOTLIGHT/shells/ubuntu_routing.sh (or $SPOTLIGHT/shells/mac_routing.sh, depending on your workstation's operating system).
Short explanation for the curious: this shell script has only a single line of code: sudo ip route add 192.168.50.0/24 via <local NUC IP>
In this command:
- 192.168.50.0/24 represents the subnet for Spot.
- <local NUC IP> is the IP address of your local NUC device.
If you're working with multiple Spot robots, each Spot must be assigned a distinct IP address within the subnet (e.g., 192.168.50.1, 192.168.50.2, etc.). In such cases, the routing needs to be adapted for each Spot. For example:
sudo ip route add 192.168.50.2 via <local NUC IP>
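If it helps, here is a tiny hypothetical sketch that just prints the per-robot route commands for a small fleet (fill in your actual NUC and Spot IPs):

```python
# Hypothetical sketch: generate one route command per Spot, each robot on a
# distinct address within the 192.168.50.0/24 subnet.
NUC_IP = "<local NUC IP>"
SPOT_IPS = ["192.168.50.1", "192.168.50.2"]

for spot_ip in SPOT_IPS:
    print(f"sudo ip route add {spot_ip} via {NUC_IP}")
```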
First, SSH into the NUC, then run ./robot_routing.sh to configure the NUC as a network bridge. Note that this script might also need to be adapted based on your robot and workstation IPs.
python3 -m source.scripts.my_robot_scripts.light_switch_demo
python3 -m source.scripts.my_robot_scripts.scene_graph_demo
python3 -m source.scripts.my_robot_scripts.search_swing_drawer_dynamic
@misc{engelbracht2024spotlightroboticsceneunderstanding,
title={SpotLight: Robotic Scene Understanding through Interaction and Affordance Detection},
author={Tim Engelbracht and René Zurbrügg and Marc Pollefeys and Hermann Blum and Zuria Bauer},
year={2024},
eprint={2409.11870},
archivePrefix={arXiv},
primaryClass={cs.RO},
url={https://arxiv.org/abs/2409.11870},
}
This project is licensed under the MIT License.