Bardienus Duisterhof

I build generative models that learn to see, reconstruct, and imagine the 3D world.

CV ↗ Google Scholar ↗ GitHub ↗

About

I am a final-year Ph.D. student at Carnegie Mellon University (CMU)'s Robotics Institute, advised by Jeffrey Ichnowski, and I frequently collaborate with Deva Ramanan and Bowen Wen. I am currently a research intern at World Labs on the pre-training team, under Justin Johnson. Previously, I was a research intern in the DUSt3R Group at NAVER Labs Europe, advised by Jérome Revaud and Vincent Leroy. My interests lie in generative models for spatial intelligence, with a focus on the foundations of generation: diffusion and flow-based modeling, what these models learn, and how pre-training on the physical world transfers to perception and embodied agents.

During my first year at CMU I worked with Sebastian Scherer on geometric camera calibration. Prior to CMU I completed a Bachelor's and Master's degree in Aerospace Engineering at Delft University of Technology, advised by Guido de Croon, studying efficient bio-inspired algorithms for fully autonomous nano drones. In 2019 I was a visiting student at Vijay Janapa Reddi's Edge Computing lab at Harvard University. I am a recipient of the 2023 CMLH Fellowship in Digital Health Innovation and the 2024 CMLH Fellowship in Generative AI in Healthcare.

I'm on the job market for a research scientist or post-doc role.

Selected Research

Modality Forcing for Scalable Spatial Generation

Bardienus P. Duisterhof, Deva Ramanan, Jeffrey Ichnowski, Justin Johnson, Keunhong Park

arXiv preprint, 2026

Modality Forcing turns a pretrained text-to-image model into a joint image-depth generator with a simple post-training recipe: one DiT, separate noise levels per modality, and per-modality decoders that allow training on sparse real-world depth. Depth accuracy scales with T2I pre-training (300M → 3B), and our strongest model is competitive with state-of-the-art monocular depth estimators, reducing AbsRel by 57% over prior joint image-depth generative models.

project / arxiv / code / demo

3PoinTr: 3D Point Tracks for Learning Manipulation from Unconstrained Human Videos

Adam Hung, Bardienus P. Duisterhof, Jeffrey Ichnowski

arXiv preprint, 2026

3PoinTr learns manipulation from unconstrained human videos: videos where the human demonstrator can act freely rather than mimicking target robot kinematics. 3PoinTr first predicts dense 3D point tracks — how the scene should move to complete the task — and then conditions a closed-loop multitask policy on these tracks. 3PoinTr outperforms strong behavior cloning and learning-from-video baselines across simulated and real-world evaluations.

project / arxiv

Wiggle and Go! System Identification for Zero-Shot Dynamic Rope Manipulation

Arthur Jakobsson, Abhinav Mahajan, Karthik Pullalarevu, Krishna Suresh, Yunchao Yao, Yuemin Mao, Bardienus P. Duisterhof, Shahram Syed, Jeffrey Ichnowski

Preprint, 2025

Wiggle and Go! enables zero-shot dynamic rope manipulation through system identification: the robot first wiggles a rope to identify its physical parameters, then uses learned simulation priors to plan an accurate, goal-conditioned throw — without large real-world datasets or iterative retries. The two-stage framework completes dynamic rope tasks accurately on the first attempt across varied ropes and payloads.

project / paper / video

RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion

Bardienus P. Duisterhof, Jan Oberst, Bowen Wen, Stan Birchfield, Deva Ramanan, Jeffrey Ichnowski

NeurIPS 2025

Imagine if robots could fill in the blanks in cluttered scenes. Enter RaySt3R ✨: a single masked RGB-D image in, complete 3D out. It infers depth, object masks, and confidence for novel views, then merges the predictions into a single point cloud.

project / code / X thread

MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion

Bardienus P. Duisterhof*, Lojze Zust*, Philippe Weinzaepfel, Vincent Leroy, Yohann Cabon, Jérome Revaud

International Conference on 3D Vision (3DV) 2025, Oral, Best Student Paper Award

MASt3R for SfM with 1000+ unordered images. We contribute a memory-efficient algorithm that leverages the MASt3R encoder for image retrieval without any overhead. MASt3R-SfM has overall linear complexity in the number of images, and handles any set of ordered or unordered images.

arxiv / code

DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object Manipulation

Bardienus P. Duisterhof, Zhao Mandi, Yunchao Yao, Jia-Wei Liu, Jenny Seidenschwarz, Mike Zheng Shou, Deva Ramanan, Shuran Song, Stan Birchfield, Bowen Wen, Jeffrey Ichnowski

Proc. Algorithmic Foundations of Robotics (WAFR) 2024

Deformable objects are common in household, industrial, and healthcare settings; tracking them would unlock applications across robotics, gen-AI, and AR. DeformGS performs dense 3D tracking and dynamic novel-view synthesis on real-world deformable cloths.

project website / arXiv / data / code / X thread

Cloth-Splatting: 3D State Estimation from RGB Supervision for Deformable Objects

Alberta Longhini*, Marcel Büsching*, Bardienus P. Duisterhof, Jens Lundell, Jeffrey Ichnowski, Mårten Björkman, Danica Kragic

Conference on Robot Learning (CoRL) 2024

Cloth-Splatting accurately estimates the state of deformable objects from RGB supervision. It uses a GNN as a prior to improve tracking accuracy and convergence speed.

project website / paper

DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction

Jenny Seidenschwarz, Qunjie Zhou, Bardienus P. Duisterhof, Deva Ramanan, Laura Leal-Taixé

International Conference on 3D Vision (3DV) 2025

Online 3D tracking can unlock many new applications in robotics, AR, and VR. Most prior work targets offline tracking on full sequences. DynOMo simultaneously performs 3D tracking, 3D reconstruction, novel-view synthesis, and pose estimation.

arXiv / project website

Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation

Bardienus P. Duisterhof, Yuemin Mao, Si Heng Teng, Jeffrey Ichnowski

IEEE International Conference on Robotics and Automation (ICRA) 2024
🌟 Spotlight 🌟 presentation at the ICCV23 — TRICKY Workshop

Residual-NeRF improves depth perception and training speed for transparent objects. By first learning a background NeRF of the workspace without the transparent objects to be manipulated, we recover better depth quality and faster convergence.

project website / arXiv / video / code

All Publications

The complete bibliography. The selected works above are repeated here in compact form.

Modality Forcing for Scalable Spatial Generation
Bardienus P. Duisterhof, Deva Ramanan, Jeffrey Ichnowski, Justin Johnson, Keunhong Park

arXiv 2026 project / arxiv / code / demo
3PoinTr: 3D Point Tracks for Learning Manipulation from Unconstrained Human Videos
Adam Hung, Bardienus P. Duisterhof, Jeffrey Ichnowski

arXiv 2026 project / arxiv
Wiggle and Go! System Identification for Zero-Shot Dynamic Rope Manipulation
Arthur Jakobsson, Abhinav Mahajan, Karthik Pullalarevu, Krishna Suresh, Yunchao Yao, Yuemin Mao, Bardienus P. Duisterhof, Shahram Syed, Jeffrey Ichnowski

Preprint 2025 project / paper / video
RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion
Bardienus P. Duisterhof, Jan Oberst, Bowen Wen, Stan Birchfield, Deva Ramanan, Jeffrey Ichnowski

NeurIPS 2025 project / code / X thread
MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion
Bardienus P. Duisterhof*, Lojze Zust*, Philippe Weinzaepfel, Vincent Leroy, Yohann Cabon, Jérome Revaud

3DV 2025 · Best Student Paper arxiv / code
DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object Manipulation
Bardienus P. Duisterhof, Zhao Mandi, Yunchao Yao, Jia-Wei Liu, Jenny Seidenschwarz, Mike Zheng Shou, Deva Ramanan, Shuran Song, Stan Birchfield, Bowen Wen, Jeffrey Ichnowski

WAFR 2024 project / arXiv / data / code / X thread
Cloth-Splatting: 3D State Estimation from RGB Supervision for Deformable Objects
Alberta Longhini*, Marcel Büsching*, Bardienus P. Duisterhof, Jens Lundell, Jeffrey Ichnowski, Mårten Björkman, Danica Kragic

CoRL 2024 project / paper
DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction
Jenny Seidenschwarz, Qunjie Zhou, Bardienus P. Duisterhof, Deva Ramanan, Laura Leal-Taixé

3DV 2025 arXiv / project
Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation
Bardienus P. Duisterhof, Yuemin Mao, Si Heng Teng, Jeffrey Ichnowski

ICRA 2024 · ICCV23 TRICKY Spotlight project / arXiv / video / code
GSL-Bench: High Fidelity Gas Source Localisation Benchmarking
Hajo H. Erwich, Bardienus P. Duisterhof, Guido de Croon

ICRA 2024 project / paper / video / code
TartanCalib: Iterative Wide-Angle Lens Calibration using Adaptive SubPixel Refinement of AprilTags
Bardienus P. Duisterhof, Yaoyu Hu, Si Heng Teng, Michael Kaess, Sebastian Scherer

arXiv project / arXiv / video / code
Sniffy Bug: A Fully Autonomous Swarm of Gas-Seeking Nano Quadcopters in Cluttered Environments
Bardienus P. Duisterhof, Shushuai Li, Javier Burgués, Vijay Janapa Reddi, Guido C.H.E. de Croon

IROS 2021 arXiv / video / code
Tiny Robot Learning (tinyRL) for Source Seeking on a Nano Quadcopter
Bardienus P. Duisterhof, Srivatsan Krishnan, Jonathan J. Cruz, Colby R. Banbury, William Fu, Aleksandra Faust, Guido C.H.E. de Croon, Vijay Janapa Reddi

ICRA 2021 paper / video / code
A Tailless Flapping Wing MAV Performing Monocular Visual Servoing Tasks
Diana A. Olejnik, Bardienus P. Duisterhof, Matej Karásek, Kirk Y. W. Scheper, Tom van Dijk, Guido C.H.E. de Croon

Unmanned Systems, Vol. 08, No. 04, pp. 287–294, 2020 paper / video

News

Sept 2025 RaySt3R has been accepted at NeurIPS 2025! See you in San Diego 🌴
June 2025 We are excited to release RaySt3R, a method for predicting novel depth maps for zero-shot object completion!
Mar 2025 MASt3R-SfM has received the best student paper award at 3DV 2025! Thanks to all collaborators at Naver Labs Europe 🧗
Dec 2024 MASt3R-SfM and DynOMo have been accepted at 3DV 2025!
Nov 2024 Thanks to the CMLH Fellowship in Generative AI in Healthcare for generously supporting my research!
Sept 2024 Cloth-Splatting has been accepted at CoRL 2024!

Show earlier news

Aug 2024 DeformGS has been accepted at WAFR 2024!
July 2024 I started my internship at NAVER Labs Europe! Excited to work with Jérome Revaud, Vincent Leroy and the rest of the DUSt3R team.
Jan 2024 2 papers accepted at ICRA 2024! See you in Japan 🇯🇵.
Nov 2023 Check out our recent work on MD-Splatting, a method for dense tracking and novel view synthesis of cloth 🧣.
July 2023 Our paper on NeRFs for transparent objects has been accepted for a 🌟 spotlight 🌟 presentation at the ICCV23 — TRICKY Workshop.
April 2023 Thanks to the CMLH Fellowship in Digital Health Innovation for generously supporting my research!

Teaching

Media Coverage

Awards