Dantong Niu

I am a second-year Ph.D. student advised by Prof. Trevor Darrell at UC Berkeley.

I develop vision-language models for robotics.

Email  /  Google Scholar  /  Github

profile photo

Selected Publications

I am fortunate that my PhD research unfolds in an era where the explosion of groundbreaking LLMs and VLMs is revolutionizing robotics learning with unprecedented possibilities.

fast-texture Pre-training Auto-regressive Robotic Models with 4D Representations
Dantong Niu*, Yuvan Sharma*, Haoru Xue, Giscard Biamby, Junyi Zhang, Ziteng Ji, Trevor Darrell†, Roei Herzig
Forty-Second International Conference on Machine Learning (ICML), 2025
arxiv
fast-texture In-Context Learning Enables Robot Action Prediction in LLMs
Yida Yin*, Zekai Wang*, Yuvan Sharma, Dantong Niu, Trevor Darrell, Roei Herzig
IEEE International Conf. on Robotics and Automation (ICRA), 2025
project page / code / paper
fast-texture LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning fast-texture
Dantong Niu*, Yuvan Sharma*, Giscard Biamby, Jerome Quenum, Yutong Bai, Baifeng Shi, Trevor Darrell†, Roei Herzig
Conference on Robot Learning (CoRL), 2024
project page / code / paper

We propose LLARVA, a model trained with a novel instruction tuning method that leverages structured prompts to unify a range of robotic configurations and introduces the concept of visual traces to further align the vision and action spaces.

fast-texture U2Seg: Unsupervised Universal Image Segmentation
Dantong Niu*, Xudong Wang*, Xinyang Han*, Long Lian, Roei Herzig, Trevor Darrell
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2024
project page / code / paper

We present U2Seg, a unified framework for Unsupervised Universal image Segmentation that consistently outperforms previous state-of-the-art methods.

Previous Work

Before starting my Ph.D. studies, I worked on general machine learning and vision topics such as object segmentation, adversarial attacks, and video understanding.

fast-texture MoriƩ Attack (MA): A New Potential Risk of Screen Photos
Dantong Niu*, Ruohao Guo*, Yisen Wang
NeurIPS, 2021
code / paper
fast-texture SOTR: Segmenting Objects with Transformers
Ruohao Guo*, Dantong Niu*, Liao Qu, Zhenbo Li
IEEE Conf. on International Conference on Computer Vision (ICCV), 2021
code / paper
fast-texture AdvDrop: Adversarial Attack to DNNs by Dropping Information
Ranjie Duan, Yuefeng Chen, Dantong Niu*, Yun Yang, A. K. Qin, Yuan He
IEEE Conf. on International Conference on Computer Vision (ICCV), 2021
code / paper
fast-texture LeafMask: Towards Greater Accuracy on Leaf Segmentation
Ruohao Guo*, Liao Qu, Dantong Niu*, Zhenbo Li, Jun Yue
IEEE Conf. on International Conference on Computer Vision Workshops (ICCVW), 2021
code / paper