Selected Publications
I am fortunate to be pursuing my Ph.D. research at a time when rapid advances in LLMs and VLMs are opening new possibilities for robot learning.
|
|
Pre-training Auto-regressive Robotic Models with 4D Representations
Dantong Niu*,
Yuvan Sharma*,
Haoru Xue,
Giscard Biamby,
Junyi Zhang,
Ziteng Ji,
Trevor Darrell†,
Roei Herzig†
Forty-Second International Conference on Machine Learning (ICML), 2025
arXiv
|
|
In-Context Learning Enables Robot Action Prediction in LLMs
Yida Yin*,
Zekai Wang*,
Yuvan Sharma,
Dantong Niu,
Trevor Darrell,
Roei Herzig
IEEE International Conf. on Robotics and Automation (ICRA), 2025
project page /
code /
paper
|
|
LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning
Dantong Niu*,
Yuvan Sharma*, Giscard Biamby, Jerome Quenum, Yutong Bai, Baifeng Shi,
Trevor Darrell†,
Roei Herzig†
Conference on Robot Learning (CoRL), 2024
project page /
code /
paper
We propose LLARVA, a model trained with a novel instruction tuning method that
leverages structured prompts to unify a range of robotic configurations
and introduces the concept of visual traces to further align the vision and action spaces.
|
|
U2Seg: Unsupervised Universal Image Segmentation
Dantong Niu*,
Xudong Wang*,
Xinyang Han*,
Long Lian,
Roei Herzig,
Trevor Darrell
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2024
project page /
code /
paper
We present U2Seg, a unified framework for Unsupervised Universal image Segmentation
that consistently outperforms previous state-of-the-art methods.
|
Previous Work
Before starting my Ph.D. studies, I worked on general machine learning and vision topics such as object segmentation, adversarial attacks, and video understanding.
|
|
Moiré Attack (MA): A New Potential Risk of Screen Photos
Dantong Niu*,
Ruohao Guo*,
Yisen Wang
NeurIPS, 2021
code /
paper
|
|
SOTR: Segmenting Objects with Transformers
Ruohao Guo*,
Dantong Niu*,
Liao Qu,
Zhenbo Li
IEEE International Conf. on Computer Vision (ICCV), 2021
code /
paper
|
|
AdvDrop: Adversarial Attack to DNNs by Dropping Information
Ranjie Duan,
Yuefeng Chen,
Dantong Niu*,
Yun Yang,
A. K. Qin,
Yuan He
IEEE International Conf. on Computer Vision (ICCV), 2021
code /
paper
|
|
LeafMask: Towards Greater Accuracy on Leaf Segmentation
Ruohao Guo*,
Liao Qu,
Dantong Niu*,
Zhenbo Li,
Jun Yue
IEEE International Conf. on Computer Vision Workshops (ICCVW), 2021
code /
paper
|
|