Publications

My research interests lies in general robotic manipulation, foundation models in robotics, multi-agent system and small-scale computer vision models. Below are my publications (* denotes equal contribution). You can also find my articles on my Google Scholar profile.

Conference Papers

ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation

Published in Conference on Robot Learning (CoRL), 2025

Propose a novel benchmark, ManipBench, to evaluate the low-level robot manipulation reasoning capabilities of VLMs across various dimensions, including how well they understand object-object interactions and deformable object manipulation Read more

Recommended citation: Enyu Zhao*, Vedant Raval*, Hejia Zhang*, Jiageng Mao, Zeyu Shangguan, Stefanos Nikolaidis, Yue Wang and Daniel Seita. (2025). "ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation. " Conference on Robot Learning (CoRL).
Download Paper

HRIBench: Benchmarking Vision-Language Models for Real-Time Human Perception in Human-Robot Interaction

Published in International Symposium on Experimental Robotics (ISER), 2025

Propose HRIBench designed to evaluate VLMs across a diverse set of human perceptual tasks critical for HRI. Read more

Recommended citation: Zhonghao Shi, Enyu Zhao, Nathaniel Dennler, Jingzhen Wang, Xinyang Xu, Kaleen Shrestha, Mengxue Fu, Daniel Seita, Maja Mataric (2025). "HRIBench: Benchmarking Vision-Language Models for Real-Time Human Perception in Human-Robot Interaction " International Symposium on Experimental Robotics (ISER), 2025.
Download Paper

GPT-Fabric: Smoothing and Folding Fabric by Leveraging Pre-Trained Foundation Models

Published in International Symposium of Robotics Research (ISRR), 2024

Propose GPT-Fabric for fabric folding and smoothing, achieve comparable and even better folding and smoothing performance comparing to previous methods with no training data required. Read more

Recommended citation: Vedant Raval*, Enyu Zhao*, Hejia Zhang, Stefanos Nikolaidis, and Daniel Seita. (2024). "GPT-Fabric: Smoothing and Folding Fabric by Leveraging Pre-Trained Foundation Models." International Symposium of Robotics Research (ISRR).
Download Paper

Journal Articles

SDI: A tool for speech differentiation in user identification

Published in Expert Systems with Applications, 2024

This paper proposed a speech differentiator with integrated SVM (SDI-SVM) by implementing threshold checking and frequency matching mechanisms. Read more

Recommended citation: Muhammad Abdul Basit, Chanjuan Liu, Enyu Zhao. (2024). "SDI: A tool for speech differentiation in user identification." Expert Systems with Applications. Volume 243, 122866.
Download Paper

Time-aware MADDPG with LSTM for multi-agent obstacle avoidance: a comparative study

Published in Complex & Intelligent Systems, 2024

This paper addresses the limitations of MADDPG in multi-agent navigation and obstacle avoidance tasks, providing insights for developing intelligent agents and multi-agent systems. Read more

Recommended citation: Enyu Zhao, Ning Zhou, Chanjuan Liu, Houfu Su, Yang Liu & Jinmiao Cong. (2024). "Time-aware MADDPG with LSTM for multi-agent obstacle avoidance: a comparative study." Complex & Intelligent Systems. Volume 10, pages 4141–4155.
Download Paper

Preprints

Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach

Published in arXiv, 2023

This paper proposes an Instant Photorealistic Style Transfer (IPST) approach, designed to achieve instant photorealistic style transfer on super-resolution inputs without the need for pre-training on pair-wise datasets or imposing extra constraints. Read more

Recommended citation: Rong Liu, Enyu Zhao, Zhiyuan Liu, Andrew Feng, Scott John Easley. (2023). "Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach." arXiv preprint arXiv:2309.10011.
Download Paper

Enyu Zhao / 赵恩宇