Ruisheng Cao's Blog
Open Menu
Close Menu
Biography
Publications
Experience
CV
Interface Interaction and Understanding
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Jul 15, 2024
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Apr 11, 2024
Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction
May 14, 2023