Ruisheng Cao's Blog
Open Menu
Close Menu
Biography
Publications
Experience
CV
Large Language Model
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Jul 15, 2024
CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
Jun 16, 2024
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Apr 11, 2024
ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought
Dec 6, 2023
Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction
May 14, 2023