Ruisheng Cao's Blog
Paper-Conference
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Jul 15, 2024
CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
Jun 16, 2024
A BiRGAT Model for Multi-Intent Spoken Language Understanding with Hierarchical Semantic Frames
Apr 19, 2024
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Apr 11, 2024
Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
Mar 4, 2024
ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought
Dec 6, 2023
SPM: A Split-Parsing Method for Joint Multi-Intent Detection and Slot Filling
Jul 9, 2023
Exploring Schema Generalizability of Text-to-SQL
Jul 9, 2023
CSS: A Large-scale Cross-schema Chinese Text-to-SQL Medical Dataset
Jul 9, 2023
TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages
Jul 10, 2022