Publications

(2024). Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? Advances in Neural Information Processing Systems (NeurIPS), 2024.
(2024). CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers).
(2024). A BiRGAT Model for Multi-Intent Spoken Language Understanding with Hierarchical Semantic Frames. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
(2024). OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments. Advances in Neural Information Processing Systems (NeurIPS), 2024.
(2024). Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding. Proceedings of the 17th ACM International Conference on Web Search and Data Mining, WSDM 2024, Merida, Mexico, March 4-8, 2024.
(2023). ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought. Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023.
(2023). ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL. CoRR.
(2023). A Heterogeneous Graph to Abstract Syntax Tree Framework for Text-to-SQL. IEEE Trans. Pattern Anal. Mach. Intell.
(2023). SPM: A Split-Parsing Method for Joint Multi-Intent Detection and Slot Filling. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, ACL 2023, Toronto, Canada, July 9-14, 2023.
(2023). Exploring Schema Generalizability of Text-to-SQL. Findings of the Association for Computational Linguistics: ACL 2023, Toronto, Canada, July 9-14, 2023.