Publications

(2025). NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vienna, Austria, July 27 - August 1, ACL 2025.
(2025). Reducing Tool Hallucination via Reliability Alignment. Proceedings of the 42nd International Conference on Machine Learning, Vancouver, Canada. PMLR 267, ICML 2025.
(2025). EveMRC: Two-Stage Bidirectional Evidence Modeling for Multi-Choice Machine Reading Comprehension. IEEE Transaction on Audio, Speech and Language Processing, vol. 33, pp. 1011-1022, TASLP 2025.
(2025). Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows. International Conference on Learning Representations (ICLR), 2025.
(2024). Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?. Advances in Neural Information Processing Systems (NeurIPS), 2024.
(2024). CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers).
(2024). A Birgat Model for Multi-Intent Spoken Language Understanding with Hierarchical Semantic Frames. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
(2024). OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments. Advances in Neural Information Processing Systems (NeurIPS), 2024.
(2024). Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding. Proceedings of the 17th ACM International Conference on Web Search and Data Mining, WSDM 2024, Merida, Mexico, March 4-8, 2024.
(2023). ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought. Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023.