Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

Mar 4, 2024·
Hongshen Xu
,
Lu Chen
,
Zihan Zhao
,
Da Ma
Ruisheng Cao
Ruisheng Cao
,
Zichen Zhu
,
Kai Yu
· 0 min read
Type
Publication
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, WSDM 2024, Merida, Mexico, March 4-8, 2024
Ruisheng Cao
Authors
4th-year CS PhD candidate
Research interests include structured natural language understanding, model-based data generation and iterative training, and LLM-based multi-modal agents.