publications
more details can be found in my google scholar profile.
(* indicates equal contribution. ** indicates corresponding author.)
2025
2024
-  NeurIPS ’24
 -  ACL ’24LMDX: Language Model-based Document Information Extraction and LocalizationIn ACL 2024 (Findings), 2024
 -  COLING 2024Towards Few-shot Entity Recognition in Document Images: A Graph Neural Network Approach Robust to Image ManipulationIn COLING ’24, 2024
 
2023
-  EMNLP ’23Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML PathIn EMNLP 2023 Findings, 2023
 
2022
-  EMNLP ’22MGDoc: Pre-training with Multi-granular Hierarchy for Document Image UnderstandingIn EMNLP 2022, 2022
 
2021
2020
-  EMNLP ’20DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form UnderstandingIn EMNLP 2020 Findings, 2020
 -  EMNLP ’20
 -  WWW ’20TransModality: An End2End Fusion Method with Transformer for Multimodal Sentiment AnalysisIn WWW 2020, 2020