Zilong Wang

PhD Student at UC San Diego, CSE.


Welcome! I am a fourth-year PhD student at UC San Diego, advised by Prof. Jingbo Shang. I have had a wonderful time doing research at Google Cloud AI, Google Research, Adobe Research, and Microsoft Research Asia. I received my B.S. in Computer Science from Peking University in 2020, where I was advised by Prof. Xiaojun Wan.

My research focuses on applying NLP to real-world problems. I am particularly interested in building systems that can process and understand a wide range of data, including tabular data, visually rich documents, and web content. My goal is to bridge the gap between vast knowledge sources and practical NLP applications.

news

Feb 29, 2024 LDB: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step. New preprint on debugging programs with large language models!
Jan 16, 2024 Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding has been accepted by ICLR 2024!
Dec 9, 2023 A Study on Robustness and Reliability of Large Language Model Code Generation has been accepted by AAAI 2024!
Nov 6, 2023 Joined Google Cloud AI as a Student Researcher for Fall 2023!

selected publications

  1. Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
    Zilong Wang, Hao Zhang, Chun-Liang Li, and 8 more authors
    In ICLR 2024, Jan 2024
  2. VRDU: A Benchmark for Visually-rich Document Understanding
    Zilong Wang, Yichao Zhou, Wei Wei, and 2 more authors
    In KDD 2023, Jan 2023
  3. LMDX: Language Model-based Document Information Extraction and Localization
    Vincent Perot, Kai Kang, Florian Luisier, and 7 more authors
    arXiv, Jan 2023
  4. Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path
    Zilong Wang and Jingbo Shang
    In EMNLP 2023 Findings, Dec 2023
  5. A Study on Robustness and Reliability of Large Language Model Code Generation
    Li Zhong and Zilong Wang
    arXiv, Dec 2023
  6. Towards Few-shot Entity Recognition in Document Images: A Label-aware Sequence-to-Sequence Framework
    Zilong Wang and Jingbo Shang
    In ACL 2022 Findings, Dec 2022
  7. MGDoc: Pre-training with Multi-granular Hierarchy for Document Image Understanding
    Zilong Wang, Jiuxiang Gu, Chris Tensmeyer, and 5 more authors
    In EMNLP 2022, Dec 2022
  8. LayoutReader: Pre-training of Text and Layout for Reading Order Detection
    Zilong Wang, Yiheng Xu, Lei Cui, and 2 more authors
    In EMNLP 2021, Dec 2021
  9. DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding
    Zilong Wang, Mingjie Zhan, Xuebo Liu, and 1 more author
    In EMNLP 2020 Findings, Dec 2020