publications

More details can be found on my Google Scholar profile.
(* indicates equal contribution. ** indicates corresponding author.)

2024

  1. Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting
    Zilong Wang, Zifeng Wang, Long Le, and 9 more authors
    arXiv preprint arXiv:2407.08223, 2024
  2. OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation
    Zilong Wang, Yuedong Cui, Li Zhong, and 4 more authors
    arXiv preprint arXiv:2407.19056, 2024
  3. NeurIPS ’24
    TableRAG: Million-Token Table Understanding with Language Models
    Si-An Chen, Lesly Miculicich, Julian Martin Eisenschlos, and 7 more authors
    In NeurIPS 2024, 2024
  4. ACL ’24
    Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step
    Li Zhong, Zilong Wang**, and Jingbo Shang
    In ACL 2024 Findings, 2024
  5. ICLR ’24
    Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
    Zilong Wang, Hao Zhang, Chun-Liang Li, and 8 more authors
    In ICLR 2024, 2024
  6. ACL ’24
    Answer is All You Need: Instruction-following Text Embedding via Answering the Question
    Letian Peng, Yuwei Zhang, Zilong Wang, and 4 more authors
    In ACL 2024, 2024
  7. ACL ’24
    LMDX: Language Model-based Document Information Extraction and Localization
    Vincent Perot, Kai Kang, Florian Luisier, and 7 more authors
    In ACL 2024 Findings, 2024
  8. AAAI ’24
    A Study on Robustness and Reliability of Large Language Model Code Generation
    Li Zhong and Zilong Wang
    In AAAI 2024, 2024
  9. COLING ’24
    Towards Few-shot Entity Recognition in Document Images: A Graph Neural Network Approach Robust to Image Manipulation
    Prashant Krishnan, Zilong Wang, Yangkun Wang, and 1 more author
    In COLING 2024, 2024

2023

  1. KDD ’23
    VRDU: A Benchmark for Visually-rich Document Understanding
    Zilong Wang, Yichao Zhou, Wei Wei, and 2 more authors
    In KDD 2023, 2023
  2. EMNLP ’23
    Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path
    Zilong Wang and Jingbo Shang
    In EMNLP 2023 Findings, 2023

2022

  1. ACL ’22
    Towards Few-shot Entity Recognition in Document Images: A Label-aware Sequence-to-Sequence Framework
    Zilong Wang and Jingbo Shang
    In ACL 2022 Findings, 2022
  2. EMNLP ’22
    Formulating Few-shot Fine-tuning Towards Language Model Pre-training: A Pilot Study on Named Entity Recognition
    Zihan Wang, Kewen Zhao, Zilong Wang, and 1 more author
    In EMNLP 2022 Findings, 2022
  3. EMNLP ’22
    MGDoc: Pre-training with Multi-granular Hierarchy for Document Image Understanding
    Zilong Wang, Jiuxiang Gu, Chris Tensmeyer, and 5 more authors
    In EMNLP 2022, 2022

2021

  1. EMNLP ’21
    LayoutReader: Pre-training of Text and Layout for Reading Order Detection
    Zilong Wang, Yiheng Xu, Lei Cui, and 2 more authors
    In EMNLP 2021, 2021
  2. GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding
    Zilong Wang, Mingjie Zhan, Houxing Ren, and 4 more authors
    arXiv, 2021

2020

  1. EMNLP ’20
    DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding
    Zilong Wang, Mingjie Zhan, Xuebo Liu, and 1 more author
    In EMNLP 2020 Findings, 2020
  2. EMNLP ’20
    Exploring Semantic Capacity of Terms
    Jie Huang*, Zilong Wang*, Kevin Chang, and 2 more authors
    In EMNLP 2020, 2020
  3. WWW ’20
    TransModality: An End2End Fusion Method with Transformer for Multimodal Sentiment Analysis
    Zilong Wang, Zhaohong Wan, and Xiaojun Wan
    In WWW 2020, 2020