publications | Zilong Wang

2025

ICLR ’25

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

Zilong Wang, Zifeng Wang, Long Le, and 9 more authors

In ICLR 2025, 2025

arXiv Blog

2024

arXiv

OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation

Zilong Wang, Yuedong Cui, Li Zhong, and 4 more authors

arXiv preprint arXiv:2407.19056, 2024

arXiv
NeurIPS ’24

TableRAG: Million-Token Table Understanding with Language Models

Si-An Chen, Lesly Miculicich, Julian Martin Eisenschlos, and 7 more authors

In NeurIPS 2024, 2024

arXiv
ACL ’24

Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step

Li Zhong, Zilong Wang**, and Jingbo Shang

In ACL 2024 (Findings), 2024

arXiv Code X Demo
ICLR ’24

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

Zilong Wang, Hao Zhang, Chun-Liang Li, and 8 more authors

In ICLR 2024, 2024

arXiv Code Blog X
ACL ’24

Answer is All You Need: Instruction-following Text Embedding via Answering the Question

Letian Peng, Yuwei Zhang, Zilong Wang, and 4 more authors

In ACL 2024, 2024

arXiv Code X
ACL ’24

LMDX: Language Model-based Document Information Extraction and Localization

Vincent Perot, Kai Kang, Florian Luisier, and 7 more authors

In ACL 2024 (Findings), 2024

arXiv
AAAI ’24

A Study on Robustness and Reliability of Large Language Model Code Generation

Li Zhong, and Zilong Wang

In AAAI 2024, 2024

arXiv Code X
COLING 2024

Towards Few-shot Entity Recognition in Document Images: A Graph Neural Network Approach Robust to Image Manipulation

Prashant Krishnan, Zilong Wang, Yangkun Wang, and 1 more author

In COLING ’24, 2024

arXiv

2023

KDD 2023

VRDU: A Benchmark for Visually-rich Document Understanding

Zilong Wang, Yichao Zhou, Wei Wei, and 2 more authors

In KDD 2023, 2023

arXiv HTML Code
EMNLP ’23

Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path

Zilong Wang, and Jingbo Shang

In EMNLP 2023 Findings, 2023

arXiv

2022

ACL ’22

Towards Few-shot Entity Recognition in Document Images: A Label-aware Sequence-to-Sequence Framework

Zilong Wang, and Jingbo Shang

In ACL 2022 Findings, 2022

arXiv Code
EMNLP ’22

Formulating Few-shot Fine-tuning Towards Language Model Pre-training: A Pilot Study on Named Entity Recognition

Zihan Wang, Kewen Zhao, Zilong Wang, and 1 more author

In EMNLP 2022 Findings, 2022

arXiv Code
EMNLP ’22

MGDoc: Pre-training with Multi-granular Hierarchy for Document Image Understanding

Zilong Wang, Jiuxiang Gu, Chris Tensmeyer, and 5 more authors

In EMNLP 2022, 2022

arXiv

2021

EMNLP ’21

LayoutReader: Pre-training of Text and Layout for Reading Order Detection

Zilong Wang, Yiheng Xu, Lei Cui, and 2 more authors

In EMNLP 2021, 2021

arXiv Code
arXiv

GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding

Zilong Wang, Mingjie Zhan, Houxing Ren, and 4 more authors

arXiv, 2021

arXiv

2020

EMNLP ’20

DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding

Zilong Wang, Mingjie Zhan, Xuebo Liu, and 1 more author

In EMNLP 2020 Findings, 2020

arXiv
EMNLP ’20

Exploring Semantic Capacity of Terms

Jie Huang*, Zilong Wang*, Kevin Chang, and 2 more authors

In EMNLP 2020, 2020

arXiv
WWW ’20

TransModality: An End2End Fusion Method with Transformer for Multimodal Sentiment Analysis

Zilong Wang, Zhaohong Wan, and Xiaojun Wan

In WWW 2020, 2020