news
Sep 1, 2024 | Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting. New paper alert! It achieves state-of-the-art performance in both accuracy and efficiency for RAG. |
---|---|
Jul 26, 2024 | OFFICEBENCH: Benchmarking Language Agents across Multiple Applications for Office Automation. New paper alert! Check out our latest LLM agent benchmark for office automation scenarios! |
May 27, 2024 | Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step has been accepted by ACL 2024 Findings! |
Jan 16, 2024 | Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding has been accepted by ICLR 2024! |
Dec 9, 2023 | A Study on Robustness and Reliability of Large Language Model Code Generation has been accepted by AAAI 2024! |