🔬Advanced agent systems, RAG evaluation, instruction following, and more. Our team's accepted papers at
#NAACL2025 range from professional CRM research to parallel in-context learning.
🎉A huge congrats to our researchers, and thanks to
@naacl. We're excited to share and discuss this work with the community this spring! 💫
👇📑Bookmark and explore the research below! 📑👇
📎CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments
➡️
arxiv.org/abs/2411.02305
👏Steeve Huang, Akshara Prabhakar, Sidharth Dhawan, Yixin Mao, Huan Wang, Silvio Savarese, Caiming Xiong, Philippe Laban, Chien-Sheng (Jason) Wu
📎Evaluating Cultural and Social Awareness in LLM Agents
➡️
arxiv.org/abs/2410.23252
👏Haoyi Qiu, Alexander R. Fabbri, Divyansh Agarwal, Kung-Hsiang Huang, Sarah Tan, Nanyun Peng, Chien-Sheng (Jason) Wu
📎Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage
➡️
arxiv.org/abs/2410.15531
👏Kaige Xie, Philippe Laban, Prafulla Kumar Choubey, Caiming Xiong, Chien-Sheng (Jason) Wu
📎Measuring Progress in Evaluating Instruction Following with Large Language Models
➡️
arxiv.org/abs/2410.07069
👏Yixin Liu, Kejian Shi, Alex Fabbri, Yilun Zhao, Peifeng Wang, Chien-Sheng (Jason) Wu, Shafiq Rayhan Joty, Arman Cohan
📎CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
➡️
arxiv.org/abs/2411.04329
👏Jierui Li, Hung Le, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Doyen Sahoo
📎On Positional Bias of Faithfulness for Long-form Summarization
➡️
arxiv.org/abs/2410.23609
👏David Wan, Jesse Vig, Mohit Bansal, Shafiq Joty
📎LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
➡️
arxiv.org/abs/2408.08656
👏Do Xuan Long, Hai Nguyen Ngoc, Tiviatis Sim, Hieu Dao, Shafiq Joty, Kenji Kawaguchi, Nancy F. Chen, Min-Yen Kan
📎ParaICL: Towards Parallel In-Context Learning
➡️
arxiv.org/abs/2404.00570
👏Li Xingxuan, Xuan Phi Nguyen, Shafiq Joty, Lidong Bing
📎xLAM: A Family of Large Action Models to Empower AI Agent Systems
➡️Paper:
arxiv.org/abs/2409.03215
✍️Blog:
salesforce.com/blog/ai-agent…
🧠Models:
bit.ly/4faoYaQ
👏Jianguo Zhang, Tian Lan, Ming Zhu, Zuxin Liu, Thai Hoang, Shirley Kokane, Weiran Yao, Juntao Tan, Akshara Prabhakar, Haolin Chen, Zhiwei Liu, Yihao Feng, Tulika Awalgaonkar, Rithesh Murthy, Zeyuan Chen, Ran Xu, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong
📎Tutorial: Adaptation of Large Language Models
🖥️Website coming soon!
👏Zixuan Ke, Yifei Ming, Shafiq Rayhan Joty
Congrats again to our talented team!