Findings of ACL

FAER: Benchmarking VLMs for Failure-Aware Embodied Reasoning

Findings of ACL

Hao, Song and Kaifeng, Liu and Yuanxing, Liu and Xiang, Tian and Xuesong, Wang and Yifan, Chen and Weinan, Zhang and Ting, Liu

FAER: Benchmarking VLMs for Failure-Aware Embodied Reasoning

Findings of ACL

Hao, Song and Kaifeng, Liu and Yuanxing, Liu and Xiang, Tian and Xuesong, Wang and Yifan, Chen and Weinan, Zhang and Ting, Liu

I run as fast as a rabbit, can you? A Multilingual Simile Dialogues Datasets

Findings of ACL

Ma, Longxuan and Zhang, Weinan and Zhou, Shuhan and Sun, Churui and Ke, Changxin and Liu, Ting

I run as fast as a rabbit, can you? A Multilingual Simile Dialogues Datasets

Findings of ACL

Ma, Longxuan and Zhang, Weinan and Zhou, Shuhan and Sun, Churui and Ke, Changxin and Liu, Ting

What did you refer to? Evaluating Co-references in Dialogue

Findings of ACL

Zhang, Weinan and Zhang, Yue and Tang, Hanlin and Zhao, Zhengyu and Zhu, Caihai and Liu, Ting

What did you refer to? Evaluating Co-references in Dialogue

Findings of ACL

Zhang, Weinan and Zhang, Yue and Tang, Hanlin and Zhao, Zhengyu and Zhu, Caihai and Liu, Ting

Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models

findings of ACL

Mingda, Li and Xinyu, Li and Yifan, Chen and Wenfeng, Xuan and Weinan, Zhang

Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models

findings of ACL

Mingda, Li and Xinyu, Li and Yifan, Chen and Wenfeng, Xuan and Weinan, Zhang