Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor O Honovich, T Scialom, O Levy, T Schick arXiv preprint arXiv:2212.09689, 2022 | 252 | 2022 |
TRUE: Re-evaluating Factual Consistency Evaluation O Honovich, R Aharoni, J Herzig, H Taitelbaum, D Kukliansy, V Cohen, ... NAACL 2022, 2022 | 185 | 2022 |
: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering O Honovich, L Choshen, R Aharoni, E Neeman, I Szpektor, O Abend EMNLP 2021, 2021 | 174 | 2021 |
Instruction Induction: From Few Examples to Natural Language Task Descriptions O Honovich, U Shaham, SR Bowman, O Levy arXiv preprint arXiv:2205.10782, 2022 | 110 | 2022 |
DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering E Neeman, R Aharoni, O Honovich, L Choshen, I Szpektor, O Abend arXiv preprint arXiv:2211.05655, 2022 | 46 | 2022 |
LMentry: A Language Model Benchmark of Elementary Language Tasks A Efrat, O Honovich, O Levy arXiv preprint arXiv:2211.02069, 2022 | 21 | 2022 |
Machine reading of historical events O Honovich, LT Hennigen, O Abend, SB Cohen Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020 | 9 | 2020 |
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains A Jacovi, Y Bitton, B Bohnet, J Herzig, O Honovich, M Tseng, M Collins, ... arXiv preprint arXiv:2402.00559, 2024 | 5 | 2024 |
Surfacing Biases in Large Language Models using Contrastive Input Decoding G Yona, O Honovich, I Laish, R Aharoni arXiv preprint arXiv:2305.07378, 2023 | 5 | 2023 |