Fig. 6

The human reviewer and ChatGPT-3.5 Turbo performance variability. (a) ChatGPT-3.5 Turbo prediction performance in each individual run in Steps 1 and 2. (b) The reviewer’s performance in different independent assessment rounds in Steps 1 and 2