Select the split & model below to get automated analyses of the model's performance on the SWE-bench split.
Viewing 's performance on the SWE-bench split, which resolved % of issues. (Logs)
Loading README.md...

% Resolved by Repository

Repository Resolved Total % Resolved


% Resolved by Year

Year Resolved Total % Resolved


Instances by Outcomes




Log Viewer