Sample stimuli

sample 0 sample 1 sample 2 sample 3 sample 4 sample 5 sample 6 sample 7 sample 8 sample 9

How to use

from brainscore_vision import load_benchmark
benchmark = load_benchmark("Geirhos2021edge-error_consistency")
score = benchmark(my_model)

Model scores

Min Alignment Max Alignment

Rank

Model

Score

1
1.0
2
1.0
3
1.0
4
1.0
5
.995
6
.979
7
.899
8
.854
9
.807
10
.753
11
.744
12
.728
13
.719
14
.699
15
.670
16
.645
17
.635
18
.619
19
.618
20
.608
21
.543
22
.543
23
.531
24
.519
25
.516
26
.516
27
.502
28
.447
29
.395
30
.363
31
.285
32
.264
33
.264
34
.252
35
.243
36
.232
37
.219
38
.200
39
.198
40
.195
41
.194
42
.192
43
.187
44
.184
45
.183
46
.182
47
.182
48
.182
49
.175
50
.173
51
.168
52
.163
53
.163
54
.160
55
.158
56
.158
57
.153
58
.153
59
.151
60
.151
61
.151
62
.151
63
.150
64
.148
65
.148
66
.148
67
.147
68
.144
69
.144
70
.140
71
.139
72
.139
73
.139
74
.138
75
.133
76
.133
77
.132
78
.132
79
.132
80
.132
81
.132
82
.131
83
.128
84
.124
85
.124
86
.124
87
.124
88
.123
89
.122
90
.116
91
.116
92
.116
93
.115
94
.115
95
.115
96
.114
97
.113
98
.113
99
.113
100
.109
101
.109
102
.107
103
.107
104
.107
105
.107
106
.107
107
.107
108
.106
109
.106
110
.106
111
.106
112
.105
113
.105
114
.105
115
.104
116
.104
117
.101
118
.100
119
.099
120
.099
121
.099
122
.098
123
.098
124
.098
125
.098
126
.098
127
.093
128
.092
129
.092
130
.092
131
.092
132
.092
133
.092
134
.092
135
.092
136
.092
137
.092
138
.091
139
.091
140
.091
141
.091
142
.091
143
.091
144
.091
145
.091
146
.090
147
.090
148
.085
149
.085
150
.084
151
.084
152
.084
153
.084
154
.084
155
.084
156
.084
157
.084
158
.084
159
.084
160
.084
161
.084
162
.084
163
.084
164
.084
165
.084
166
.083
167
.083
168
.083
169
.079
170
.078
171
.078
172
.077
173
.077
174
.077
175
.077
176
.077
177
.077
178
.077
179
.077
180
.077
181
.076
182
.076
183
.076
184
.076
185
.076
186
.076
187
.076
188
.076
189
.076
190
.076
191
.075
192
.075
193
.075
194
.075
195
.074
196
.074
197
.074
198
.074
199
.072
200
.070
201
.070
202
.070
203
.070
204
.069
205
.069
206
.069
207
.069
208
.069
209
.069
210
.069
211
.068
212
.068
213
.068
214
.068
215
.068
216
.068
217
.068
218
.068
219
.068
220
.066
221
.064
222
.063
223
.063
224
.063
225
.063
226
.062
227
.062
228
.062
229
.062
230
.062
231
.062
232
.062
233
.062
234
.062
235
.062
236
.062
237
.062
238
.061
239
.061
240
.061
241
.060
242
.059
243
.059
244
.056
245
.056
246
.055
247
.055
248
.055
249
.055
250
.055
251
.055
252
.055
253
.055
254
.055
255
.055
256
.055
257
.055
258
.055
259
.055
260
.055
261
.055
262
.055
263
.055
264
.055
265
.055
266
.055
267
.055
268
.055
269
.055
270
.055
271
.055
272
.055
273
.055
274
.055
275
.055
276
.055
277
.053
278
.053
279
.053
280
.053
281
.053
282
.050
283
.046
284
.046
285
.042
286
.042
287
.042
288
.039
289
.039
290
.038
291
.037
292
.036
293
.035
294
.034
295
.034
296
.032
297
.032
298
.032
299
.032
300
.032
301
.032
302
.032
303
.024
304
.024
305
.024
306
.024
307
.024
308
.024
309
.024
310
.023
311
.021
312
.021
313
.021
314
.021
315
.021
316
.015
317
.014
318
.011
319
.010
320
.007
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480

Benchmark bibtex

@article{geirhos2021partial,
              title={Partial success in closing the gap between human and machine vision},
              author={Geirhos, Robert and Narayanappa, Kantharaju and Mitzkus, Benjamin and Thieringer, Tizian and Bethge, Matthias and Wichmann, Felix A and Brendel, Wieland},
              journal={Advances in Neural Information Processing Systems},
              volume={34},
              year={2021},
              url={https://openreview.net/forum?id=QkljT4mrfs}
        }

Ceiling

0.32.

Note that scores are relative to this ceiling.

Data: Geirhos2021edge

Metric: error_consistency