Sample stimuli

sample 0 sample 1 sample 2 sample 3 sample 4 sample 5 sample 6 sample 7 sample 8 sample 9

How to use

from brainscore_vision import load_benchmark
benchmark = load_benchmark("Geirhos2021eidolonI-error_consistency")
score = benchmark(my_model)

Model scores

Min Alignment Max Alignment

Rank

Model

Score

1
.789
2
.783
3
.777
4
.776
5
.731
6
.724
7
.715
8
.698
9
.689
10
.677
11
.667
12
.663
13
.661
14
.655
15
.651
16
.647
17
.646
18
.638
19
.637
20
.637
21
.636
22
.629
23
.629
24
.621
25
.619
26
.619
27
.619
28
.615
29
.614
30
.610
31
.607
32
.603
33
.593
34
.593
35
.584
36
.583
37
.582
38
.577
39
.575
40
.574
41
.574
42
.567
43
.556
44
.553
45
.551
46
.549
47
.546
48
.541
49
.541
50
.537
51
.529
52
.527
53
.527
54
.524
55
.522
56
.520
57
.519
58
.513
59
.513
60
.512
61
.505
62
.499
63
.498
64
.497
65
.492
66
.491
67
.487
68
.482
69
.481
70
.476
71
.476
72
.472
73
.471
74
.470
75
.464
76
.459
77
.459
78
.459
79
.456
80
.453
81
.448
82
.446
83
.445
84
.443
85
.443
86
.443
87
.442
88
.436
89
.435
90
.432
91
.432
92
.431
93
.430
94
.429
95
.424
96
.417
97
.417
98
.416
99
.413
100
.412
101
.409
102
.408
103
.406
104
.403
105
.401
106
.400
107
.400
108
.400
109
.400
110
.394
111
.393
112
.384
113
.383
114
.378
115
.377
116
.371
117
.367
118
.363
119
.359
120
.355
121
.355
122
.354
123
.352
124
.350
125
.349
126
.346
127
.344
128
.341
129
.338
130
.336
131
.336
132
.335
133
.334
134
.333
135
.331
136
.329
137
.328
138
.327
139
.326
140
.325
141
.324
142
.324
143
.318
144
.318
145
.317
146
.309
147
.304
148
.301
149
.299
150
.297
151
.297
152
.297
153
.296
154
.295
155
.295
156
.293
157
.291
158
.290
159
.288
160
.287
161
.287
162
.286
163
.281
164
.281
165
.281
166
.281
167
.281
168
.281
169
.281
170
.281
171
.281
172
.281
173
.281
174
.281
175
.281
176
.281
177
.281
178
.277
179
.276
180
.274
181
.272
182
.269
183
.264
184
.260
185
.256
186
.255
187
.253
188
.252
189
.251
190
.250
191
.249
192
.247
193
.245
194
.243
195
.242
196
.242
197
.242
198
.241
199
.239
200
.238
201
.235
202
.230
203
.229
204
.223
205
.223
206
.222
207
.218
208
.217
209
.213
210
.210
211
.210
212
.208
213
.208
214
.208
215
.206
216
.204
217
.198
218
.197
219
.196
220
.196
221
.194
222
.192
223
.188
224
.186
225
.182
226
.181
227
.180
228
.180
229
.180
230
.180
231
.180
232
.180
233
.179
234
.179
235
.175
236
.171
237
.169
238
.168
239
.164
240
.161
241
.161
242
.160
243
.160
244
.159
245
.159
246
.156
247
.156
248
.150
249
.149
250
.148
251
.143
252
.141
253
.140
254
.140
255
.138
256
.132
257
.132
258
.131
259
.131
260
.129
261
.126
262
.126
263
.125
264
.122
265
.120
266
.120
267
.113
268
.113
269
.111
270
.107
271
.098
272
.097
273
.097
274
.094
275
.094
276
.094
277
.093
278
.089
279
.085
280
.085
281
.085
282
.085
283
.084
284
.084
285
.077
286
.077
287
.070
288
.065
289
.064
290
.063
291
.059
292
.058
293
.058
294
.057
295
.056
296
.055
297
.051
298
.044
299
.042
300
.041
301
.038
302
.036
303
.029
304
.015
305
.014
306
.013
307
.009
308
.008
309
.008
310
.004
311
.002
312
.002
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460

Benchmark bibtex

@article{geirhos2021partial,
              title={Partial success in closing the gap between human and machine vision},
              author={Geirhos, Robert and Narayanappa, Kantharaju and Mitzkus, Benjamin and Thieringer, Tizian and Bethge, Matthias and Wichmann, Felix A and Brendel, Wieland},
              journal={Advances in Neural Information Processing Systems},
              volume={34},
              year={2021},
              url={https://openreview.net/forum?id=QkljT4mrfs}
        }

Ceiling

0.39.

Note that scores are relative to this ceiling.

Data: Geirhos2021eidolonI

Metric: error_consistency