Sample stimuli

sample 0 sample 1 sample 2 sample 3 sample 4 sample 5 sample 6 sample 7 sample 8 sample 9

How to use

from brainscore_vision import load_benchmark
benchmark = load_benchmark("Geirhos2021eidolonIII-error_consistency")
score = benchmark(my_model)

Model scores

Min Alignment Max Alignment

Rank

Model

Score

1
.726
2
.644
3
.641
4
.638
5
.629
6
.610
7
.597
8
.597
9
.582
10
.579
11
.578
12
.574
13
.569
14
.567
15
.564
16
.561
17
.558
18
.556
19
.556
20
.554
21
.551
22
.550
23
.548
24
.546
25
.541
26
.535
27
.535
28
.535
29
.521
30
.518
31
.517
32
.511
33
.505
34
.502
35
.502
36
.501
37
.500
38
.495
39
.494
40
.490
41
.489
42
.489
43
.484
44
.480
45
.477
46
.476
47
.474
48
.473
49
.473
50
.470
51
.469
52
.468
53
.468
54
.468
55
.465
56
.464
57
.464
58
.459
59
.453
60
.453
61
.451
62
.445
63
.444
64
.443
65
.442
66
.441
67
.439
68
.438
69
.437
70
.433
71
.433
72
.432
73
.432
74
.429
75
.427
76
.424
77
.423
78
.418
79
.416
80
.415
81
.415
82
.415
83
.412
84
.412
85
.400
86
.397
87
.395
88
.393
89
.391
90
.388
91
.383
92
.380
93
.379
94
.377
95
.373
96
.369
97
.368
98
.366
99
.364
100
.360
101
.360
102
.357
103
.356
104
.353
105
.353
106
.352
107
.350
108
.346
109
.343
110
.340
111
.334
112
.333
113
.327
114
.324
115
.320
116
.320
117
.319
118
.319
119
.319
120
.318
121
.317
122
.316
123
.315
124
.314
125
.313
126
.313
127
.312
128
.304
129
.302
130
.302
131
.299
132
.299
133
.297
134
.297
135
.295
136
.295
137
.294
138
.294
139
.292
140
.292
141
.291
142
.291
143
.291
144
.291
145
.290
146
.290
147
.287
148
.286
149
.286
150
.285
151
.284
152
.284
153
.283
154
.282
155
.281
156
.278
157
.276
158
.274
159
.273
160
.273
161
.272
162
.271
163
.267
164
.263
165
.262
166
.258
167
.257
168
.256
169
.255
170
.252
171
.251
172
.250
173
.247
174
.245
175
.245
176
.244
177
.242
178
.241
179
.240
180
.238
181
.233
182
.231
183
.228
184
.226
185
.221
186
.220
187
.219
188
.219
189
.219
190
.215
191
.214
192
.213
193
.212
194
.204
195
.203
196
.200
197
.196
198
.190
199
.189
200
.174
201
.173
202
.171
203
.168
204
.168
205
.167
206
.167
207
.167
208
.167
209
.167
210
.167
211
.167
212
.167
213
.167
214
.167
215
.167
216
.162
217
.161
218
.157
219
.154
220
.154
221
.151
222
.151
223
.146
224
.145
225
.142
226
.138
227
.137
228
.136
229
.132
230
.130
231
.129
232
.129
233
.129
234
.127
235
.126
236
.125
237
.124
238
.124
239
.124
240
.123
241
.123
242
.121
243
.121
244
.117
245
.113
246
.112
247
.112
248
.112
249
.112
250
.108
251
.105
252
.105
253
.104
254
.103
255
.101
256
.099
257
.097
258
.096
259
.095
260
.094
261
.093
262
.093
263
.093
264
.091
265
.091
266
.090
267
.090
268
.089
269
.089
270
.085
271
.081
272
.080
273
.079
274
.078
275
.076
276
.071
277
.069
278
.063
279
.062
280
.057
281
.056
282
.040
283
.039
284
.037
285
.037
286
.037
287
.037
288
.037
289
.036
290
.034
291
.033
292
.033
293
.026
294
.023
295
.018
296
.010
297
.007
298
.003
299
.001
300
.000
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433

Benchmark bibtex

@article{geirhos2021partial,
              title={Partial success in closing the gap between human and machine vision},
              author={Geirhos, Robert and Narayanappa, Kantharaju and Mitzkus, Benjamin and Thieringer, Tizian and Bethge, Matthias and Wichmann, Felix A and Brendel, Wieland},
              journal={Advances in Neural Information Processing Systems},
              volume={34},
              year={2021},
              url={https://openreview.net/forum?id=QkljT4mrfs}
        }

Ceiling

0.46.

Note that scores are relative to this ceiling.

Data: Geirhos2021eidolonIII

Metric: error_consistency