How to use

from brainscore_vision import load_benchmark

# `my_model` is a placeholder for a Brain-Score-compatible model that you supply.
benchmark = load_benchmark("Geirhos2021eidolonIII-error_consistency")
score = benchmark(my_model)

Model scores

Rank     Score
1        .726
2        .644
3        .641
4        .638
5        .629
6        .610
7        .597
8        .597
9        .582
10       .579
11       .578
12       .574
13       .569
14       .567
15       .564
16       .561
17       .558
18       .556
19       .556
20       .554
21       .551
22       .550
23       .548
24       .546
25       .541
26       .535
27       .535
28       .535
29       .521
30       .518
31       .517
32       .511
33       .505
34       .502
35       .502
36       .501
37       .500
38       .495
39       .494
40       .490
41       .489
42       .489
43       .484
44       .480
45       .477
46       .476
47       .474
48       .473
49       .473
50       .470
51       .469
52       .468
53       .468
54       .468
55       .465
56       .464
57       .464
58       .459
59       .453
60       .453
61       .451
62       .445
63       .444
64       .443
65       .442
66       .441
67       .439
68       .438
69       .437
70       .433
71       .433
72       .432
73       .432
74       .429
75       .427
76       .424
77       .423
78       .418
79       .416
80       .415
81       .415
82       .415
83       .412
84       .412
85       .400
86       .397
87       .395
88       .393
89       .391
90       .388
91       .383
92       .380
93       .379
94       .377
95       .373
96       .369
97       .368
98       .366
99       .364
100      .363
101      .360
102      .360
103      .357
104      .356
105      .353
106      .353
107      .352
108      .350
109      .346
110      .343
111      .340
112      .334
113      .333
114      .327
115      .324
116      .320
117      .320
118      .319
119      .319
120      .319
121      .318
122      .317
123      .316
124      .315
125      .314
126      .313
127      .313
128      .312
129      .304
130      .302
131      .302
132      .299
133      .299
134      .297
135      .297
136      .295
137      .295
138      .294
139      .294
140      .292
141      .292
142      .291
143      .291
144      .291
145      .291
146      .290
147      .290
148      .287
149      .286
150      .286
151      .285
152      .284
153      .284
154      .283
155      .282
156      .281
157      .278
158      .276
159      .274
160      .273
161      .273
162      .272
163      .271
164      .267
165      .263
166      .262
167      .258
168      .257
169      .256
170      .255
171      .252
172      .251
173      .250
174      .247
175      .245
176      .245
177      .244
178      .242
179      .241
180      .240
181      .238
182      .233
183      .231
184      .228
185      .226
186      .221
187      .220
188      .219
189      .219
190      .219
191      .215
192      .214
193      .213
194      .212
195      .204
196      .203
197      .200
198      .196
199      .190
200      .189
201      .174
202      .173
203      .171
204      .168
205      .168
206      .167
207      .167
208      .167
209      .167
210      .167
211      .167
212      .167
213      .167
214      .167
215      .167
216      .167
217      .162
218      .161
219      .157
220      .154
221      .154
222      .151
223      .151
224      .146
225      .145
226      .142
227      .138
228      .137
229      .136
230      .132
231      .130
232      .129
233      .129
234      .129
235      .127
236      .126
237      .125
238      .124
239      .124
240      .124
241      .123
242      .123
243      .121
244      .121
245      .117
246      .113
247      .112
248      .112
249      .112
250      .112
251      .108
252      .105
253      .105
254      .104
255      .103
256      .101
257      .099
258      .097
259      .096
260      .095
261      .094
262      .093
263      .093
264      .093
265      .091
266      .091
267      .090
268      .090
269      .089
270      .089
271      .085
272      .081
273      .080
274      .079
275      .078
276      .076
277      .071
278      .069
279      .063
280      .062
281      .057
282      .056
283      .040
284      .039
285      .037
286      .037
287      .037
288      .037
289      .037
290      .036
291      .034
292      .033
293      .033
294      .026
295      .023
296      .018
297      .010
298      .007
299      .003
300      .001
301      .000
302-442  (no score listed)

Benchmark bibtex

@article{geirhos2021partial,
  title={Partial success in closing the gap between human and machine vision},
  author={Geirhos, Robert and Narayanappa, Kantharaju and Mitzkus, Benjamin and Thieringer, Tizian and Bethge, Matthias and Wichmann, Felix A and Brendel, Wieland},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021},
  url={https://openreview.net/forum?id=QkljT4mrfs}
}

Ceiling

0.46

Note that the model scores listed for this benchmark are reported relative to this ceiling.
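
To make the ceiling note concrete, here is a minimal sketch of converting between a listed score and a raw metric value. It assumes that "relative to this ceiling" means the raw score divided by the ceiling; that reading is an assumption and is not confirmed by this page. The helper names `to_raw` and `to_ceiled` are hypothetical and are not part of brainscore_vision.

```python
CEILING = 0.46  # ceiling stated for this benchmark


def to_raw(ceiled_score, ceiling=CEILING):
    """Recover a raw metric value, ASSUMING ceiled = raw / ceiling."""
    return ceiled_score * ceiling


def to_ceiled(raw_score, ceiling=CEILING):
    """Express a raw metric value relative to the ceiling (same assumption)."""
    return raw_score / ceiling


if __name__ == "__main__":
    # Under the division assumption, the top listed score of .726
    # corresponds to a raw value of about 0.334.
    print(round(to_raw(0.726), 3))
```

Under this assumption, a listed score of 1.0 would mean the model reaches the ceiling itself.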

Data: Geirhos2021eidolonIII

Metric: error_consistency
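
For readers unfamiliar with the metric, the sketch below shows error consistency as it is commonly defined: Cohen's kappa computed over per-trial correctness of two observers (for example, a model and a human). It is an illustrative, standalone version, not the brainscore_vision implementation; the function name and the choice to return NaN in the degenerate case are assumptions made for this example.

```python
import math


def error_consistency(correct_a, correct_b):
    """Cohen's kappa over per-trial correctness of two observers.

    correct_a, correct_b: equal-length sequences of booleans, one entry per
    trial, True where that observer classified the stimulus correctly.
    """
    if len(correct_a) != len(correct_b) or len(correct_a) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    a = [bool(x) for x in correct_a]
    b = [bool(x) for x in correct_b]
    n = len(a)

    p_a = sum(a) / n  # accuracy of observer A
    p_b = sum(b) / n  # accuracy of observer B

    # Observed consistency: both correct or both wrong on the same trial.
    c_obs = sum(x == y for x, y in zip(a, b)) / n
    # Consistency expected from two independent observers with these accuracies.
    c_exp = p_a * p_b + (1 - p_a) * (1 - p_b)

    if c_exp == 1.0:
        # Both observers are always right or always wrong: kappa is undefined.
        return math.nan
    return (c_obs - c_exp) / (1 - c_exp)


if __name__ == "__main__":
    model = [True, True, False, False]
    human = [True, False, True, False]
    print(error_consistency(model, human))  # agreement at chance level
```

A value of 1 means the two observers err on exactly the same trials, 0 means their overlap is what independent observers with the same accuracies would produce, and negative values mean they systematically err on different trials.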