Sample stimuli

[Ten sample stimulus images: sample 0 to sample 9]

How to use

from brainscore_vision import load_benchmark

# Load the benchmark by its identifier.
benchmark = load_benchmark("Geirhos2021silhouette-error_consistency")
# my_model is a placeholder for your own model object.
score = benchmark(my_model)

Model scores

Rank  Score
1     1.2
2     1.2
3     1.2
4     1.1
5     1.1
6     1.1
7     1.1
8     1.1
9     1.0
10    1.0
11    1.0
12    .993
13    .990
14    .966
15    .966
16    .959
17    .936
18    .914
19    .908
20    .908
21    .904
22    .900
23    .897
24    .891
25    .888
26    .883
27    .870
28    .856
29    .848
30    .841
31    .841
32    .840
33    .827
34    .825
35    .822
36    .818
37    .817
38    .810
39    .793
40    .781
41    .777
42    .773
43    .767
44    .763
45    .763
46    .750
47    .749
48    .737
49    .728
50    .728
51    .718
52    .706
53    .704
54    .701
55    .685
56    .681
57    .675
58    .674
59    .668
60    .663
61    .660
62    .658
63    .658
64    .648
65    .648
66    .647
67    .646
68    .640
69    .628
70    .618
71    .615
72    .614
73    .610
74    .610
75    .608
76    .602
77    .601
78    .584
79    .578
80    .574
81    .560
82    .558
83    .554
84    .553
85    .553
86    .551
87    .549
88    .548
89    .546
90    .546
91    .546
92    .546
93    .544
94    .535
95    .534
96    .531
97    .524
98    .523
99    .521
100   .521
101   .520
102   .520
103   .510
104   .510
105   .509
106   .500
107   .497
108   .492
109   .492
110   .492
111   .492
112   .491
113   .491
114   .491
115   .482
116   .482
117   .482
118   .478
119   .474
120   .474
121   .469
122   .469
123   .468
124   .465
125   .463
126   .458
127   .451
128   .450
129   .449
130   .445
131   .439
132   .439
133   .438
134   .437
135   .434
136   .421
137   .418
138   .417
139   .409
140   .395
141   .394
142   .392
143   .390
144   .388
145   .384
146   .379
147   .378
148   .378
149   .375
150   .373
151   .369
152   .369
153   .365
154   .364
155   .362
156   .360
157   .358
158   .358
159   .357
160   .353
161   .353
162   .352
163   .349
164   .336
165   .336
166   .335
167   .333
168   .332
169   .328
170   .317
171   .315
172   .314
173   .310
174   .309
175   .304
176   .298
177   .297
178   .296
179   .296
180   .287
181   .287
182   .286
183   .285
184   .280
185   .279
186   .276
187   .270
188   .266
189   .263
190   .262
191   .257
192   .256
193   .255
194   .252
195   .252
196   .249
197   .247
198   .246
199   .245
200   .243
201   .240
202   .236
203   .236
204   .231
205   .231
206   .228
207   .220
208   .214
209   .214
210   .205
211   .204
212   .204
213   .203
214   .203
215   .203
216   .203
217   .203
218   .203
219   .203
220   .203
221   .203
222   .203
223   .203
224   .194
225   .194
226   .193
227   .192
228   .186
229   .185
230   .185
231   .185
232   .180
233   .176
234   .169
235   .166
236   .164
237   .163
238   .161
239   .154
240   .150
241   .149
242   .149
243   .149
244   .148
245   .147
246   .146
247   .145
248   .143
249   .139
250   .139
251   .139
252   .139
253   .139
254   .135
255   .134
256   .129
257   .127
258   .117
259   .112
260   .109
261   .107
262   .106
263   .102
264   .102
265   .098
266   .090
267   .089
268   .089
269   .084
270   .084
271   .084
272   .083
273   .082
274   .082
275   .081
276   .077
277   .077
278   .077
279   .077
280   .077
281   .075
282   .073
283   .070
284   .066
285   .065
286   .059
287   .056
288   .054
289   .053
290   .052
291   .051
292   .049
293   .046
294   .043
295   .042
296   .041
297   .033
298   .023
299   .023
300   .012
301   .004

Ranks 302 to 443 are listed without a score.

Benchmark bibtex

@article{geirhos2021partial,
  title={Partial success in closing the gap between human and machine vision},
  author={Geirhos, Robert and Narayanappa, Kantharaju and Mitzkus, Benjamin and Thieringer, Tizian and Bethge, Matthias and Wichmann, Felix A and Brendel, Wieland},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021},
  url={https://openreview.net/forum?id=QkljT4mrfs}
}

Ceiling

0.48

Note that scores are relative to this ceiling, so a score above 1 (such as the top scores of 1.2) is possible.
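
As a rough illustration of what "relative to this ceiling" can mean, the sketch below divides a raw value by the ceiling. This is an assumption: the page does not spell out the normalisation, and the names `CEILING` and `ceiling_normalize` are hypothetical, not part of the benchmark's code.

```python
CEILING = 0.48  # ceiling reported for this benchmark


def ceiling_normalize(raw_score, ceiling=CEILING):
    """Express a raw score as a fraction of the ceiling.

    Assumes plain division; the benchmark's own normalisation may differ.
    """
    if ceiling <= 0:
        raise ValueError("ceiling must be positive")
    return raw_score / ceiling


if __name__ == "__main__":
    # A raw value equal to the ceiling maps to 1; half the ceiling maps to 0.5.
    print(ceiling_normalize(0.48))
    print(ceiling_normalize(0.24))
```

Under this assumption, a raw value larger than the ceiling would give a normalised score above 1.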

Data: Geirhos2021silhouette

Metric: error_consistency
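
For readers unfamiliar with the metric, here is a minimal sketch of error consistency between two observers, assuming it is computed as Cohen's kappa over trial-by-trial correctness. The function name and the handling of the degenerate case are choices made for this sketch; the benchmark's actual implementation may aggregate over subjects and conditions differently.

```python
def error_consistency(correct_a, correct_b):
    """Cohen's kappa over per-trial correctness of two observers.

    correct_a, correct_b: equal-length sequences of booleans, one entry
    per trial, True where the observer classified the stimulus correctly.
    """
    if len(correct_a) != len(correct_b) or len(correct_a) == 0:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(correct_a)
    acc_a = sum(correct_a) / n
    acc_b = sum(correct_b) / n
    # Observed consistency: share of trials where both are right or both are wrong.
    c_obs = sum(a == b for a, b in zip(correct_a, correct_b)) / n
    # Consistency expected from independent observers with these accuracies.
    c_exp = acc_a * acc_b + (1 - acc_a) * (1 - acc_b)
    if c_exp == 1.0:
        # Kappa is undefined here; returning 0.0 is a choice of this sketch.
        return 0.0
    return (c_obs - c_exp) / (1 - c_exp)


if __name__ == "__main__":
    human = [True, True, False, False]
    model = [True, False, False, True]
    print(error_consistency(human, human))  # identical error patterns
    print(error_consistency(human, model))  # overlap at chance level
```

A value of 1 means the two observers err on exactly the same trials, 0 means their overlap is what their accuracies alone would predict, and negative values mean they err on systematically different trials.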