Sample stimuli

[Ten sample stimulus images: sample 0 through sample 9]

How to use

from brainscore_vision import load_benchmark

# Load the benchmark by its identifier.
benchmark = load_benchmark("Geirhos2021rotation-error_consistency")
# my_model is a placeholder for the model under test.
score = benchmark(my_model)

Model scores

Rank  Score
1     .646
2     .614
3     .606
4     .589
5     .581
6     .571
7     .568
8     .565
9     .561
10    .559
11    .546
12    .539
13    .528
14    .527
15    .516
16    .506
17    .500
18    .489
19    .488
20    .478
21    .476
22    .467
23    .457
24    .431
25    .429
26    .425
27    .417
28    .416
29    .413
30    .405
31    .404
32    .404
33    .404
34    .400
35    .391
36    .385
37    .380
38    .370
39    .368
40    .367
41    .357
42    .356
43    .351
44    .344
45    .343
46    .338
47    .334
48    .325
49    .308
50    .282
51    .273
52    .273
53    .273
54    .268
55    .266
56    .264
57    .261
58    .259
59    .255
60    .255
61    .254
62    .250
63    .249
64    .239
65    .236
66    .236
67    .226
68    .225
69    .225
70    .224
71    .223
72    .221
73    .218
74    .213
75    .212
76    .206
77    .206
78    .204
79    .204
80    .201
81    .197
82    .196
83    .195
84    .195
85    .194
86    .190
87    .190
88    .182
89    .181
90    .181
91    .179
92    .177
93    .175
94    .172
95    .172
96    .172
97    .167
98    .166
99    .165
100   .164
101   .161
102   .161
103   .161
104   .160
105   .160
106   .156
107   .155
108   .154
109   .154
110   .154
111   .154
112   .153
113   .152
114   .152
115   .150
116   .150
117   .150
118   .149
119   .149
120   .149
121   .144
122   .143
123   .143
124   .140
125   .137
126   .137
127   .135
128   .133
129   .132
130   .131
131   .130
132   .130
133   .130
134   .130
135   .130
136   .129
137   .124
138   .124
139   .123
140   .123
141   .121
142   .120
143   .119
144   .118
145   .117
146   .116
147   .116
148   .116
149   .115
150   .115
151   .115
152   .114
153   .114
154   .114
155   .114
156   .112
157   .109
158   .105
159   .105
160   .101
161   .100
162   .098
163   .098
164   .096
165   .096
166   .095
167   .094
168   .094
169   .092
170   .092
171   .091
172   .091
173   .090
174   .089
175   .089
176   .088
177   .087
178   .087
179   .085
180   .085
181   .085
182   .085
183   .085
184   .084
185   .081
186   .081
187   .080
188   .080
189   .079
190   .079
191   .079
192   .078
193   .075
194   .074
195   .073
196   .073
197   .073
198   .070
199   .070
200   .069
201   .069
202   .068
203   .067
204   .064
205   .064
206   .064
207   .063
208   .063
209   .063
210   .062
211   .062
212   .062
213   .061
214   .061
215   .061
216   .061
217   .060
218   .060
219   .059
220   .059
221   .059
222   .058
223   .058
224   .058
225   .058
226   .058
227   .058
228   .057
229   .057
230   .056
231   .055
232   .055
233   .055
234   .055
235   .054
236   .054
237   .053
238   .052
239   .052
240   .052
241   .052
242   .050
243   .050
244   .050
245   .049
246   .049
247   .049
248   .049
249   .049
250   .049
251   .049
252   .049
253   .049
254   .049
255   .049
256   .049
257   .049
258   .048
259   .046
260   .045
261   .044
262   .044
263   .042
264   .042
265   .038
266   .037
267   .037
268   .035
269   .035
270   .033
271   .032
272   .029
273   .023
274   .021
275   .018
276   .018
277   .018
278   .016
279   .015
280   .012
281   .011
282   .011
283   .009
284   .009
285   .009
286   .009
287   .009
288   .009
289   .008
290   .008
291   .008
292   .007
293   .007
294   .007
295   .006
296   .004
297   .003
298   .002
299   .001
300   .001

Ranks 301 through 443 have no score listed.

Benchmark bibtex

@article{geirhos2021partial,
  title={Partial success in closing the gap between human and machine vision},
  author={Geirhos, Robert and Narayanappa, Kantharaju and Mitzkus, Benjamin and Thieringer, Tizian and Bethge, Matthias and Wichmann, Felix A and Brendel, Wieland},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021},
  url={https://openreview.net/forum?id=QkljT4mrfs}
}

Ceiling

0.44

Note that the model scores listed above are reported relative to this ceiling.
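
To make "relative to this ceiling" concrete, here is a minimal sketch that converts between a raw score and a ceiling-relative score. It assumes the relationship is a plain division by the ceiling; the benchmark's actual normalization may differ, and the function names to_ceiled and to_raw are hypothetical, not part of brainscore_vision.

```python
CEILING = 0.44  # ceiling stated for this benchmark


def to_ceiled(raw_score, ceiling=CEILING):
    """Express a raw score relative to the ceiling (assumed: raw / ceiling)."""
    if ceiling <= 0:
        raise ValueError("ceiling must be positive")
    return raw_score / ceiling


def to_raw(ceiled_score, ceiling=CEILING):
    """Invert the assumed normalization (ceiled * ceiling)."""
    if ceiling <= 0:
        raise ValueError("ceiling must be positive")
    return ceiled_score * ceiling
```

Under this assumption, a listed score of .646 would correspond to a raw value of about 0.284 (0.646 × 0.44).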

Data: Geirhos2021rotation

Metric: error_consistency
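
For readers unfamiliar with the metric, the sketch below illustrates error consistency as it is commonly defined: Cohen's kappa computed over per-trial correctness of two observers (for example, a model and a human) on the same stimuli. This is an illustration under that assumption, not the benchmark's own implementation; the function name and the list-of-booleans input format are hypothetical.

```python
def error_consistency(correct_a, correct_b):
    """Cohen's kappa over per-trial correctness of two observers.

    correct_a, correct_b: equal-length sequences of booleans (or 0/1),
    one entry per trial, True where the observer answered correctly.
    """
    if len(correct_a) != len(correct_b) or len(correct_a) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    a = [bool(x) for x in correct_a]
    b = [bool(x) for x in correct_b]
    n = len(a)
    p_a = sum(a) / n  # accuracy of observer A
    p_b = sum(b) / n  # accuracy of observer B
    # Observed consistency: both correct or both wrong on the same trial.
    c_obs = sum(x == y for x, y in zip(a, b)) / n
    # Consistency expected by chance for independent observers
    # with these accuracies.
    c_exp = p_a * p_b + (1 - p_a) * (1 - p_b)
    if c_exp == 1.0:
        return float("nan")  # kappa is undefined in this case
    return (c_obs - c_exp) / (1 - c_exp)
```

A value of 1 means the two observers err on exactly the same trials, 0 means their overlap is no more than expected from their accuracies alone, and negative values mean they systematically err on different trials.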