Sample stimuli

sample 0 sample 1 sample 2 sample 3 sample 4 sample 5 sample 6 sample 7 sample 8 sample 9

How to use

from brainscore_vision import load_benchmark
benchmark = load_benchmark("Geirhos2021falsecolour-error_consistency")
score = benchmark(my_model)

Model scores

Min Alignment Max Alignment

Rank

Model

Score

1
.818
2
.804
3
.780
4
.757
5
.756
6
.755
7
.754
8
.752
9
.752
10
.739
11
.738
12
.735
13
.726
14
.708
15
.706
16
.702
17
.699
18
.688
19
.688
20
.682
21
.668
22
.667
23
.648
24
.640
25
.633
26
.622
27
.622
28
.617
29
.610
30
.610
31
.605
32
.599
33
.588
34
.581
35
.580
36
.580
37
.579
38
.572
39
.568
40
.566
41
.564
42
.564
43
.560
44
.559
45
.559
46
.558
47
.554
48
.547
49
.546
50
.542
51
.539
52
.537
53
.536
54
.532
55
.532
56
.531
57
.528
58
.520
59
.519
60
.517
61
.511
62
.507
63
.505
64
.504
65
.497
66
.495
67
.493
68
.493
69
.492
70
.487
71
.486
72
.483
73
.475
74
.471
75
.470
76
.462
77
.461
78
.453
79
.449
80
.447
81
.440
82
.434
83
.429
84
.428
85
.419
86
.416
87
.409
88
.407
89
.401
90
.392
91
.390
92
.371
93
.368
94
.367
95
.367
96
.364
97
.362
98
.359
99
.356
100
.353
101
.353
102
.353
103
.351
104
.351
105
.348
106
.346
107
.338
108
.338
109
.337
110
.337
111
.336
112
.336
113
.336
114
.336
115
.335
116
.333
117
.333
118
.332
119
.326
120
.324
121
.312
122
.309
123
.309
124
.309
125
.309
126
.306
127
.300
128
.299
129
.299
130
.297
131
.296
132
.295
133
.292
134
.291
135
.289
136
.282
137
.280
138
.269
139
.269
140
.268
141
.267
142
.263
143
.262
144
.257
145
.257
146
.256
147
.256
148
.255
149
.253
150
.252
151
.251
152
.251
153
.249
154
.246
155
.240
156
.238
157
.237
158
.233
159
.231
160
.230
161
.228
162
.226
163
.225
164
.225
165
.223
166
.221
167
.220
168
.219
169
.219
170
.218
171
.218
172
.216
173
.215
174
.212
175
.212
176
.212
177
.209
178
.195
179
.193
180
.193
181
.188
182
.187
183
.183
184
.181
185
.179
186
.175
187
.174
188
.164
189
.163
190
.157
191
.157
192
.155
193
.152
194
.145
195
.144
196
.143
197
.139
198
.137
199
.136
200
.123
201
.119
202
.116
203
.114
204
.113
205
.107
206
.107
207
.106
208
.104
209
.103
210
.101
211
.101
212
.100
213
.092
214
.088
215
.087
216
.086
217
.085
218
.082
219
.073
220
.073
221
.070
222
.069
223
.068
224
.067
225
.066
226
.065
227
.064
228
.064
229
.064
230
.063
231
.056
232
.055
233
.055
234
.055
235
.055
236
.055
237
.055
238
.055
239
.055
240
.055
241
.055
242
.055
243
.055
244
.054
245
.051
246
.049
247
.048
248
.045
249
.045
250
.043
251
.043
252
.043
253
.039
254
.038
255
.037
256
.036
257
.035
258
.035
259
.034
260
.034
261
.034
262
.034
263
.033
264
.031
265
.029
266
.027
267
.027
268
.025
269
.019
270
.019
271
.019
272
.017
273
.016
274
.016
275
.016
276
.016
277
.015
278
.015
279
.014
280
.013
281
.013
282
.012
283
.011
284
.011
285
.010
286
.010
287
.010
288
.010
289
.010
290
.010
291
.010
292
.009
293
.009
294
.009
295
.008
296
.004
297
.004
298
.003
299
.003
300
.003
301
.001
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443

Benchmark bibtex

@article{geirhos2021partial,
              title={Partial success in closing the gap between human and machine vision},
              author={Geirhos, Robert and Narayanappa, Kantharaju and Mitzkus, Benjamin and Thieringer, Tizian and Bethge, Matthias and Wichmann, Felix A and Brendel, Wieland},
              journal={Advances in Neural Information Processing Systems},
              volume={34},
              year={2021},
              url={https://openreview.net/forum?id=QkljT4mrfs}
        }

Ceiling

0.44.

Note that scores are relative to this ceiling.

Data: Geirhos2021falsecolour

Metric: error_consistency