Sample stimuli

sample 0 sample 1 sample 2 sample 3 sample 4 sample 5 sample 6 sample 7 sample 8 sample 9

How to use

from brainscore_vision import load_benchmark
benchmark = load_benchmark("Geirhos2021cueconflict-error_consistency")
score = benchmark(my_model)

Model scores

Min Alignment Max Alignment

Rank

Model

Score

1
.943
2
.938
3
.855
4
.840
5
.804
6
.784
7
.780
8
.752
9
.748
10
.740
11
.698
12
.686
13
.656
14
.651
15
.633
16
.611
17
.596
18
.595
19
.571
20
.560
21
.560
22
.546
23
.540
24
.482
25
.477
26
.451
27
.448
28
.441
29
.405
30
.402
31
.394
32
.393
33
.390
34
.383
35
.380
36
.376
37
.372
38
.371
39
.371
40
.357
41
.355
42
.346
43
.345
44
.344
45
.343
46
.332
47
.328
48
.327
49
.325
50
.321
51
.317
52
.316
53
.316
54
.316
55
.313
56
.311
57
.309
58
.309
59
.309
60
.300
61
.299
62
.297
63
.294
64
.293
65
.292
66
.292
67
.284
68
.284
69
.282
70
.278
71
.272
72
.271
73
.271
74
.266
75
.260
76
.255
77
.254
78
.254
79
.253
80
.250
81
.244
82
.242
83
.240
84
.238
85
.237
86
.236
87
.236
88
.235
89
.235
90
.233
91
.233
92
.233
93
.233
94
.232
95
.232
96
.229
97
.229
98
.228
99
.228
100
.228
101
.226
102
.226
103
.226
104
.226
105
.220
106
.219
107
.218
108
.213
109
.213
110
.213
111
.212
112
.211
113
.210
114
.210
115
.210
116
.208
117
.206
118
.204
119
.204
120
.202
121
.196
122
.191
123
.191
124
.190
125
.189
126
.189
127
.189
128
.189
129
.188
130
.187
131
.186
132
.184
133
.182
134
.181
135
.181
136
.181
137
.180
138
.179
139
.179
140
.177
141
.177
142
.177
143
.175
144
.175
145
.173
146
.173
147
.173
148
.171
149
.167
150
.166
151
.166
152
.165
153
.164
154
.164
155
.163
156
.163
157
.162
158
.162
159
.160
160
.160
161
.160
162
.159
163
.157
164
.157
165
.157
166
.157
167
.156
168
.156
169
.155
170
.155
171
.154
172
.153
173
.153
174
.153
175
.153
176
.153
177
.153
178
.153
179
.153
180
.152
181
.152
182
.149
183
.147
184
.146
185
.146
186
.146
187
.145
188
.144
189
.144
190
.144
191
.144
192
.144
193
.142
194
.142
195
.141
196
.140
197
.137
198
.137
199
.136
200
.136
201
.135
202
.135
203
.135
204
.134
205
.134
206
.133
207
.133
208
.133
209
.132
210
.132
211
.132
212
.132
213
.132
214
.132
215
.131
216
.131
217
.131
218
.130
219
.128
220
.128
221
.128
222
.128
223
.128
224
.128
225
.128
226
.128
227
.128
228
.128
229
.128
230
.128
231
.127
232
.127
233
.127
234
.125
235
.125
236
.125
237
.123
238
.122
239
.121
240
.121
241
.121
242
.119
243
.117
244
.114
245
.112
246
.112
247
.110
248
.110
249
.109
250
.107
251
.106
252
.102
253
.102
254
.102
255
.101
256
.101
257
.098
258
.098
259
.097
260
.096
261
.096
262
.096
263
.093
264
.091
265
.090
266
.090
267
.087
268
.083
269
.081
270
.079
271
.078
272
.068
273
.068
274
.068
275
.068
276
.067
277
.066
278
.066
279
.063
280
.060
281
.059
282
.055
283
.053
284
.052
285
.050
286
.049
287
.046
288
.045
289
.034
290
.034
291
.032
292
.032
293
.031
294
.028
295
.013
296
.011
297
.011
298
.011
299
.003
300
.003
301
.003
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444

Benchmark bibtex

@article{geirhos2021partial,
              title={Partial success in closing the gap between human and machine vision},
              author={Geirhos, Robert and Narayanappa, Kantharaju and Mitzkus, Benjamin and Thieringer, Tizian and Bethge, Matthias and Wichmann, Felix A and Brendel, Wieland},
              journal={Advances in Neural Information Processing Systems},
              volume={34},
              year={2021},
              url={https://openreview.net/forum?id=QkljT4mrfs}
        }

Ceiling

0.33.

Note that scores are relative to this ceiling.

Data: Geirhos2021cueconflict

Metric: error_consistency