Sample stimuli

[Images: sample stimuli 0 through 9]

How to use

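from brainscore_vision import load_benchmark

# Load the benchmark by its identifier.
benchmark = load_benchmark("Geirhos2021cueconflict-error_consistency")
# my_model is a placeholder for a model you have already loaded.
score = benchmark(my_model)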

Model scores

Rank  Score
1     .943
2     .938
3     .855
4     .840
5     .804
6     .784
7     .780
8     .752
9     .748
10    .740
11    .698
12    .686
13    .656
14    .651
15    .633
16    .611
17    .596
18    .595
19    .571
20    .560
21    .560
22    .546
23    .540
24    .482
25    .477
26    .451
27    .448
28    .441
29    .405
30    .402
31    .394
32    .393
33    .390
34    .383
35    .380
36    .376
37    .372
38    .371
39    .371
40    .357
41    .355
42    .346
43    .345
44    .344
45    .343
46    .332
47    .328
48    .327
49    .325
50    .321
51    .317
52    .316
53    .316
54    .316
55    .313
56    .311
57    .309
58    .309
59    .309
60    .300
61    .299
62    .297
63    .294
64    .292
65    .292
66    .284
67    .284
68    .282
69    .278
70    .272
71    .271
72    .271
73    .266
74    .260
75    .255
76    .254
77    .254
78    .253
79    .250
80    .244
81    .242
82    .240
83    .238
84    .237
85    .236
86    .236
87    .235
88    .235
89    .233
90    .233
91    .233
92    .233
93    .232
94    .232
95    .229
96    .228
97    .228
98    .228
99    .226
100   .226
101   .226
102   .226
103   .220
104   .219
105   .218
106   .213
107   .213
108   .213
109   .212
110   .211
111   .210
112   .210
113   .210
114   .208
115   .206
116   .204
117   .204
118   .202
119   .196
120   .191
121   .191
122   .190
123   .189
124   .189
125   .189
126   .189
127   .188
128   .187
129   .186
130   .184
131   .182
132   .181
133   .181
134   .181
135   .180
136   .179
137   .179
138   .177
139   .177
140   .177
141   .175
142   .175
143   .173
144   .173
145   .173
146   .171
147   .167
148   .166
149   .166
150   .165
151   .164
152   .164
153   .163
154   .163
155   .162
156   .162
157   .160
158   .160
159   .159
160   .157
161   .157
162   .157
163   .157
164   .156
165   .156
166   .155
167   .155
168   .154
169   .153
170   .153
171   .153
172   .153
173   .153
174   .153
175   .153
176   .153
177   .152
178   .152
179   .149
180   .147
181   .146
182   .146
183   .146
184   .145
185   .144
186   .144
187   .144
188   .144
189   .144
190   .142
191   .142
192   .141
193   .140
194   .137
195   .137
196   .136
197   .136
198   .135
199   .135
200   .135
201   .134
202   .134
203   .133
204   .133
205   .133
206   .132
207   .132
208   .132
209   .132
210   .132
211   .132
212   .131
213   .131
214   .131
215   .130
216   .128
217   .128
218   .128
219   .128
220   .128
221   .128
222   .128
223   .128
224   .128
225   .128
226   .128
227   .128
228   .127
229   .127
230   .127
231   .125
232   .125
233   .125
234   .123
235   .122
236   .121
237   .121
238   .121
239   .119
240   .117
241   .114
242   .112
243   .112
244   .110
245   .110
246   .109
247   .107
248   .106
249   .102
250   .102
251   .102
252   .101
253   .101
254   .098
255   .098
256   .097
257   .096
258   .096
259   .096
260   .093
261   .091
262   .090
263   .090
264   .087
265   .083
266   .081
267   .079
268   .078
269   .068
270   .068
271   .068
272   .068
273   .067
274   .066
275   .066
276   .063
277   .060
278   .059
279   .055
280   .053
281   .052
282   .050
283   .049
284   .046
285   .045
286   .034
287   .034
288   .032
289   .032
290   .031
291   .028
292   .013
293   .011
294   .011
295   .011
296   .003
297   .003
298   .003

Ranks 299 to 413: no score listed.

Benchmark bibtex

@article{geirhos2021partial,
  title={Partial success in closing the gap between human and machine vision},
  author={Geirhos, Robert and Narayanappa, Kantharaju and Mitzkus, Benjamin and Thieringer, Tizian and Bethge, Matthias and Wichmann, Felix A and Brendel, Wieland},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021},
  url={https://openreview.net/forum?id=QkljT4mrfs}
}

Ceiling

0.33.

Note that scores are relative to this ceiling.
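
As a rough illustration of what "relative to this ceiling" can mean, the sketch below divides a raw metric value by the ceiling. This is an assumption about the normalization, not a confirmed description of how the benchmark computes it; the function name `ceiling_normalize` and the example raw value are hypothetical.

```python
def ceiling_normalize(raw_score: float, ceiling: float = 0.33) -> float:
    """Express a raw score as a fraction of the ceiling.

    Assumption: normalization is a plain division by the ceiling.
    The benchmark's actual procedure may differ (for example, clipping).
    """
    if ceiling <= 0:
        raise ValueError("ceiling must be positive")
    return raw_score / ceiling


# Hypothetical raw value, chosen only to illustrate the arithmetic.
normalized = ceiling_normalize(0.165)
print(round(normalized, 3))
```

Under this reading, a raw value equal to the ceiling would map to 1.0, and a raw value of half the ceiling would map to 0.5.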

Data: Geirhos2021cueconflict

Metric: error_consistency
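
For readers unfamiliar with the metric, error consistency is commonly defined as Cohen's kappa computed over per-trial correctness: it measures whether two observers make errors on the same stimuli more often than their accuracies alone would predict. The sketch below is a minimal standalone version under that definition; it is not the benchmark's own implementation, and the function name `error_consistency` and its inputs are illustrative.

```python
from typing import Sequence


def error_consistency(correct_a: Sequence[bool], correct_b: Sequence[bool]) -> float:
    """Cohen's kappa over trial-by-trial correctness of two observers.

    correct_a[i] and correct_b[i] say whether each observer classified
    stimulus i correctly. Illustrative sketch, not the benchmark's code.
    """
    if len(correct_a) != len(correct_b) or len(correct_a) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(correct_a)
    p_a = sum(correct_a) / n  # accuracy of observer A
    p_b = sum(correct_b) / n  # accuracy of observer B
    # Observed consistency: both correct or both wrong on the same trial.
    c_obs = sum(a == b for a, b in zip(correct_a, correct_b)) / n
    # Consistency expected by chance from the two accuracies alone.
    c_exp = p_a * p_b + (1 - p_a) * (1 - p_b)
    if c_exp == 1.0:
        return float("nan")  # kappa is undefined when chance agreement is total
    return (c_obs - c_exp) / (1 - c_exp)


model = [True, True, False, False]
human = [True, False, True, False]
print(error_consistency(model, human))
```

A value of 1 means the two observers err on exactly the same trials, 0 means their overlap is what chance would give for their accuracies, and negative values mean they systematically err on different trials.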