Sample stimuli

sample 0 sample 1 sample 2 sample 3 sample 4 sample 5 sample 6 sample 7 sample 8 sample 9

How to use

from brainscore_vision import load_benchmark
benchmark = load_benchmark("Maniquet2024-confusion_similarity")
score = benchmark(my_model)

Model scores

Min Alignment Max Alignment

Rank

Model

Score

1
1.0
2
1.0
3
1.0
4
1.0
5
1.0
6
1.0
7
1.0
8
1.0
9
1.0
10
1.0
11
.994
12
.955
13
.946
14
.944
15
.928
16
.928
17
.928
18
.928
19
.917
20
.896
21
.893
22
.892
23
.890
24
.886
25
.886
26
.878
27
.873
28
.873
29
.867
30
.858
31
.850
32
.832
33
.832
34
.832
35
.832
36
.831
37
.826
38
.823
39
.822
40
.821
41
.820
42
.818
43
.813
44
.813
45
.808
46
.808
47
.804
48
.803
49
.798
50
.798
51
.794
52
.793
53
.785
54
.763
55
.759
56
.759
57
.753
58
.753
59
.753
60
.751
61
.751
62
.748
63
.743
64
.738
65
.738
66
.737
67
.736
68
.729
69
.710
70
.708
71
.708
72
.695
73
.689
74
.679
75
.678
76
.677
77
.669
78
.668
79
.665
80
.665
81
.662
82
.651
83
.645
84
.640
85
.634
86
.632
87
.632
88
.630
89
.626
90
.622
91
.607
92
.603
93
.600
94
.598
95
.596
96
.588
97
.587
98
.586
99
.583
100
.571
101
.569
102
.568
103
.565
104
.564
105
.562
106
.562
107
.562
108
.561
109
.560
110
.559
111
.555
112
.555
113
.553
114
.553
115
.552
116
.550
117
.547
118
.546
119
.541
120
.541
121
.540
122
.538
123
.535
124
.533
125
.533
126
.530
127
.529
128
.524
129
.520
130
.520
131
.516
132
.513
133
.508
134
.508
135
.502
136
.502
137
.502
138
.500
139
.498
140
.498
141
.497
142
.496
143
.496
144
.495
145
.494
146
.490
147
.490
148
.488
149
.487
150
.486
151
.486
152
.482
153
.482
154
.480
155
.475
156
.473
157
.473
158
.472
159
.460
160
.460
161
.459
162
.456
163
.454
164
.453
165
.453
166
.452
167
.450
168
.448
169
.444
170
.441
171
.438
172
.437
173
.436
174
.436
175
.435
176
.431
177
.430
178
.428
179
.424
180
.419
181
.418
182
.418
183
.416
184
.412
185
.410
186
.409
187
.407
188
.407
189
.406
190
.406
191
.405
192
.403
193
.395
194
.392
195
.391
196
.387
197
.381
198
.381
199
.375
200
.371
201
.367
202
.365
203
.365
204
.362
205
.358
206
.358
207
.355
208
.351
209
.350
210
.348
211
.348
212
.346
213
.345
214
.341
215
.341
216
.341
217
.340
218
.337
219
.326
220
.326
221
.324
222
.324
223
.323
224
.322
225
.318
226
.315
227
.314
228
.312
229
.309
230
.305
231
.302
232
.298
233
.293
234
.289
235
.287
236
.284
237
.281
238
.280
239
.277
240
.270
241
.270
242
.262
243
.258
244
.255
245
.249
246
.247
247
.239
248
.238
249
.232
250
.227
251
.220
252
.212
253
.208
254
.206
255
.197
256
.195
257
.195
258
.194
259
.192
260
.189
261
.186
262
.183
263
.175
264
.167
265
.164
266
.163
267
.162
268
.156
269
.138
270
.123
271
.106
272
.100
273
.098
274
.092
275
.085
276
.065
277
.062
278
.019
279
.000
280
.000
281
.000
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299

Benchmark bibtex

@article {Maniquet2024.04.02.587669,
	author = {Maniquet, Tim and de Beeck, Hans Op and Costantino, Andrea Ivan},
	title = {Recurrent issues with deep neural network models of visual recognition},
	elocation-id = {2024.04.02.587669},
	year = {2024},
	doi = {10.1101/2024.04.02.587669},
	publisher = {Cold Spring Harbor Laboratory},
	URL = {https://www.biorxiv.org/content/early/2024/04/10/2024.04.02.587669},
	eprint = {https://www.biorxiv.org/content/early/2024/04/10/2024.04.02.587669.full.pdf},
	journal = {bioRxiv}
}

Ceiling

0.54.

Note that scores are relative to this ceiling.

Data: Maniquet2024

Metric: confusion_similarity