Sample stimuli

sample 0 sample 1 sample 2 sample 3 sample 4 sample 5 sample 6 sample 7 sample 8 sample 9

How to use

from brainscore_vision import load_benchmark
benchmark = load_benchmark("tong.Coggan2024_behavior-ConditionWiseAccuracySimilarity")
score = benchmark(my_model)

Model scores

Min Alignment Max Alignment

Rank

Model

Score

1
.844
2
.779
3
.707
4
.700
5
.689
6
.677
7
.673
8
.659
9
.636
10
.635
11
.628
12
.626
13
.619
14
.613
15
.611
16
.603
17
.597
18
.595
19
.591
20
.590
21
.590
22
.588
23
.588
24
.586
25
.575
26
.573
27
.571
28
.566
29
.558
30
.555
31
.554
32
.552
33
.552
34
.552
35
.551
36
.542
37
.542
38
.541
39
.540
40
.537
41
.536
42
.535
43
.534
44
.529
45
.526
46
.523
47
.519
48
.519
49
.515
50
.515
51
.507
52
.505
53
.504
54
.501
55
.497
56
.483
57
.482
58
.478
59
.476
60
.471
61
.469
62
.469
63
.467
64
.466
65
.465
66
.463
67
.460
68
.459
69
.457
70
.454
71
.453
72
.452
73
.449
74
.448
75
.448
76
.446
77
.434
78
.430
79
.425
80
.421
81
.412
82
.411
83
.409
84
.409
85
.407
86
.406
87
.405
88
.398
89
.397
90
.393
91
.392
92
.383
93
.383
94
.379
95
.374
96
.372
97
.372
98
.371
99
.371
100
.360
101
.359
102
.356
103
.351
104
.342
105
.339
106
.338
107
.333
108
.331
109
.331
110
.331
111
.329
112
.326
113
.323
114
.320
115
.320
116
.317
117
.317
118
.316
119
.315
120
.312
121
.311
122
.305
123
.303
124
.303
125
.300
126
.299
127
.298
128
.295
129
.293
130
.288
131
.284
132
.277
133
.273
134
.258
135
.258
136
.256
137
.251
138
.250
139
.236
140
.235
141
.229
142
.229
143
.228
144
.226
145
.226
146
.219
147
.216
148
.215
149
.214
150
.211
151
.211
152
.210
153
.207
154
.201
155
.198
156
.191
157
.186
158
.180
159
.180
160
.180
161
.170
162
.166
163
.165
164
.162
165
.162
166
.162
167
.159
168
.156
169
.154
170
.154
171
.153
172
.148
173
.144
174
.144
175
.143
176
.142
177
.139
178
.133
179
.133
180
.132
181
.132
182
.132
183
.129
184
.129
185
.129
186
.123
187
.123
188
.122
189
.113
190
.113
191
.113
192
.111
193
.111
194
.110
195
.110
196
.108
197
.106
198
.103
199
.101
200
.101
201
.099
202
.096
203
.096
204
.095
205
.092
206
.092
207
.092
208
.089
209
.079
210
.079
211
.078
212
.077
213
.077
214
.077
215
.064
216
.063
217
.062
218
.058
219
.052
220
.048
221
.044
222
.043
223
.038
224
.036
225
.035
226
.023
227
.023
228
.022
229
.021
230
.020
231
.019
232
.018
233
.012
234
.009
235
.007
236
.007
237
.007
238
.006
239
.006
240
.006
241
.006
242
.005
243
.005
244
.005
245
.004
246
.003
247
.003
248
.002
249
.002
250
.001
251
.000
252
.000
253
.000
254
.000
255
.002
256
.002
257
.002
258
.002
259
.004
260
.006
261
.007
262
.008
263
.009
264
.017
265
.023
266
.023
267
.033
268
.034
269
.067
270
.069
271
.142
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298

Benchmark bibtex

None

Ceiling

0.69.

Note that scores are relative to this ceiling.

Data: tong.Coggan2024_behavior

Metric: ConditionWiseAccuracySimilarity