Sample stimuli

How to use

from brainscore_vision import load_benchmark
benchmark = load_benchmark("Rajalingham2018-i2n")
score = benchmark(my_model)

Benchmark API

Code examples

Model scores

Score Legend Min Alignment Max Alignment

Rank	Model	Score
1	effnetb1_cutmix_augmix_sam_e1_5avg_424x377	.664
2	effnetb1_cutmixpatch_augmix_robust32_avge4e7_manylayers_324x288	.652
3	effnetb1_cutmixpatch_augmix_robust32_e5_324x288	.646
4	effnetb1_cutmixrespatch_SAM_robust32_e10_manylayers_324x288	.625
5	effnetb1_cutmixpatch_SAM_robust32_avge6e8e9e10_manylayers_324x288	.623
6	vit_large_patch14_clip_224:laion2b_ft_in12k_in1k	.617
7	effnetb1_cutmixrespatch_SAM_robust32_e2_manylayers_324x288	.608
8	vit_large_patch14_clip_224:laion2b_ft_in1k	.607
9	cvt_cvt-21-384-in22k_finetuned-in1k_4_LucyV4	.606
10	cvt_cvt-w24-384-in22k_finetuned-in1k_4	.600
11	swin_small_patch4_window7_224:ms_in22k_ft_in1k	.600
12	vit_large_patch14_clip_336:laion2b_ft_in1k	.596
13	resnext101_32x32d_wsl	.594
14	vit_huge_patch14_clip_224:laion2b_ft_in12k_in1k	.593
15	effnetv2m_custom384	.590
16	convnext_xlarge:fb_in22k_ft_in1k	.588
17	vision_transformer_vit_large_patch16_224	.585
18	resnet18_imagenet21kP	.585
19	vit_huge_patch14_clip_336:laion2b_ft_in12k_in1k	.584
20	convnext_large:fb_in22k_ft_in1k	.584
21	convnext_small_imagenet_full_seed-0	.581
22	convnext_large_imagenet_full_seed-0	.575
23	efficientnet-b0	.573
24	convnext_tiny_imagenet_full_seed-0	.573
25	resnet_152_v2	.573
26	convnext_large_mlp:clip_laion2b_augreg_ft_in1k_384	.573
27	convnext_tiny:in12k_ft_in1k	.570
28	deit_base_imagenet_full_seed-0	.564
29	cvt_cvt-13-384-in22k_finetuned-in1k_4_LucyV4	.564
30	vit_base_patch16_clip_224:openai_ft_in1k	.564
31	convnext_xxlarge:clip_laion2b_soup_ft_in1k	.563
32	vit_base_patch16_clip_224:openai_ft_in12k_in1k	.563
33	custom_model_cv_18_dagger_408	.562
34	resnet-101_v1-tf	.561
35	resnext101_32x48d_wsl	.561
36	fixres_resnext101_32x48d_wsl	.561
37	ReAlnet02	.560
38	ReAlnet02_cornet	.560
39	SWSL_resnet50	.560
40	resnet_50_v2	.560
41	vit_large_patch14_clip_224:openai_ft_in12k_in1k	.558
42	ReAlnet10_cornet	.555
43	ReAlnet10	.555
44	resnet-101_v2-tf	.555
45	resnet_101_v2	.554
46	grcnn_robust_v1	.554
47	deit_large_imagenet_full_seed-0	.554
48	resnext101_32x8d_wsl	.551
49	effnetb1_272x240	.549
50	resnet50_finetune_cutmix_e3_robust_linf8255_e0_247x234	.549
51	ReAlnet07	.549
52	ReAlnet07_cornet	.549
53	convnext_base_imagenet_full_seed-0	.549
54	resnet-34	.546
55	ReAlnet01_cornet	.546
56	ReAlnet01	.546
57	ReAlnet03	.545
58	ReAlnet03_cornet	.545
59	CORnet-S	.545
60	voneresnet-50-robust	.545
61	deit_small_imagenet_full_seed-0	.543
62	densenet-169-keras	.543
63	resnet_152_v1	.542
64	ReAlnet08	.541
65	ReAlnet08_cornet	.541
66	resnet50-SIN_IN	.541
67	ReAlnet06_cornet	.540
68	ReAlnet06	.540
69	inception_v4	.539
70	vit_relpos_base_patch16_clsgap_224:sw_in1k	.538
71	densenet-201-keras	.537
72	densenet-201	.537
73	resnet152_ecoset_full	.537
74	inception_v4-tf	.537
75	resnet50_finetune_cutmix_AVGe2e3_robust_linf8255_e0_247x234	.536
76	ReAlnet09_cornet	.536
77	ReAlnet09	.536
78	efficientnet-b4	.535
79	densenet-121-keras	.535
80	resnet50-vicreg	.534
81	vit_large_patch14_clip_224:openai_ft_in1k	.534
82	ReAlnet05_cornet	.534
83	ReAlnet05	.534
84	mobilenet_v2_0.75_224-tf	.533
85	resnet-152_v1-tf	.533
86	voneresnet-50	.532
87	inception_v1	.532
88	resnet-50_v2-tf	.531
89	voneresnet-50-non_stochastic	.530
90	resnet-152_v2-tf	.528
91	resnet-50-pytorch	.528
92	resnet50-sup	.528
93	resnet_50_v1	.528
94	yudixie_resnet18_imagenet1kpret_0_240719	.528
95	convnext_base:clip_laiona_augreg_ft_in1k_384	.528
96	resnet18_ecoset_full	.527
97	ReAlnet04_cornet	.527
98	ReAlnet04	.527
99	efficientnet-b2	.526
100	resnet-50_v1-tf	.526
101	mobilenet_v2_0.75_192-tf	.524
102	resnet-18_test_m	.524
103	resnet-18	.524
104	BiT-M-R50x3	.523
105	resnet50-SIN_IN_IN	.523
106	vit_large_patch14_clip_336:openai_ft_in12k_in1k	.523
107	densenet_201_pytorch	.522
108	vonegrcnn_52e_full	.522
109	resnet_101_v1	.521
110	grcnn_109	.521
111	mobilenet_v2_1.0_224-tf	.521
112	resnet50_ecoset_full	.521
113	resnet152_imagenet_full	.520
114	resnet50_imagenet_full	.520
115	grcnn	.520
116	resnet101_ecoset_full	.520
117	cvt_cvt-13-224-in1k_4_LucyV4	.519
118	inception_v1-tf	.518
119	vonegrcnn_52e_full	.518
120	resnet101	.517
121	vonegrcnn_47e	.517
122	vonegrcnn_62e_nobn	.515
123	resnet50_random_l2_perturb	.515
124	resnet-50-robust	.515
125	pnasnet_large-tf	.515
126	resnet50_random_linf8_perturb	.515
127	SWSL_resnext101_32x8d	.514
128	resnet101_imagenet_full	.513
129	resnet50-barlow	.513
130	efficientnet-b6	.513
131	pnasnet_large	.512
132	convnext_femto_ols:d1_in1k	.512
133	vgg_16	.512
134	vit_tiny_r_s16_p8_384:augreg_in21k_ft_in1k	.511
135	resnet50_byol	.511
136	CLIP_resnet50_float32	.511
137	CLIP_resnet50	.511
138	vonegrcnn_47e	.510
139	resnext101_32x16d_wsl	.509
140	BiT-M-R101x3	.509
141	resnet50_linf_4_robust	.508
142	xception-keras	.508
143	resnet50-moclr8deg	.507
144	efficientnet_b1_imagenet_full	.507
145	efficientnet_b2_imagenet_full	.506
146	cvt_cvt-13-384-in1k_4_LucyV4	.505
147	inception_v2-tf	.505
148	resnet34_imagenet_full	.504
149	mobilenet_v2_1.0_192-tf	.503
150	AT_efficientnet-b4	.503
151	inception_v3	.503
152	vonegrcnn_62e_nobn	.503
153	resnet_50_v1_spiking_l4a	.503
154	antialiased-rnext101_32x8d	.502
155	CLIP_ViT-B_32	.502
156	mobilenet_v1_1.0_224	.502
157	shufflenet_v2_x1_0	.500
158	mobilenet_v2_1.4_224-tf	.500
159	cv_18_dagger_408_pretrained	.500
160	mobilenet_v2_1.3_224-tf	.500
161	inception_resnet_v2	.499
162	resnet50_tutorial	.499
163	resnet_SIN_IN_FT_IN	.499
164	resnet34_ecoset_full	.499
165	BiT-M-R50x1	.499
166	focalnet_tiny_lrf_in1k	.498
167	omnivore_swinS	.498
168	VOneCORnet-S	.497
169	resnet50-vicregl0p9	.496
170	nasnet_mobile	.496
171	resnet50-vicregl0p75	.495
172	vgg-19-keras	.494
173	resnet-152_v2_pytorch	.493
174	efficientnet_b0	.492
175	nasnet_large	.491
176	antialias-resnet152	.490
177	convnext_tiny_sup	.488
178	mobilenet_v2_0.5_224-tf	.488
179	efficientnet_b0_imagenet_full	.488
180	antialiased-r50	.488
181	omnivore_swinB	.487
182	resnet50_robust_l2_eps1	.485
183	resnet50	.484
184	yudixie_resnet50_imagenet1kpret_0_240312	.481
185	yudixie_resnet50_imagenet1kpret_0_240908	.481
186	mobilenet_v1_1.0_160	.480
187	AT_efficientnet-b2	.480
188	Res2Net50_26w_4s	.479
189	resnet_50_v1_spiking	.478
190	BiT-S-R152x4	.478
191	cvt_cvt-21-384-in1k_4_LucyV4	.478
192	BiT-M-R152x4	.477
193	mobilenet_v1_0.75_224	.477
194	inception_v3-tf	.477
195	densenet-169	.476
196	vit_relpos_base_patch32_plus_rpn_256:sw_in1k	.475
197	AT_efficientnet-b7	.475
198	resnet50_l2_3_robust	.475
199	blt_vs	.474
200	resnet18_imagenet_full	.474
201	mobilenet_v2_0.35_224	.474
202	mobilenet_v1_0.5_224	.474
203	mobilenet_v2_0.75_160-tf	.473
204	BiT-M-R152x2	.472
205	mobilenet_v2_1.0_160-tf	.472
206	efficientnet-b7	.471
207	nasnet_large-tf	.470
208	cvt_cvt-21-224-in1k_4_LucyV4	.470
209	resnet50_moco_v2	.469
210	mobilenet_v1_1.0_192	.466
211	cornet_s	.465
212	mobilenet_v2_0.75_128	.464
213	BiT-S-R101x3	.462
214	grcnn_v2_text_noise	.461
215	vgg-16-keras	.461
216	mobilenet_v2_0_75_224	.458
217	densenet-121	.458
218	imagenet_l2_3_0	.456
219	resnet50_robust_l2_eps3	.456
220	mobilenet_v2_0.5_192-tf	.454
221	mobilenet_v1_0.5_192	.454
222	resnet50_linf_8_robust	.452
223	vgg_19	.451
224	mobilenet_v2_0.75_96	.451
225	BiT-S-R101x1	.449
226	mobilenet_v1_0.75_192	.449
227	mobilenet_v2_0.5_160	.448
228	AdvProp_efficientnet-b6	.448
229	resnet50-SIN	.448
230	AT_efficientnet-b0	.447
231	resnet50-VITO-8deg-cc	.447
232	mobilenet_v2_1.0_128-tf	.447
233	resnet18-supervised	.446
234	BiT-S-R152x2	.446
235	mobilenet_v2_1_4_224	.445
236	mobilenet_v2_1-4_224_pytorch	.445
237	AdvProp_efficientnet-b7	.445
238	BiT-S-R50x3	.444
239	mobilenet_v2_1.0_96	.443
240	resnet50-simclr-vissl	.443
241	xception	.441
242	mobilenet_v2_0.5_128	.440
243	resnet50_simclr	.438
244	regnet_y_400mf	.438
245	resnet50-vitoimagevidnet8	.437
246	mobilenet_v2_0.35_192	.437
247	mobilenet_v1_1.0_128	.437
248	BiT-S-R50x1	.435
249	omnivore_swinT	.434
250	BiT-M-R101x1	.433
251	AdvProp_efficientnet-b8	.433
252	mobilenet_v2_1_3_224	.428
253	yudixie_resnet18_category_class_0_240719	.428
254	mobilenet_v2_0_75_192	.427
255	mobilenet_v1_0.75_128	.425
256	deit_base_patch16_384_id	.425
257	convnext_small_imagenet_100_seed-0	.424
258	mobilenet_v2_0.35_160	.424
259	mobilenet_v2_1_0_192	.419
260	texture_shape_resnet50_trained_on_SIN	.415
261	mobilenet_v1_0.75_160	.413
262	mobilenet_v2_1_0_160	.413
263	mobilenet_v1_0.5_160	.410
264	AdvProp_efficientnet-b2	.410
265	resnet-18-LC_w_sh_1_iter_conv_init_m	.410
266	AdvProp_efficientnet-b4	.408
267	tf_efficientnetv2_s_in21ft1k_robust_linf12255_400x400	.407
268	nasnet_mobile-tf	.406
269	yudixie_resnet50_category_class_0_240908	.405
270	AdvProp_efficientnet-b0	.396
271	resnet-18-LC_w_sh_10_iter_conv_init_m	.395
272	tv_efficientnet-b1	.392
273	mobilenet_v2_0_75_160	.386
274	mobilenet_v2_1_0_128	.383
275	resnet-18-LC_w_sh_10_iter_m	.381
276	deit_base_patch16_224_id	.376
277	AlexNet_SIN	.375
278	mobilenet_v1_0.5_128	.373
279	mobilenet_v2_0_5_192	.372
280	alexnet_random_linf8_perturb	.371
281	mobilenet_v2_0.5_96	.370
282	alexnet_ks_torevert	.370
283	alexnet	.370
284	alexnet	.370
285	alexnet	.370
286	alexnet	.370
287	alexnet	.370
288	alexnet	.370
289	alexnet	.370
290	alexnet	.370
291	alexnet	.370
292	alexnet-baseline	.370
293	konkle_alexnetgn_ipcl_ref12_supervised_ipcl_aug	.370
294	mobilenet_v2_0.35_128	.367
295	alexnet_random_l2_3_perturb	.366
296	mobilenet_v2_0_5_224	.365
297	alexnet_l2_3_robust	.363
298	texture_shape_alexnet_trained_on_SIN	.362
299	resnet-18-LC_w_sh_100_iter_m	.360
300	alexnet	.359
301	CORnet-Z	.356
302	alexnet_early_checkpoint	.354
303	mobilenet_v2_0.35_96	.351
304	alexnet_reduced_aliasing_early_checkpoint	.348
305	cornetz_contrastive	.346
306	mobilenet_v1_0.25_192	.344
307	alexnet_linf_8_robust	.341
308	vonealexnet_gaussian_noise_std4_fixed	.341
309	ViT_B_16_imagenet1k	.335
310	resnet-18-LC_w_sh_1_iter_m	.334
311	mobilenet_v1_0.25_224	.333
312	ViT_L_16_imagenet1k	.333
313	alexnet_robust_correct	.332
314	mobilenet_v1_0.25_160	.330
315	resnet50	.330
316	ViT_L_32_imagenet1k	.324
317	resnet-18-LC_w_sh_100_iter_conv_init_m	.322
318	ViT_B_16	.320
319	resnet-18-LC_d_w_sh_1x1_conv_init_m	.311
320	imagenet_l2_10_0	.310
321	bagnet9	.307
322	resnet50_imagenet_100_seed-0	.305
323	squeezenet1_1	.291
324	ViT_L_32	.286
325	mobilenet_v1_0.25_128	.286
326	konkle_alexnetgn_ipcl_ref01_primary_model	.285
327	ViT-B/32	.284
328	deit_small_distilled_patch16_224_id	.283
329	dorinet_cornet_z	.279
330	ViT_B_32_imagenet1k	.276
331	barlow-twins-resnet50	.276
332	yudixie_resnet18_cat_obj_class_all_latents_0_240719	.270
333	ViT_B_32	.270
334	yudixie_resnet50_cat_obj_class_all_latents_0_240908	.265
335	squeezenet1_0	.263
336	yudixie_resnet18_object_class_0_240719	.261
337	deit_base_distilled_patch16_384	.256
338	deit_base_distilled_patch16_224	.256
339	deit_base_distilled_patch16_224_id	.256
340	deit_tiny_patch16_224	.256
341	deit_small_patch16_224	.256
342	deit_small_distilled_patch16_224	.256
343	deit_base_patch16_384	.256
344	deit_base_patch16_224	.256
345	deit_tiny_distilled_patch16_224	.256
346	0.5x_resnet-18_LC_w_sh_1_iter	.255
347	resnet-18-LC_conv_init_m	.254
348	resnet-18-LC_m	.250
349	deit_small_imagenet_100_seed-0	.245
350	deit_base_distilled_patch16_384_id	.243
351	r3m_resnet50	.243
352	yolos_tiny	.242
353	resnet18-simclr	.231
354	deit_small_patch16_224_id	.226
355	CLIP-RN50	.225
356	RN50	.220
357	RN50	.219
358	0.5x_resnet-18_LC_w_sh_10_iter	.216
359	r3m_resnet50_nocrop	.211
360	deit_tiny_patch16_224_id	.211
361	deit_tiny_distilled_patch16_224_id	.209
362	r3m_resnet18	.209
363	yudixie_resnet50_object_class_0_240908	.208
364	resnet_50_v1_spiking_l4	.200
365	resnet-18-LC_1st_conv_m	.187
366	yudixie_resnet50_distance_rotation_0_240908	.186
367	resnet50-meshes-lt-100-original-pretrained	.185
368	resnet18-local_aggregation	.177
369	FrankRobWobv0	.167
370	resnet50_imagenet_10_seed-0	.165
371	resnet18-contrastive_multiview	.161
372	r3m_resnet34	.160
373	yudixie_resnet50_distance_translation_rotation_0_240908	.157
374	r3m_resnet34_nocrop	.150
375	0.5x_resnet-18	.148
376	yudixie_resnet18_distance_rotation_0_240719	.144
377	yudixie_resnet18_distance_translation_rotation_0_240719	.137
378	alexnet_training_seed_02	.131
379	yudixie_resnet18_distance_reg_0_240719	.129
380	alexnet_training_seed_04	.127
381	yudixie_resnet50_distance_reg_0_240908	.116
382	alexnet_training_seed_10	.114
383	fitvid_trained_on_physion	.113
384	resnet50_FractalDB	.112
385	alexnet_training_seed_06	.108
386	0.5x_resnet-18_LC	.108
387	alexnet_training_seed_09	.104
388	mobilevit_small	.103
389	alexnet_training_seed_07	.103
390	resnet18-depth_prediction	.102
391	0.5x_resnet-18_LC_w_sh_100_iter	.101
392	alexnet_training_seed_01	.098
393	artResNet18_1	.096
394	yudixie_resnet50_translation_rotation_0_240908	.095
395	resnet50_imagenet_1_seed-0	.092
396	resnet18-instance_recognition	.090
397	resnet18-autoencoder	.083
398	resnet50-meshes-lt-100-original-scratch	.083
399	alexnet_training_seed_08	.083
400	yudixie_resnet18_translation_rotation_0_240719	.082
401	vggface	.078
402	CORnetZ_CIFAR10	.076
403	deit_small_imagenet_10_seed-0	.075
404	resnet-18_untrained	.067
405	alexnet_training_seed_05	.065
406	yudixie_resnet50_rotation_reg_0_240908	.061
407	resnet18-colorization	.060
408	resnet50-cifar	.054
409	yudixie_resnet18_rotation_reg_0_240719	.049
410	yudixie_resnet18_distance_translation_0_240719	.046
411	yudixie_resnet18_random_0_240719	.045
412	briaai_rmbg_1_4	.041
413	yudixie_resnet18_translation_reg_0_240719	.032
414	yudixie_resnet50_distance_translation_0_240908	.027
415	dcgan	.023
416	pixels	.020
417	fulltest_microblockvf_nobottleneck_freeat_nopretrain_eps2_m4	.020
418	CORnetZ_CIFAR10_bs32_20_04	.014
419	prednet	.014
420	resnet50_primary_visual_cortex	.012
421	resnet18-contrastive_predictive	.010
422	convnext_small_imagenet_10_seed-0	.009
423	v1-pyr-nodown	.009
424	resnet_50_v1_spiking_l3	.009
425	my-model	.004
426	yudixie_resnet50_translation_reg_0_240908	-0.002
427	resnet18-relative_position	-0.006
428	unet_entire	-0.006
429	deit_small_imagenet_1_seed-0	-0.010
430	yudixie_resnet50_random_0_240908	-0.016
431	resnet50_pretrained_with_retinal_waves	-0.053
432	my-model	-0.053
433	resnet-18-LC_untrained	-0.055
434	convnext_small_imagenet_1_seed-0	-0.067
435	resnet18-deepcluster
436	inception_v3_pytorch
437	mobilenet_v2_1_0_224
438	pnasnet_large_pytorch
439	hmax
440	MIM
441	TAU
442	SimVP
443	PredRNN
444	ConvLSTM
445	alexnet_training_seed_03
446	resnet-50_untrained
447	resnet-50x2_untrained
448	openclip
449	resnet50_ImageNet

Benchmark bibtex

@article {Rajalingham240614,
                author = {Rajalingham, Rishi and Issa, Elias B. and Bashivan, Pouya and Kar, Kohitij and Schmidt, Kailyn and DiCarlo, James J.},
                title = {Large-scale, high-resolution comparison of the core visual object recognition behavior of humans, monkeys, and state-of-the-art deep artificial neural networks},
                elocation-id = {240614},
                year = {2018},
                doi = {10.1101/240614},
                publisher = {Cold Spring Harbor Laboratory},
                abstract = {Primates{	extemdash}including humans{	extemdash}can typically recognize objects in visual images at a glance even in the face of naturally occurring identity-preserving image transformations (e.g. changes in viewpoint). A primary neuroscience goal is to uncover neuron-level mechanistic models that quantitatively explain this behavior by predicting primate performance for each and every image. Here, we applied this stringent behavioral prediction test to the leading mechanistic models of primate vision (specifically, deep, convolutional, artificial neural networks; ANNs) by directly comparing their behavioral signatures against those of humans and rhesus macaque monkeys. Using high-throughput data collection systems for human and monkey psychophysics, we collected over one million behavioral trials for 2400 images over 276 binary object discrimination tasks. Consistent with previous work, we observed that state-of-the-art deep, feed-forward convolutional ANNs trained for visual categorization (termed DCNNIC models) accurately predicted primate patterns of object-level confusion. However, when we examined behavioral performance for individual images within each object discrimination task, we found that all tested DCNNIC models were significantly non-predictive of primate performance, and that this prediction failure was not accounted for by simple image attributes, nor rescued by simple model modifications. These results show that current DCNNIC models cannot account for the image-level behavioral patterns of primates, and that new ANN models are needed to more precisely capture the neural mechanisms underlying primate object vision. To this end, large-scale, high-resolution primate behavioral benchmarks{	extemdash}such as those obtained here{	extemdash}could serve as direct guides for discovering such models.SIGNIFICANCE STATEMENT Recently, specific feed-forward deep convolutional artificial neural networks (ANNs) models have dramatically advanced our quantitative understanding of the neural mechanisms underlying primate core object recognition. In this work, we tested the limits of those ANNs by systematically comparing the behavioral responses of these models with the behavioral responses of humans and monkeys, at the resolution of individual images. Using these high-resolution metrics, we found that all tested ANN models significantly diverged from primate behavior. Going forward, these high-resolution, large-scale primate behavioral benchmarks could serve as direct guides for discovering better ANN models of the primate visual system.},
                URL = {https://www.biorxiv.org/content/early/2018/02/12/240614},
                eprint = {https://www.biorxiv.org/content/early/2018/02/12/240614.full.pdf},
                journal = {bioRxiv}
            }

Ceiling

0.48.

Note that scores are relative to this ceiling.

Data: Rajalingham2018

240 stimuli match-to-sample task

Metric: i2n