Scores on benchmarks

Model rank shown below is with respect to all public models.
Benchmark                                  Score  Rank
average_language (5 benchmarks)            .451   3
  neural_language (4 benchmarks)           .471   5
    Pereira2018-ridge (2 benchmarks)       .675   12
      Pereira2018.243sentences-ridge v1    .731   11
      Pereira2018.384sentences-ridge v1    .619   9
    Blank2014-ridge v1                     .109   5
    Fedorenko2016-ridge v3                 .630   4
  behavior_language (1 benchmark)          .431   1
    Futrell2018-pearsonr v1 [reference]    .431   1
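
The parent scores listed here appear consistent with an unweighted mean of their direct children, for example (.731 + .619) / 2 = .675 for Pereira2018-ridge. The Python sketch below checks that reading against the numbers on this page. The nested-dict layout and the `aggregate` helper are illustrative assumptions, not part of the Brain-Score API, and the agreement can only be verified to the three decimals shown.

```python
from statistics import mean

# Leaf scores copied from this page. The nesting follows the benchmark counts
# listed for each group (an assumption about how the page groups them).
SCORES = {
    "average_language": {
        "neural_language": {
            "Pereira2018-ridge": {
                "Pereira2018.243sentences-ridge": 0.731,
                "Pereira2018.384sentences-ridge": 0.619,
            },
            "Blank2014-ridge": 0.109,
            "Fedorenko2016-ridge": 0.630,
        },
        "behavior_language": {
            "Futrell2018-pearsonr": 0.431,
        },
    },
}


def aggregate(node):
    """Return a leaf score unchanged, or the unweighted mean of a group's children."""
    if isinstance(node, dict):
        return mean(aggregate(child) for child in node.values())
    return node


def report(name, node, depth=0):
    """Print each group or benchmark with its (re)computed score, indented by depth."""
    print(f"{'  ' * depth}{name}: {aggregate(node):.3f}")
    if isinstance(node, dict):
        for child_name, child in node.items():
            report(child_name, child, depth + 1)


for top_name, top_node in SCORES.items():
    report(top_name, top_node)
```

If a recomputed group score differed from the listed one by more than rounding error, that would suggest the page uses a different aggregation (for instance a weighted mean), so the sketch is best read as a consistency check rather than a description of the scoring pipeline.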

How to use

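from brainscore_language import load_model

# Load the model by its Brain-Score identifier.
model = load_model("falcon-7b")
model.start_task(...)       # choose a behavioral task
model.start_recording(...)  # choose a recording target
model.look_at(...)          # present stimuli and collect the outputs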

Brain Encoding Response Generator (BERG)

BERG lets you generate neural responses to sentences of your choice using any Brain-Score language model.

For more information on how to use BERG, see the documentation and tutorial.

Benchmarks bibtex

@inproceedings{futrell2018natural,
  title={The Natural Stories Corpus},
  author={Futrell, Richard and Gibson, Edward and Tily, Harry J. and Blank, Idan and Vishnevetsky, Anastasia and
          Piantadosi, Steven T. and Fedorenko, Evelina},
  booktitle={International Conference on Language Resources and Evaluation (LREC)},
  url={http://www.lrec-conf.org/proceedings/lrec2018/pdf/337.pdf},
  year={2018}
}

Layer Commitment

No layer commitments found for this model. Older submissions might not have stored this information but will be updated when evaluated on new benchmarks.

Visual Angle

Not specified.