Available Models
This page provides an overview of the brain encoding models currently available in BERG.
Model Naming Convention
BERG contains several encoding models, defined by the following model ID naming convention:
{modality}-{dataset}-{model}
where
modality: The neural recording modality on which the encoding model was trained.dataset: The neural dataset on which the encoding model was trained.model: The type of encoding model used.
For example:
fmri-nsd-fwrf: An fMRI encoding model trained on the NSD using feature-weighted receptive fields.eeg-things_eeg_2-vit_b_32: An EEG model trained on the THINGS-EEG2 dataset using the ViT-B/32 visual transformer architecture.
Model Naming Convention (BrainScore models)
BERG additionally contains a special class of encoding models that are trained on BrainScore benchmarks. These models have a slightly different naming convention:
brainscore_{modality}-{model}
where
modality: The neural recording modality of the BrainScore benchmark on which the encoding model was trained (i.e. vision, language).model: The specific BrainScore model used (e.g. AlexNet, GPT2-XL).
For example:
brainscore_vision-alexnet: A vision model trained on the BrainScore vision benchmark using the AlexNet architecture.brainscore_language-gpt2: A language model trained on the BrainScore language benchmark using GPT-2.
Get Model Information
You can get detailed information about any model using:
from berg import BERG
berg = BERG("path/to/brain-encoding-response-generator")
# List all available models
all_models = berg.list_models()
# Get detailed model information
model_info = berg.describe("fmri-nsd-fwrf")
Available models
Following is a list of all available models, grouped by modality. The ✅ icon indicates the best model for each dataset.
modality-fmri
Encoding models trained on neural responses recorded with functional Magnetic Resonance Imaging (fMRI).
Best model |
Model ID |
Description |
Training dataset |
Species |
Stimuli |
Encoding accuracy |
|---|---|---|---|---|---|---|
✅ |
Mapping of vision transformer image features onto fMRI responses. |
Natural Scenes Dataset (surface space) |
Human |
Images |
||
Linear mapping of vision transformer image features onto fMRI responses. |
Natural Scenes Dataset (surface space) |
Human |
Images |
|||
Linear mapping of AlexNet image features onto fMRI responses. |
Natural Scenes Dataset (surface space) |
Human |
Images |
|||
Linear mapping of an untrained AlexNet image features onto fMRI responses. |
Natural Scenes Dataset (surface space) |
Human |
Images |
|||
✅ |
Feature-weighted receptive fields, convolutional neural networks trained end-to-end to predict fMRI responses from input images. |
Natural Scenes Dataset (volume space) |
Human |
Images |
||
✅ |
Linear mapping of 3D CNN video features onto fMRI responses. |
BOLD Moments Dataset (MNI152 volume space) |
Human |
Videos |
||
✅ |
Linear mapping of vision transformer image features onto whole-brain fMRI responses. |
THINGS fMRI1 |
Human |
Images |
||
✅ |
CNN predicting visual cortex responses (7,831 vertices) for 93 subjects across 8 datasets. |
MOSAIC (all datasets) |
Human |
Images |
||
✅ |
CNN predicting whole-cortex responses (57,051 vertices) for 8 NSD subjects. |
MOSAIC (NSD) |
Human |
Images |
||
GPT2-XL–based linear encoding model (LLM embeddings + ridge regression) predicting brain responses to sentences. |
Human |
Text |
||||
Transformer-based encoding model predicting whole-brain fMRI from video transcripts using LLM features |
Human |
Text |
||||
✅ |
Multimodal fMRI encoding model fusing video, audio, and text features via a two-stage Transformer architecture |
Human |
Video + Audio + Text |
|||
✅ |
Gateway to 10+ GPT-family language models from BrainScore, mapped to human fMRI responses (9 subjects, 384 sentences) via PLS regression. Model weights hosted by BrainScore — not BERG. |
Human |
Text |
|||
✅ |
Tri-modal (video, audio, language) Transformer encoding model predicting whole-brain fMRI (20,484 vertices). First place in Algonauts 2025 (263 teams). |
Human |
Video + Audio + Text |
|||
✅ |
OPT-1.3B–based linear encoding model (contextual LLM embeddings + ridge regression) predicting brain responses to spoken narratives, following Antonello et al., 2023. |
Human |
Text |
modality-eeg
Encoding models trained on neural responses recorded with Electroencephalography (EEG).
Best model |
Model ID |
Description |
Training dataset |
Species |
Stimuli |
Encoding accuracy |
|---|---|---|---|---|---|---|
✅ |
Linear mapping of vision transformer image features onto EEG responses. |
THINGS EEG2 |
Human |
Images |
||
Linear mapping of AlexNet image features onto EEG responses. |
THINGS EEG2 |
Human |
Images |
|||
Linear mapping of an untrained AlexNet image features onto EEG responses. |
THINGS EEG2 |
Human |
Images |
modality-meg
Encoding models trained on neural responses recorded with Magnetoencephalography (MEG).
Best model |
Model ID |
Description |
Training dataset |
Species |
Stimuli |
Encoding accuracy |
|---|---|---|---|---|---|---|
✅ |
Linear mapping of vision transformer image features onto time-resolved whole-brain MEG responses. |
THINGS MEG1 |
Human |
Images |
modality-utah_array
Encoding models trained on neural responses recorded with Utah arrays (intracortical electrophysiology).
Best model |
Model ID |
Description |
Training dataset |
Species |
Stimuli |
Encoding accuracy |
|---|---|---|---|---|---|---|
✅ |
Linear mapping of vision transformer image features onto time-resolved intracortical spiking activity. |
THINGS Ventral Stream Spiking Dataset (TVSD) |
Macaque |
Images |
modality-calcium_2p
Encoding models trained on neural responses recorded with two-photon calcium imaging.
Best model |
Model ID |
Description |
Training dataset |
Species |
Stimuli |
Encoding accuracy |
|---|---|---|---|---|---|---|
✅ |
Foundation model of mouse visual cortex, based on a spatiotemporal convolutional neural network (3D CNN + ConvLSTM). |
Mouse |
Videos |
modality-ephys
Encoding models trained on extracellular electrophysiology recordings from macaque visual cortex.
Best model |
Model ID |
Description |
Training dataset |
Species |
Stimuli |
Encoding accuracy |
|---|---|---|---|---|---|---|
✅ |
Gateway to 440+ vision models from BrainScore, mapped to macaque electrophysiology recordings (V1, V2, V4, IT) via PLS regression. Model weights hosted by BrainScore — not BERG. |
Freeman et al., 2013 (V1, V2); Majaj et al., 2015 (V4, IT) |
Macaque |
Images |
modality-ecog
Encoding models trained on neural responses recorded with electrocorticography (ECoG).
Best model |
Model ID |
Description |
Training dataset |
Species |
Stimuli |
Encoding accuracy |
|---|---|---|---|---|---|---|
✅ |
Linear mapping of GPT-2 XL contextual word embeddings onto time-resolved high-gamma ECoG activity during natural speech comprehension. |
Human |
Text |