meg-things_meg_1-vit_b_32

Model Summary

Modality	MEG
Training Dataset	THINGS MEG1
Species	Human
Stimuli	Images
Model Type	Vision transformer (ViT-B/32)
Creator	Domenic Bersch

Description

This encoding models consist of a linear mapping through linear regression of a vision transformer (Dosovitskiy et al., 2020) image features onto whole-brain magnetoencephalography (MEG) responses from the THINGS MEG1 dataset (Hebart et al., eLife 2023). The model provides features from all 12 transformer layers, using the full set of patch tokens per layer to represent each stimulus image. For each image stimulus, features are concatenated across all spatial tokens and reduced to 250 principal components via principal-component analysis (PCA). These reduced features serve as predictors for MEG responses. The encoding models are trained on either the full training data, or on four independent training data random splits.

Neural data. Encoding models were trained on the preprocessed data preparation provided in THINGS MEG1. MEG data were recorded from four human participants (P1–P4) viewing 1,854 object categories from the THINGS database (~22,000 naturalistic object images). Recordings were acquired with 271 sensors at 200 Hz, epoched from −100 ms to +1300 ms relative to stimulus onset. The preprocessing pipeline included band-pass filtering (0.1–40 Hz), epoching, baseline correction (−100 to 0 ms), and exclusion of malfunctioning sensors.

Model training partition. Single-trial responses to approximately 22,000 unique naturalistic images were used for training. One set of encoding models are trained on the full training data. Another set of encoding models are trained on four independent training data random splits (of 5,562 trials each), therefore generating four different in silico MEG response predictions (i.e., repetitions) per image. A unique PCA random seed is derived for each combination of subject and training split, ensuring independent PCA bases across encoding models.

Model testing partition. 200 test images, each repeated 12 times, were used for evaluation; the target responses correspond to the average MEG activity across repetitions.

Training procedure. Independent linear regression models were fitted separately for each MEG sensor and time point, predicting the sensor’s time-resolved response from the feature vectors. The resulting model weights provide a spatiotemporal mapping from visual features to MEG sensor activity.

Noise ceiling. The noise ceiling was computed from the 12 repeated presentations of each of the 200 test image, following the analytical procedure described in the Natural Scenes Dataset (NSD) paper (Allen et al., 2022).

Output. Each trained model predicts time-resolved MEG responses for all 271 sensors (or user-specified subsets) across 281 time points (−100 to +1300 ms) for each input image.

Metadata

meg

times : (281,) - Time points (e.g., -0.1 to 1.3s relative to stimulus onset)

subject_id : str - Subject identifier

sensors

sensor_names : (271,) - MEG sensor name strings

sensor_prefixes : (271,) - Sensor prefixes (e.g., ‘MLF’, ‘MRC’, ‘MZO’)

sensor_hemispheres : (271,) - Hemisphere labels (‘Left’, ‘Right’, ‘Midline’)

sensor_regions : (271,) - Region labels (‘Frontal’, ‘Central’, ‘Parietal’, ‘Temporal’, ‘Occipital’)

n_sensors : int - Number of MEG sensors (271)

encoding_model

all_training_splits: Training data and encoding accuracy results for encoding models trained on all training splits

train_img_ids : (22248,) - THINGS image IDs for train trials

train_concepts : (22248,) - Object category IDs for train trials

train_stimuli : (22248,) - Image filenames for train trials

train_sessions : (22248,) - Session numbers for train trials

train_runs : (22248,) - Run numbers for train trials

train_img_files : (22248,) - Full image paths for train trials

correlation_results : (271, 281) - Prediction accuracy (Pearson’s r)

percent_noise_ceiling : (271, 281) - Noise ceiling normalized prediction accuracy (% of noise ceiling)

single_training split_{N}: Training data and encoding accuracy results for encoding models trained on training split N (N=1,2,3,4)

train_img_ids : (5562,) - THINGS image IDs for train trials

train_concepts : (5562,) - Object category IDs for train trials

train_stimuli : (5562,) - Image filenames for train trials

train_sessions : (5562,) - Session numbers for train trials

train_runs : (5562,) - Run numbers for train trials

train_img_files : (5562,) - Full image paths for train trials

correlation_results : (271, 281) - Prediction accuracy (Pearson’s r)

percent_noise_ceiling : (271, 281) - Noise ceiling normalized prediction accuracy (% of noise ceiling)

test_img_ids : (2400,) - THINGS image IDs for test trials

test_stimuli : (2400,) - Image filenames for test trials

test_concepts : (2400,) - Object category IDs for test trials

test_image_nr : (2400,) - Test image numbers (1–200, repeated over repetitions)

test_sessions : (2400,) - Session numbers for test trials

test_runs : (2400,) - Run numbers for test trials

test_img_files : (2400,) - Full image paths for test images

ncsnr : (271, 281) - Noise ceiling signal-to-noise ratio

noise_ceiling : (271, 281) - Noise ceiling

Input

Type	`numpy.ndarray`
Shape	`['batch_size', 3, 'height', 'width']`
Description	The input should be a batch of RGB images.
Constraints	Image values should be integers in range [0, 255]. Image dimensions (height, width) should be equal (square). Minimum recommended image size: 224×224 pixels.

Output

Type	`numpy.ndarray`
Shape	`[batch_size, n_sensors, n_timepoints] or [batch_size, repeats, n_sensors, n_timepoints]`
Description	The output is a 3D or 4D array containing in silico MEG responses.
Dimensions	batch_size: Number of stimuli in the batch. repeats: Number of simulated repetitions of the same stimulus (always 4; only applies when using the encoding models trained on single training data splits). n_sensors: Number of MEG sensors (up to 271, based on the number of sensors selected). n_timepoints: Number of time points in the MEG epoch (up to 281, based on the number of time points selected).

Parameters

Parameters used in `get_encoding_model`

This function loads the encoding model.

model_id	Type: str Required: Yes Description: Unique identifier of the model to load. Valid Values: meg-things_meg_1-vit_b_32 Example: “meg-things_meg_1-vit_b_32”
subject	Type: int Required: Yes Description: Subject ID from the THINGS MEG1 dataset. Valid Values: 1, 2, 3, 4 Example: 1
train_splits	Type: str Required: No Description: Specifies the training data split on which the encoding model is trained. - “all”: Use an encoding model trained on all traning data splits. - “single”: Use encoding models trained on four independent training data random splits, therefore generating four different in silico MEG response predictions (i.e., repetitions) per image. Valid Values: “all”, “single” Example: “single”
selection	Type: dict Required: No Description: Specifies which outputs to include in the model responses. Can include specific senors and/or timepoints. If not provided, MEG responses are generated for all MEG sensors and time points. Properties: region Type: list[str] Description: List of anatomical sensor-region labels to include. Each region groups multiple MEG sensors located over a specific cortical area: • Central – midline motor/somatosensory sensors • Frontal – sensors over frontal cortex • Occipital – sensors over visual cortex • Parietal – sensors over parietal cortex • Temporal – sensors over temporal lobes If multiple regions are listed, their sensors are concatenated. Valid values: “Central”, “Frontal”, “Occipital”, “Parietal”, “Temporal” Example: [‘Central’, ‘Frontal’, ‘Occipital’] sensors Type: list[str] Description: List of MEG sensor prefix codes to include. Each code identifies a cluster of planar gradiometers or magnetometers based on hemisphere (L = left, R = right, Z = midline) and cortical region: • MLC / MRC – Central (motor/somatosensory) • MLF / MRF – Frontal • MLO / MRO – Occipital (visual) • MLP / MRP – Parietal • MLT / MRT – Temporal • MZC / MZF / MZO / MZP – Midline (central, frontal, occipital, parietal) Sensors sharing a prefix (e.g., “MLF”) are typically grouped together for regional analyses or dimensionality reduction. Valid values: “MLC”, “MLF”, “MLO”, “MLP”, “MLT”, “MRC”, “MRF”, “MRO”, “MRP”, “MRT”, “MZC”, “MZF”, “MZO”, “MZP” Example: [‘MLC’, ‘MLF’, ‘MLO’] sensor_index Type: numpy.ndarray Description: Binary one-hot encoded vector indicating which sensors to include. Must have exactly the same length as the number of available sensors (271). Each position set to 1 indicates that sensor should be included. Example: [0, 0, ‘…’, 1, 1, 0] timepoints Type: numpy.ndarray Description: Binary one-hot encoded vector indicating which timepoints to include. Length must equal the number of time samples (281). Each 1 indicates a selected timepoint. Example: [0, 0, ‘…’, 1, 1, 0]
device	Type: str Required: No Description: Device to run the model on. ‘auto’ will use CUDA if available, otherwise CPU. Valid Values: “cpu”, “cuda”, “auto” Example: “auto”

Parameters used in `encode`

This function generates in silico neural responses using the encoding model previously loaded.

model	Type: BaseModelInterface Required: Yes Description: An instantiated and loaded encoding model.
stimulus	Type: numpy.ndarray Required: Yes Description: A batch of RGB images to be encoded. Images should be in integer format with values in the range [0, 255], and square dimensions (e.g. 224×224). Example: “An array of shape [100, 3, 224, 224] representing 100 RGB images.”
return_metadata	Type: bool Required: No Description: Whether to return the encoding model’s metadata together with the in silico neural resposnes. Example: True
show_progress	Type: bool Required: No Description: Whether to show a progress bar during encoding (for large batches). Example: True

Parameters used in `get_model_metadata`

This function loads the encoding model’s metadata without having to load the model itself.

model_id	Type: str Required: Yes Description: Unique identifier of the model to load. Valid Values: meg-things_meg_1-vit_b_32 Example: “meg-things_meg_1-vit_b_32”
subject	Type: int Required: Yes Description: Subject ID from the THINGS MEG1 dataset. Valid Values: 1, 2, 3, 4 Example: 1

Performance

Accuracy Plots (AWS directory):

brain-encoding-response-generator/encoding_models/modality-meg/train_dataset-things_meg_1/model-vit_b_32/encoding_models_accuracy

Example Usage

from berg import BERG

# Initialize BERG
berg = BERG(berg_dir="path/to/brain-encoding-response-generator")

# Load the model
model = berg.get_encoding_model(
    "meg-things_meg_1-vit_b_32",
    subject=1,
    train_splits="single",
    selection={
        "region": ["Central", "Frontal", "Occipital"],
        "sensors": ["MLC", "MLF", "MLO"],
        "sensor_index": [0, 0, '...', 1, 1, 0],
        "timepoints": [0, 0, '...', 1, 1, 0]
    }
)

# Prepare the stimulus images
# Image shape should be [batch_size, 3 RGB channels, height, width]
stimulus = np.random.randint(0, 255, (100, 3, 256, 256))

# Generates the in silico neural responses using the encoding model previously loaded
responses = berg.encode(
    model,
    stimulus,
    show_progress=True
)

# The in silico fMRI responses will be a numpy.ndarray of shape:
# [batch_size, n_sensors, n_timepoints] or [batch_size, repeats, n_sensors, n_timepoints]
# where:
# - repeats: Number of simulated repetitions of the same stimulus (always 4; only applies when using the encoding models trained on single training data splits).
# - n_sensors: Number of MEG sensors (up to 271, based on the number of sensors selected).
# - n_timepoints: Number of time points in the MEG epoch (up to 281, based on the number of time points selected).

# Generate in silico neural responses with metadata
responses, metadata = berg.encode(
    model,
    stimulus,
    return_metadata=True
)

# Load the encoding model's metadata without having to load the model itself
metadata = berg.get_model_metadata(
    "meg-things_meg_1-vit_b_32",
    subject=1
)

References

Model building code: https://github.com/gifale95/BERG/tree/main/berg_creation_code/02_train_encoding_models/train_dataset-things_meg_1/model-vit_b_32
THINGS MEG & fMRI Paper (Hebart et al., 2023): https://doi.org/10.7554/eLife.82580
THINGS MEG & fMRI Data (Hebart et al., 2023): https://plus.figshare.com/collections/_/6161151
THINGS initiative (Hebart et al., 2019): https://things-initiative.org/
ViT-B/32 (Dosovitskiy et al., 2020): https://arxiv.org/abs/2010.11929