Predicting parvalbumin expression from neuronal morphology using deep learning
This validator currently predicts parvalbumin (Pvalb) expression in mouse neocortical neurons using deep learning on 14 engineered morphological features. The model was trained on 7,500 neurons from NeuroMorpho.org and validated against the Allen Brain Atlas.
Coming soon: Multi-gene support (GAD1, GFAP, VIP, SST), region-specific models across cortex, hippocampus, and cerebellum, and cross-species validation for human and rat neurons.
NGEP (Neuron Gene Expression Prediction) is a computational framework for predicting cell-type-specific gene expression from neuronal 3D morphology. It uses a feedforward neural network trained on open-access data from NeuroMorpho.org and the Allen Brain Atlas to infer parvalbumin mRNA levels, demonstrating that neuronal shape carries significant information about molecular state.
The model explains approximately 40% of variance in Pvalb expression from morphology alone. Remaining variance is attributable to gene regulatory networks, epigenetic state, developmental history, and local circuit context. The robust performance across 10-fold stratified cross-validation (R² std = 0.027) confirms generalization capability, with the lowest fold still achieving R² = 0.34.
Five base measurements are extracted from each neuron's SWC reconstruction and expanded to 14 engineered features:
Support for additional genes beyond Pvalb including GAD1, GFAP, VIP, SST, and other markers for comprehensive cellular characterization and disease-relevant gene expression mapping.
Region-specific models for cortex, hippocampus, cerebellum, and other brain structures to improve prediction accuracy and enable circuit-level analysis across the whole brain.
Cross-species validation for mouse, human, rat, and other organisms with species-specific prediction models to enable translational research and patient-derived iPSC analysis.
Tools for identifying circuit dysfunction from morphological imaging to enable targeted therapy development and precision medicine applications in neuropsychiatric disorders.
Morphological features as accessible, imaging-based biomarkers for drug screening, therapy validation, and organoid quality control without requiring genomic assays.
Feature importance analysis to understand which morphological attributes drive expression, linking neuronal structure to molecular function at scale.