Unsupervised pretraining in biological neural networks

Vision

Thousands of neurons

Behavior

Unsupervised learning - exposure to stimuli without rewards - drives large changes in neural activity in visual cortex, particularly in higher order medial visual areas.

Author

Lin Zhong, Scott Baptista, Rachel Gattoni, Jon Arnold, Daniel Flickinger, Carsen Stringer^†, Marius Pachitariu^†

Published

June 1, 2025

Abstract

Representation learning in neural networks may be implemented with supervised or unsupervised algorithms, distinguished by the availability of instruction. In the sensory cortex, perceptual learning drives neural plasticity, but it is not known whether this is due to supervised or unsupervised learning. Here we recorded populations of up to 90,000 neurons simultaneously from the primary visual cortex (V1) and higher visual areas (HVAs) while mice learned multiple tasks, as well as during unrewarded exposure to the same stimuli. Similar to previous studies, we found that neural changes in task mice were correlated with their behavioural learning. However, the neural changes were mostly replicated in mice with unrewarded exposure, suggesting that the changes were in fact due to unsupervised learning. The neural plasticity was highest in the medial HVAs and obeyed visual, rather than spatial, learning rules. In task mice only, we found a ramping reward-prediction signal in anterior HVAs, potentially involved in supervised learning. Our neural results predict that unsupervised learning may accelerate subsequent task learning, a prediction that we validated with behavioural experiments.

Thread by Lin Zhong:

Simple question: How do we learn? Answer: From teachers (supervised).

Sure! But we also learn a lot on our own (unsupervised), and so do our mice.

We developed a virtual reality (VR) task in which mice discriminated textures in order to get reward (supervised cohort) OR they just ran for FUN (unsupervised cohort).
Mice learned to discriminate textures by licking in the corridor with reward.
We recorded up to 90,000 neurons from the visual cortex during learning to try to understand the neural mechanism. Neural activities are visualized using our sorting algorithm Rastermap with behavioral annotations.
We found that the plasticity in medial visual areas was mediated by unsupervised learning.
Mice correctly generalized the reward rule to new stimuli based on the visual similarities, behaviorally and neurally.
Mice learned to discriminate two very similar textures (leaf1 vs leaf2) by orthogonalizing them in the neural space
Learning that only leaf1 was rewarded results in de-orthogonalization of another new leaf (leaf3).
Question: Wait! We don’t need supervised learning at all? I will say no to my supervisor if that is true 🙃.
Our results suggest I should think twice before doing that: inside the anterior visual areas we found a representation only in the supervised learning task, which can predict the reward and was highly correlated with behavior.
Question: What does unsupervised learning do? One possible answer is to pre-train our neural network for subsequent tasks. Indeed, we show that mice learned much faster after experiencing unsupervised pretraining!
What is more, we found that V1 and lateral visual areas can encode novelty when seeing a new stimulus after learning. The novelty responses went away after mice got familiar with the new stimulus.
Our results show:

Most learning is through unsupervised learning, mediated by medial visual areas
Supervised learning may require anterior visual areas
A third stream (V1 + lateral) encodes novelty in both supervised and unsupervised learning