Skip to main content
  • Poster presentation
  • Open access
  • Published:

Cortico-striatal plasticity for action-outcome learning using spike timing dependent eligibility

Introduction

We recently proposed that short-latency, sensory-evoked dopamine release is critical for learning action-outcome causality [1]. If an action causes an unexpected outcome associated with a phasic visual event, there will be a phasic burst of dopamine in the striatum. Subsequent reinforcement of the striatal response to the cortical representation of the action then makes the selection of the action (and its outcome) more likely; i.e. there is "repetition biasing" of action selection. This, in turn, facilitates associative learning of the action-outcome pairing elsewhere in the brain. Here, we present a model of cortico-striatal plasticity in medium spiny neurons (MSNs) that could form the basis for a quantitative account of action-outcome learning in basal ganglia.

Methods

We used an Izhikevich-style spiking model MSN with 200 synapses. We constructed new cortico-striatal learning rules based on a recent in vitro study by Shen et al [2]. This study provided, for the first time, comprehensive data on MSN plasticity in the D1 and D2 receptor-dominated MSN subpopulations. For each population (D1/D2) we ascribed STDP-like kernel "templates" to very high and low levels of dopamine in a manner consistent with the data. At intermediate levels of dopamine, the kernels were formed from a linear superposition of these templates. These kernels then gave rise to an eligibility trace for learning that induced plasticity in the presence of subsequent delivery of dopamine [3]. We refer to this mechanism as spike-timing dependent eligibility: STDE. We then mimicked the cortical and dopaminergic signals that an MSN might see during action-outcome learning. Each cortical input comprised 50 highly active afferents with others at a background rate. The selection of the active afferents was fixed for the causal action, and chosen randomly for other actions at each trial (see Figure 1a).

Figure 1
figure 1

a. MSN spike count in response to causal, and other, actions over trials. The phasic DA dip occurs after assigning a noxious value to the outcome of the causal action. b. Synaptic conductances at key trials. Only the first 50 synapses contribute to the action representation.

Results

When phasic dopamine is elicited by the causal action, it induced a rapid increase in MSN response that would be the foundation for inducing repetition bias of action selection. Further, the MSN has become receptive to the action request through synaptic pattern matching (Fig 1b top panel). Subsequent trials induce more selective synaptic patterning (reduced response at trial 1200). Dopamine dips (caused by assigning a noxious value to the outcome) induce a reduction in response. We conclude that the recently discovered complex dopamine-receptor dependent forms of STDP [2] can lead to cortico-striatal plasticity that can support action-outcome learning.

References

  1. Redgrave P, Gurney KN: The short-latency dopamine signal: a role in discovering novel actions?. Nat Rev Neurosci. 2006, 7: 967-975. 10.1038/nrn2022.

    Article  CAS  PubMed  Google Scholar 

  2. Shen W, Flajolet M, Greengard P, Surmeier DJ: Dichotomous dopaminergic control of striatal synaptic plasticity. Science. 2008, 321: 848-851. 10.1126/science.1160575.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  3. Izhikevich EM: Solving the distal reward problem through linkage of STDP and dopamine signaling. Cereb Cortex. 2007, 17: 2443-2452. 10.1093/cercor/bhl152.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

This work was part funded by EPSRC grant EP/C516303/1

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kevin N Gurney.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Gurney, K.N., Humphries, M.D. & Redgrave, P. Cortico-striatal plasticity for action-outcome learning using spike timing dependent eligibility. BMC Neurosci 10 (Suppl 1), P135 (2009). https://doi.org/10.1186/1471-2202-10-S1-P135

Download citation

  • Published:

  • DOI: https://doi.org/10.1186/1471-2202-10-S1-P135

Keywords