- Barto, AG, Sutton, RS and Anderson, (_;W. 1EEE 7ransacho~,s on Systems, MaTl, and Cyber~et~cs, 13, 834 (1983).Google Scholar
- Barto, AG, Sutton, RS, Watkins, CJCH, Techmcal Report 89-95, (Computer and Information Science, University of Massachusetts, Amherst, MA, 1989Google Scholar
- Bellman, R (1957) D.q~ctmtc Programming. Princeton: Princeton U. Press.Google Scholar
- Dayan, P (1994). Computational modelling. Current Olm~to~ z~t Neurobtology, 4, 212-217.Google Scholar
- Dayan, P. and Sejnowski, T. J. TD ()~) converges with probability 1, Machine Learning 14, 295-301 (1994). Google ScholarDigital Library
- Dickenson, A (1980). CoMemporary Animal Learn,ng Theory, Cambridge, England: Cambridge University Press.Google Scholar
- Gallistel, CR (1990) The organzzatto~, of leam~ng. Cambridge, Mass: MIT Press.Google Scholar
- Gluck, MA, Thompson, RF (1987) Modeling the neural substrates of associative learning and memory: a computational approach. Psychological Rev. 94, 176-191.Google ScholarCross Ref
- Greenough, WT, and Bailey, CH (1988). The anatomy of a memory: Convergence of results across a diversity of tests. Trends zn Neurosczence, 11, 142-147.Google Scholar
- Hammer, M (1994) An identified neuron mediates the unconditioned stimulus in associative olfactory learning in honeybees. Nature 366:59-63.Google ScholarCross Ref
- Hawkins RD, Kandel ER (1984) Is there a cell-biological alphabet for simple forms of learning? Psychological Rev. 91(3):375-91.Google ScholarCross Ref
- Hebb, DO (1949) The orgamzat,on of behavzor. New York: Wiley.Google Scholar
- Kalman, RE (1960) A new approach to linear filtering and prediction problems. J. Baszc Eng., Trans ASslIE, Series D 82(1):35-45.Google ScholarCross Ref
- Ljungberg, T, Apicella, P & Schultz, W (1992). Responses of monkey dopamine neurons during learning of behavioral reactions. Journal of Neurophyszology, 67(1), 145-163.Google ScholarCross Ref
- MacKintosh, NJ (1983) ConditioniT~g aT~d Assoc~ahve Learning. Oxford University Press: Oxford, UI(.Google Scholar
- Montague, P. R., Dayan, P. and Sejnowski, T. 3., A framework for mesolimbic dopamine systems based on predictive Hebbian learning, Journal of NeuroscleT~ce (submitted for publication).Google Scholar
- Quartz, S, Dayan, P, Montague, PR, Sejnowski, TJ (1992) Expectation learning in the brain using diffuse ascending connections. Soc. Neurosc,. Abslr. 18:1210Google Scholar
- Pearce JM, Hall G (1980) A model for Pavlovian learning: variations in the effectiveness of conditioned but. not of unconditioned stimuli. Psychologtcal Revtew 87: 532-52.Google ScholarCross Ref
- Rauschecker JP (1991) Mechanisms of visual plasticity: Hebb synapses, NMDA receptors, and beyond. Phystologzcal Reviews 71(2):587-615.Google Scholar
- Rescorla, RA & Wagner, AR (1972). A theory of Pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement. In AH Black & WF Prokasy, editors, Classical Condit~o~zng II: Current Research aT~d Theory, pp 64-69. New York, NY: Appleton-Century- Crofts.Google Scholar
- Schultz, W, Apicella, P, Ljungberg, T (1993) Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. J. Neuroscience 13(3):900-13.Google ScholarCross Ref
- Sutton, RS, Barto, AG (1981). Toward a modern theory of adaptive networks: Expectation and prediction. Psychological Revzew, 88 2, pp 135-170.Google ScholarCross Ref
- Sutton, RS (1988). Learning to predict by the methods of temporal difference. Machzne Learning, 3, pp 9-44. Google ScholarDigital Library
- Sutton, RS, Barto, AG (1987). A temporM-difference model of classical conditioning. Proceedings of the N, nth Annual CoT~ference of the Cognztwe Science Society. Seattle, WA.Google Scholar
- Sutton, RS, Barto, AG (1989). Time-derivative models of Pavlovian reinforcement. In M Gabriel & J Moore, editors, LearT~zT~g and Computahonal Neurosc~ence. Cambridge, MA' MIT Press.Google Scholar
- Widrow. B, Stearns, SD (1985) Adaptive signal process- ~ng. Englewood Cliffs, N J: Prentice-Hall. Google ScholarDigital Library
Index Terms
- Predictive Hebbian learning
Recommendations
Inheritance of hippocampal place fields through hebbian learning: Effects of theta modulation and phase precession on structure formation
A place cell is a neuron that fires whenever the animal traverses a particular location of the environment-the place field of the cell. Place cells are found in two regions of the rodent hippocampus: CA3 and CA1. Motivated by the anatomical connectivity ...
A columnar model of somatosensory reorganizational plasticity based on Hebbian and non-Hebbian learning rules
Topographical and functional aspects of neuronal plasticity were studied in the primary somatosensory cortex of adult rats in acute electrophysiological experiments. Under these experimental conditions, we observed short-term reversible reorganization ...
Spike-Timing-Dependent Hebbian Plasticity as Temporal Difference Learning
A spike-timing-dependent Hebbian mechanism governs the plasticity of recurrent excitatory synapses in the neocortex: synapses that are activated a few milliseconds before a postsynaptic spike are potentiated, while those that are activated a few ...
Comments