全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
PLOS ONE  2008 

The Influence of Markov Decision Process Structure on the Possible Strategic Use of Working Memory and Episodic Memory

DOI: 10.1371/journal.pone.0002756

Full-Text   Cite this paper   Add to My Lib

Abstract:

Researchers use a variety of behavioral tasks to analyze the effect of biological manipulations on memory function. This research will benefit from a systematic mathematical method for analyzing memory demands in behavioral tasks. In the framework of reinforcement learning theory, these tasks can be mathematically described as partially-observable Markov decision processes. While a wealth of evidence collected over the past 15 years relates the basal ganglia to the reinforcement learning framework, only recently has much attention been paid to including psychological concepts such as working memory or episodic memory in these models. This paper presents an analysis that provides a quantitative description of memory states sufficient for correct choices at specific decision points. Using information from the mathematical structure of the task descriptions, we derive measures that indicate whether working memory (for one or more cues) or episodic memory can provide strategically useful information to an agent. In particular, the analysis determines which observed states must be maintained in or retrieved from memory to perform these specific tasks. We demonstrate the analysis on three simplified tasks as well as eight more complex memory tasks drawn from the animal and human literature (two alternation tasks, two sequence disambiguation tasks, two non-matching tasks, the 2-back task, and the 1-2-AX task). The results of these analyses agree with results from quantitative simulations of the task reported in previous publications and provide simple indications of the memory demands of the tasks which can require far less computation than a full simulation of the task. This may provide a basis for a quantitative behavioral stoichiometry of memory tasks.

References

[1]  Eichenbaum H, Cohen NJ (2001) From Conditioning to Conscious Recollection. New York: Oxford University Press.
[2]  Squire LR (2004) Memory systems of the brain: a brief history and current perspective. Neurobiol Learn Mem 82(3): 171–7.
[3]  Schacter DL, Tulving E (1994) Memory systems. Cambridge, MA: The MIT Press.
[4]  Zilli EA, Hasselmo ME (2008) Modeling the role of working memory and episodic memory in behavioral tasks. Hippocampus 18(2): 193–209.
[5]  Sutton RS, Barto AG (1998) Reinforcement Learning: An Introduction. Cambridge: MIT Press.
[6]  Monahan GE (1982) A survey of Partially Observable Markov Decision Processes. Mgmt Sci 28: 16.
[7]  Kaelbling LP, Littman ML, Cassandra AR (1998) Planning and acting in partially observable stochastic domains. Artificial Intelligence 101: 99–134.
[8]  O'Reilly RC, Frank MJ (2006) Making working memory work: A computational model of learning in the prefrontal cortex and basal ganglia. Neural Comput 18: 283–328.
[9]  Dayan P (2007) Bilinearity, rules, and prefrontal cortex. Frontiers in Computational Neuroscience 1: 1.
[10]  Moustafa AA, Maida AS (2007) Using TD learning to simulate working memory performance in a model of the prefrontal cortex and basal ganglia. Cognitive Systems Research 8: 262–281.
[11]  Fuster JM (1995) Memory in the cerebral cortex. Cambridge, MA: MIT Press.
[12]  Lisman JE, Fellous JM, Wang XJ (1998) A role for NMDA-receptor channels in working memory. Nat Neurosci 1: 273–275.
[13]  Miller EK, Erickson CA, Desimone R (1996) Neural mechanisms of visual working memory in prefrontal cortex of the macaque. J Neurosci 16: 5154–5167.
[14]  Zipser D, Kehoe B, Littlewort G, Fuster J (1993) A spiking network model of short-term active memory. J Neurosci 13(8): 3406–20.
[15]  Hasselmo ME, Eichenbaum H (2005) Hippocampal mechanisms for the context-dependent retrieval of episodes. Neural Netw 18: 1172–1190.
[16]  Tulving E (2002) Episodic memory: From mind to brain. Annu Rev Psychol 53: 1–25.
[17]  Kemeny JG, Snell JL (1976) Finite Markov chains. New York: Springer-Verlag.
[18]  Wood ER, Dudchenko PA, Robitsek RJ, Eichenbaum H (2000) Hippocampal neurons encode information about different types of memory episodes occurring in the same location. Neuron 27: 623–633.
[19]  Lee I, Griffin AL, Zilli EA, Eichenbaum H, Hasselmo ME (2006) Gradual translocation of spatial correlates of neural firing in the hippocampus toward prospective reward locations. Neuron 51: 639–650.
[20]  Ainge JA, van der Meer MA, Langston RF, Wood ER (2007) Exploring the role of context-dependent hippocampal activity in spatial alternation behavior. Hippocampus 17: 988–1002.
[21]  Cohen JD, Perlstein WM, Braver TS, Nystrom LE, Noll DC, Jonides J, Smith EE (1997) Temporal dynamics of brain activation during a working memory task. Nature 386(6625): 604–8.
[22]  Nystrom LE, Braver TS, Sabb FW, Delgado MR, Noll DC, et al. (2000) Working memory for letters, shapes, and locations: fMRI evidence against stimulus-based regional organization in human prefrontal cortex. Neuroimage 11(5 Pt 1): 424–46.
[23]  Stern CE, Sherman SJ, Kirchhoff BA, Hasselmo ME (2001) Medial temporal and prefrontal contributions to working memory tasks with novel and familiar stimuli. Hippocampus 11(4): 337–46.
[24]  Nestor PG, Faux SF, McCarley RW, Shenton ME, Sands SF (1990) Measurement of visual sustained attention in schizophrenia using signal detection analysis and a newly developed computerized CPT task. Schizophr Res 3(5–6): 329–332.
[25]  Baddeley AD (1986) Working Memory. Oxford: Clarendon Press.
[26]  Goldman-Rakic PS (1995) Cellular basis of working memory. Neuron 14: 477–485.
[27]  Griffin AL, Eichenbaum H, Hasselmo ME (2007) Spatial representations of hippocampal CA1 neurons are modulated by behavioral context in a hippocampus-dependent memory task. J Neurosci 27: 2416–2423.
[28]  Hampson RE, Deadwyler SA (1996) Ensemble codes involving hippocampal neurons are at risk during delayed performance tests. Proc Natl Acad Sci U S A 93: 13487–13493.
[29]  Agster KL, Fortin NJ, Eichenbaum H (2002) The hippocampus and disambiguation of overlapping sequences. J Neurosci 22: 5760–5768.
[30]  Baddeley AD, Hitch G (1974) Working memory. In: Bower GH, editor. The Psychology of Learning and Motivation: Advances in Research and Theory. New York: Academic Press. pp. 47–89.
[31]  Khatri CG, Rao CR (1968) Solutions to some functional equations and their applications to characterization of probability distributions. Sankhya 30: 167–180.
[32]  Liu S (1999) Matrix results on the Khatri-Rao and Tracy-Singh products. Linear Algebra and its Applications 289: 267–277.
[33]  Howard MW, Fotedar MS, Datey AV, Hasselmo ME (2005) The temporal context model in spatial navigation and relational learning: toward a common explanation of medial temporal lobe function across domains. Psychol Rev 112: 75–116.
[34]  Koene RA, Hasselmo ME (2006) First-in-first-out item replacement in a model of short-term memory based on persistent spiking. Cereb Cortex 17(8): 1766–1781.
[35]  Manns JR, Howard MW, Eichenbaum H (2007) Gradual changes in hippocampal activity support remembering the order of events. Neuron 56: 530–540.
[36]  Fortin NJ, Agster KL, Eichenbaum HB (2002) Critical role of the hippocampus in memory for sequences of events. Nat Neurosci 5: 458–462.
[37]  Fuhs MC, Touretzky DS (2007) Context learning in the rodent hippocampus. Neural Computation 19: 3173–3215.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133