Long short-term memory
- PMID: 9377276
- DOI: 10.1162/neco.1997.9.8.1735
Abstract
Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, backpropagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.
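The mechanism the abstract describes (a constant error carousel guarded by multiplicative gates) can be illustrated with a small sketch. This is not the authors' implementation. It assumes a single scalar memory cell with one input gate and one output gate, uses `tanh` as the squashing function for simplicity, and sets the gate pre-activations by hand instead of learning them. The names `cell_step` and `run` are hypothetical.

```python
import math


def sigmoid(x):
    """Logistic function, used here to squash gate activations into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def cell_step(state, candidate, in_gate_net, out_gate_net):
    """One step of a single gated memory cell (illustrative, scalar version).

    state        -- internal cell state carried over from the previous step
    candidate    -- new value proposed for storage at this step
    in_gate_net  -- pre-activation of the gate guarding writes to the state
    out_gate_net -- pre-activation of the gate guarding reads from the state
    """
    in_gate = sigmoid(in_gate_net)
    out_gate = sigmoid(out_gate_net)
    # Self-connection with fixed weight 1.0: the old state is carried over
    # unchanged, so d(new_state)/d(state) == 1 and the error flowing back
    # along this path is neither scaled down nor scaled up.
    new_state = state + in_gate * math.tanh(candidate)
    # The output gate decides how much of the stored value is exposed.
    output = out_gate * math.tanh(new_state)
    return new_state, output


def run(candidates, in_gate_nets, out_gate_nets):
    """Run the cell over a sequence, starting from a zero state."""
    state = 0.0
    outputs = []
    for c, i, o in zip(candidates, in_gate_nets, out_gate_nets):
        state, y = cell_step(state, c, i, o)
        outputs.append(y)
    return state, outputs
```

For example, opening the input gate only at the first step (a large positive `in_gate_net`) and holding it nearly closed for the following 1000 steps leaves the stored value almost untouched, whatever the later candidates are. Opening the output gate at the final step then reads it back. In a trained network the gate pre-activations would be computed from learned weights; here they are fixed inputs so that the carousel behaviour is visible in isolation.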