Neural Comput. 1997 Nov 15;9(8):1735-80.

doi: 10.1162/neco.1997.9.8.1735.

Long short-term memory

S Hochreiter¹, J Schmidhuber

PMID: 9377276

S Hochreiter et al. Neural Comput. 1997.

Affiliation

¹ Fakultät für Informatik, Technische Universität München, Germany.

Abstract

Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, backpropagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.
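
The contrast the abstract draws between decaying error backflow and a constant error carousel can be illustrated with a small sketch. This is not the paper's implementation: the function names, the use of tanh as the squashing function, the hand-picked gate inputs, and the single-cell setup without learned weights are all assumptions made for illustration.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic function; its output in (0, 1) acts as a soft gate."""
    return 1.0 / (1.0 + math.exp(-x))


def error_backflow(self_weight: float, derivative: float, steps: int) -> float:
    """Factor by which an error signal is scaled after flowing back `steps`
    time steps through one self-connected unit: (f'(net) * w) ** steps.
    Assumes the derivative is the same at every step (a simplification)."""
    return (derivative * self_weight) ** steps


def memory_cell_step(state, cell_input, in_gate_net, out_gate_net):
    """One time step of a single gated memory cell (hypothetical helper).

    The internal state feeds back to itself with a fixed weight of 1.0,
    which is the constant error carousel. The input gate scales what is
    written to the state; the output gate scales what is read from it.
    """
    in_gate = sigmoid(in_gate_net)
    out_gate = sigmoid(out_gate_net)
    new_state = state + in_gate * math.tanh(cell_input)
    output = out_gate * math.tanh(new_state)
    return new_state, output


# A logistic unit has f' <= 0.25, so even with self-weight 1.0 the
# backflowing error shrinks geometrically with the time lag.
decayed = error_backflow(self_weight=1.0, derivative=0.25, steps=1000)

# A linear unit (f' = 1) with self-weight 1.0 passes the error unchanged.
constant = error_backflow(self_weight=1.0, derivative=1.0, steps=1000)

# Write a value with the input gate open, then hold it for 1000 steps
# with the input gate almost closed while distracting input arrives.
state, out = memory_cell_step(0.0, cell_input=2.0, in_gate_net=10.0, out_gate_net=10.0)
stored = state
for _ in range(1000):
    state, out = memory_cell_step(state, cell_input=-3.0, in_gate_net=-30.0, out_gate_net=10.0)

print(f"decayed backflow factor:  {decayed}")
print(f"constant backflow factor: {constant}")
print(f"state drift after 1000 steps: {abs(state - stored):.3e}")
```

In this sketch the gate inputs are fixed by hand to show the mechanism; in the method the abstract describes, the gate units learn when to open and close access to the cell.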

Cited by

Automatic Detection and Extraction of Key Resources from Tables in Biomedical Papers.
Ozyurt IB, Bandrowski A. bioRxiv [Preprint]. 2024 Oct 17:2024.10.15.618379. doi: 10.1101/2024.10.15.618379. Update in: BioData Min. 2025 Mar 20;18(1):23. doi: 10.1186/s13040-025-00438-9. PMID: 39464155. Free PMC article.
A wavelet subband based LSTM model for 12-lead ECG synthesis from reduced lead set.
Kapfo A, Datta S, Dandapat S, Bora PK. Biomed Eng Lett. 2024 Jul 31;14(6):1385-1395. doi: 10.1007/s13534-024-00412-0. eCollection 2024 Nov. PMID: 39465099.
TCEDN: A Lightweight Time-Context Enhanced Depression Detection Network.
Yan K, Miao S, Jin X, Mu Y, Zheng H, Tian Y, Wang P, Yu Q, Hu D. Life (Basel). 2024 Oct 16;14(10):1313. doi: 10.3390/life14101313. PMID: 39459613. Free PMC article.
The genotype-phenotype landscape of an allosteric protein.
Tack DS, Tonner PD, Pressman A, Olson ND, Levy SF, Romantseva EF, Alperovich N, Vasilyeva O, Ross D. Mol Syst Biol. 2021 Mar;17(3):e10179. doi: 10.15252/msb.202010179. PMID: 33784029. Free PMC article.
Forest Environmental Carrying Capacity Based on Deep Learning.
Linshu S, Hao W, Chao Y, Weiming S, Siyi W, Shen W. Comput Intell Neurosci. 2022 Sep 27;2022:7547645. doi: 10.1155/2022/7547645. eCollection 2022. PMID: 36203723. Free PMC article.
