A neural Turing machine (NTM) is a recurrent neural network model of a Turing machine. The approach was published by Alex Graves et al. in 2014.[1] NTMs combine the fuzzy pattern matching capabilities of neural networks with the algorithmic power of programmable computers.
An NTM has a neural network controller coupled to external memory resources, which it interacts with through attentional mechanisms. The memory interactions are differentiable end-to-end, making it possible to optimize them using gradient descent.[2] An NTM with a long short-term memory (LSTM) network controller can infer simple algorithms such as copying, sorting, and associative recall from examples alone.[1]
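The sketch below illustrates, under assumptions, how such memory interactions can remain differentiable: a content-based addressing step turns a query key into softmax attention weights over memory rows, and reads and writes are weighted sums, so gradients can flow through every operation. This is a minimal illustration only, not the authors' unreleased reference code; all names, shapes, and parameters (memory, key, beta, erase, add) are hypothetical.

```python
import numpy as np

def content_addressing(memory, key, beta):
    """Attention weights over memory rows via sharpened cosine similarity."""
    # memory: (N, M) matrix of N slots; key: (M,) query; beta: sharpening scalar.
    sim = memory @ key / (
        np.linalg.norm(memory, axis=1) * np.linalg.norm(key) + 1e-8
    )
    scores = beta * sim
    scores -= scores.max()              # subtract max for numerical stability
    weights = np.exp(scores)
    return weights / weights.sum()      # softmax: smooth, differentiable weighting

def read(memory, weights):
    """Read vector as the attention-weighted sum of memory rows."""
    return weights @ memory

def write(memory, weights, erase, add):
    """Soft erase-then-add write; differentiable in all of its inputs."""
    memory = memory * (1 - np.outer(weights, erase))
    return memory + np.outer(weights, add)

# Toy usage: one read and one write on a 4-slot, 3-dimensional memory.
M = np.random.randn(4, 3)
w = content_addressing(M, key=np.array([1.0, 0.0, -1.0]), beta=5.0)
r = read(M, w)
M = write(M, w, erase=np.full(3, 0.5), add=np.array([0.1, 0.2, 0.3]))
```

Because the attention weights are produced by a softmax rather than a hard selection, every memory slot contributes a little to each read and write, which is what allows the whole read-write loop to be trained with gradient descent.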
The authors of the original NTM paper did not publish their source code.[1] The first stable open-source implementation was published in 2018 at the 27th International Conference on Artificial Neural Networks, receiving a best-paper award.[3][4][5] Other open-source implementations of NTMs exist, but as of 2018 they were not sufficiently stable for production use.[6][7][8][9][10][11][12] The developers either report that the gradients of their implementation sometimes become NaN during training for unknown reasons and cause training to fail;[10][11][9] report slow convergence;[7][6] or do not report the speed of learning of their implementation.[12][8]
Differentiable neural computers are an outgrowth of neural Turing machines, with attention mechanisms that control where the memory is active and improve performance.[13]