Movatterモバイル変換


[0]ホーム

URL:


Skip to Main Content

Advertisement

MIT Press Direct, home
header search
    Neural Computation
    Skip Nav Destination
    Article navigation
    April 01 2019

    Gated Orthogonal Recurrent Units: On Learning to Forget

    In Special Collection:CogNet
    Li Jing,
    Li Jing
    Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.[email protected]
    Search for other works by this author on:
    Caglar Gulcehre,
    Caglar Gulcehre
    University of Montreal, Montreal H3T, 1J4, Quebec, Canada[email protected]
    Search for other works by this author on:
    John Peurifoy,
    John Peurifoy
    Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.[email protected]
    Search for other works by this author on:
    Yichen Shen,
    Yichen Shen
    Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.[email protected]
    Search for other works by this author on:
    Max Tegmark,
    Max Tegmark
    Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.[email protected]
    Search for other works by this author on:
    Marin Soljacic,
    Marin Soljacic
    Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.[email protected]
    Search for other works by this author on:
    Yoshua Bengio
    Yoshua Bengio
    University of Montreal, Montreal H3T 1J4, Quebec, Canada[email protected]
    Search for other works by this author on:
    Crossmark: Check for Updates
    Li Jing
    Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.[email protected]
    Caglar Gulcehre
    University of Montreal, Montreal H3T, 1J4, Quebec, Canada[email protected]
    John Peurifoy
    Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.[email protected]
    Yichen Shen
    Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.[email protected]
    Max Tegmark
    Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.[email protected]
    Marin Soljacic
    Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.[email protected]
    Yoshua Bengio
    University of Montreal, Montreal H3T 1J4, Quebec, Canada[email protected]
    *

    Li Jing and Caglar Gulcehre contributed equally to this letter.

    Received:April 29 2018
    Accepted:December 08 2018
    Online ISSN: 1530-888X
    Print ISSN: 0899-7667
    © 2019 Massachusetts Institute of Technology
    2019
    Massachusetts Institute of Technology
    Neural Computation (2019) 31 (4): 765–783.
    Article history
    Received:
    April 29 2018
    Accepted:
    December 08 2018
    Citation

    Li Jing,Caglar Gulcehre,John Peurifoy,Yichen Shen,Max Tegmark,Marin Soljacic,Yoshua Bengio; Gated Orthogonal Recurrent Units: On Learning to Forget.Neural Comput 2019; 31 (4): 765–783. doi:https://doi.org/10.1162/neco_a_01174

    Download citation file:

    toolbar search
    toolbar search

      Abstract

      We present a novel recurrent neural network (RNN)–based model that combines the remembering ability of unitary evolution RNNs with the ability of gated RNNs to effectively forget redundant or irrelevant information in its memory. We achieve this by extending restricted orthogonal evolution RNNs with a gating mechanism similar to gated recurrent unit RNNs with a reset gate and an update gate. Our model is able to outperform long short-term memory, gated recurrent units, and vanilla unitary or orthogonal RNNs on several long-term-dependency benchmark tasks. We empirically show that both orthogonal and unitary RNNs lack the ability to forget. This ability plays an important role in RNNs. We provide competitive results along with an analysis of our model on many natural sequential tasks, including question answering, speech spectrum prediction, character-level language modeling, and synthetic tasks that involve long-term dependencies such as algorithmic, denoising, and copying tasks.

      © 2019 Massachusetts Institute of Technology
      2019
      Massachusetts Institute of Technology
      You do not currently have access to this content.

      Sign in

      Don't already have an account?Register

      Client Account

      You could not be signed in. Please check your email address / username and password and try again.
      Could not validate captcha. Please try again.

      Sign in via your Institution

      Sign in via your Institution
      630Views
      55Web of Science
      42Crossref

      Advertisement

      Related Book Chapters

      The SBA: Exploiting Orthogonality in AND-OR Parallel Systems
      Logic Programming: The 1997 International Symposium
      William Gates
      Portraits In Silicon
      The Controlled not Gate
      Quantum Computing Without Magic: Devices
      From the Gate to the Field
      The Householders: Robert Duncan and Jess

      Advertisement

      Neural Computation
      • Online ISSN 1530-888X
      • Print ISSN 0899-7667
      Close Modal
      Close Modal
      This Feature Is Available To Subscribers Only

      Sign In orCreate an Account

      Close Modal
      Close Modal
      This site uses cookies. By continuing to use our website, you are agreeing toour privacy policy. No content on this site may be used to train artificial intelligence systems without permission in writing from the MIT Press.
      Accept

      [8]ページ先頭

      ©2009-2025 Movatter.jp