Kuznetsov et al., 2021 - Google Patents

Solving continuous control with episodic memory

Kuznetsov et al., 2021

Document ID: 16502663327011426567
Author: Kuznetsov I; Filchenkov A
Publication year: 2021
Publication venue: arXiv preprint arXiv:2106.08832

External Links

Cited by

Snippet

Episodic memory lets reinforcement learning algorithms remember and exploit promising experience from the past to improve agent performance. Previous works on memory mechanisms show benefits of using episodic-based data structures for discrete action …

Continue reading atarxiv.org (PDF) (other versions)

230000001073episodic memory0titleabstractdescription29

Classifications

The classifications are assigned by a computer and are not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the classifications listed.

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/06—Investment, e.g. financial instruments, portfolio management or fund management
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management

Similar Documents

Publication	Publication Date	Title
Kuznetsov et al.	2021	Solving continuous control with episodic memory
Hambly et al.	2023	Recent advances in reinforcement learning in finance
Chen et al.	2019	Application of deep reinforcement learning on automated stock trading
Rose et al.	2021	A reinforcement learning approach to rare trajectory sampling
Liu et al.	2018	Practical deep reinforcement learning approach for stock trading
Chen et al.	2021	Generative inverse deep reinforcement learning for online recommendation
Li et al.	2022	Minimax-optimal multi-agent RL in Markov games with a generative model
Liu et al.	2022	Prioritized experience replay based on multi-armed bandit
US20240169237A1 (en)	2024-05-23	A computer implemented method for real time quantum compiling based on artificial intelligence
Bhambri et al.	2022	Reinforcement learning methods for wordle: A pomdp/adaptive control approach
Cini et al.	2020	Deep reinforcement learning with weighted Q-Learning
Taveeapiradeecharoen et al.	2018	Dynamic model averaging for daily forex prediction: A comparative study
Shi et al.	2022	Multi actor hierarchical attention critic with RNN-based feature extraction
Shakya et al.	2022	A deep reinforcement learning approach for inventory control under stochastic lead time and demand
Chua et al.	2023	FedPEAT: Convergence of federated learning, parameter-efficient fine tuning, and emulator assisted tuning for artificial intelligence foundation models with mobile edge computing
Karda et al.	2022	Automation of noise sampling in deep reinforcement learning
Tokmak et al.	2024	PACSBO: Probably approximately correct safe Bayesian optimization
Nguyen et al.	2021	Nonmyopic multifidelity acitve search
Nabati et al.	2023	Representation-driven reinforcement learning
Yang et al.	2020	Continuous control for searching and planning with a learned model
Bossens et al.	2024	Lifetime policy reuse and the importance of task capacity
Izadi et al.	2005	Using rewards for belief state updates in partially observable markov decision processes
Refael et al.	2025	LORENZA: Enhancing generalization in low-rank gradient LLM training via efficient zeroth-order adaptive SAM
Marfaing et al.	2018	Computer-assisted fraud detection, from active learning to reward maximization
Yin et al.	2017	Hashing over predicted future frames for informed exploration of deep reinforcement learning

Movatterモバイル変換

Kuznetsov et al., 2021 - Google Patents

External Links

Snippet

Classifications

Similar Documents