Learning

From Chessprogramming wiki

Home * Learning

Learning^[1]

Learning,
the process of acquiring newknowledge which involves synthesizing different types ofinformation.Machine learning as aspect of computer chess programming deals with algorithms that allow the program to change its behavior based on data, which for instance occurs duringgame playing against a variety of opponents considering the final outcome and/or the game record for instance as history score chart indexed by ply. Related to Machine learning isevolutionary computation and its sub-areas ofgenetic algorithms, andgenetic programming, that mimics the process of naturalevolution, as further mentioned inautomated tuning. The process of learning often impliesunderstanding,perception orreasoning. So calledRote learning avoids understanding and focuses onmemorization.Inductive learning takes examples and generalizes rather than starting with existing knowledge.Deductive learning takes abstract concepts to make sense of examples^[2].

Learning inside a Chess Program

Learning inside a chess program may address several disjoint issues. Apersistent hash table remembers "important" positions from earlier games inside thesearch with itsexact score^[3]. Worse positions may be avoided in advance.Learning opening book moves, that is appending successful novelties or modify the probability of already stored moves from the book based on the outcome of a game^[4]. Another application is learningevaluation weights of various features, f. i.piece-^[5] orpiece-square^[6] values ormobility. Programs may also learn to control search^[7] ortime usage^[8].

Learning Paradigms

There are three major learningparadigms, each corresponding to a particular abstract learning task. These aresupervised learning,unsupervised learning andreinforcement learning. Usually any given type ofneural network architecture can be employed in any of those tasks.

Supervised Learning

see main pageSupervised Learning

Supervised learning is learning from examples provided by a knowledgable external supervisor. In machine learning, supervised learning is a technique for deducing a function from training data. The training data consist of pairs of input objects and desired outputs, f.i. in computer chess a sequence of positions associated with the outcome of a game^[9] .

Unsupervised Learning

Unsupervised machine learning seems much harder: the goal is to have the computer learn how to do something that we don't tell it how to do. The learner is given only unlabeled examples, f. i. a sequence of positions of a running game but the final result (still) unknown. A form of reinforcement learning can be used for unsupervised learning, where anagent bases its actions on the previous rewards and punishments without necessarily even learning any information about the exact ways that its actions affect the world.Clustering is another method of unsupervised learning.

Reinforcement Learning

see main pageReinforcement Learning

Reinforcement learning is defined not by characterizing learning methods, but by characterizing a learning problem. Reinforcement learning is learning what to do - how to map situations to actions - so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them. The reinforcement learning problem is deeply indebted to the idea ofMarkov decision processes (MDPs) from the field ofoptimal control.

Learning Topics

Programs

Selected Publications

^[10]

1940 ...

Walter Pitts (1942).Some observations on the simple neuron circuit.Bulletin of Mathematical Biology, Vol. 4, No. 3
Warren S. McCulloch,Walter Pitts (1943).A Logical Calculus of the Ideas Immanent in Nervous Activity.Bulletin of Mathematical Biology, Vol. 5, No. 1
Donald O. Hebb (1949).The Organization of Behavior.Wiley & Sons

1950 ...

Stephen C. Kleene (1951)Representation of Events in Nerve Nets and Finite Automata. RM-704,RAND paper,pdf, reprinted in

Claude Shannon,John McCarthy (eds.) (1956).Automata Studies.Annals of Mathematics Studies, No. 34

Paul I. Richards (1951).Machines which can learn.American Scientist, 39:711-716
Paul I. Richards (1952).On Game Learning Machines.The Scientific Monthly, Vol. 74, No. 4, April 1952
Alan Turing (1953).Chess. part of the collectionDigital Computers Applied to Games inBertram Vivian Bowden (editor),Faster Than Thought, a symposium on digital computing machines, reprinted 1988 inComputer Chess Compendium, reprinted in

Alan Turing,Jack Copeland (editor) (2004).The Essential Turing, Seminal Writings in Computing, Logic, Philosophy, Artificial Intelligence, and Artificial Life plus The Secrets of Enigma.Oxford University Press,amazon,google books

Marvin Minsky (1954).Neural Nets and the Brain Model Problem. Ph.D. dissertation,Princeton University

1955 ...

Robert R. Bush,Frederick Mosteller (1955).Stochastic models for learning.John Wiley & Sons
John von Neumann (1956).Probabilistic Logic and the Synthesis of Reliable Organisms From Unreliable Components. in

Claude Shannon,John McCarthy (eds.) (1956).Automata Studies.Annals of Mathematics Studies, No. 34,pdf

Frederick Mosteller (1956).Stochastic Learning Models. inJerzy Neyman (1956).Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Volume 5: Contributions to Econometrics, Industrial Research, and Psychometry,pdf
Frank Rosenblatt (1957).The Perceptron - a Perceiving and Recognizing Automaton. Report 85-460-1, Cornell Aeronautical Laboratory^[11]
Albert M. Uttley (1959).Imitation of Pattern Recognition and Trial-and-error Learning in a Conditional Probability Computer.Reviews of Modern Physics, Vol. 31, April 1959, pp. 546-548^[12]^[13]
Arthur Samuel (1959).Some Studies in Machine Learning Using the Game of Checkers. IBM Journal July 1959 »Checkers
Edward Feigenbaum (1959).An Information Processing Theory of Verbal Learning.RAND Paper

1960 ...

Edward Feigenbaum (1960).Information Theories of Human Verbal Learning. Ph.D. thesis,Carnegie Mellon University, advisorHerbert Simon
Edward Feigenbaum (1961).The Simulation of Verbal Learning Behavior. Proceedings Western Joint Conference, Vol. 19
Edward Feigenbaum,Herbert Simon (1961).Performance of a Reading Task by an Elementary Perceiving and Memorizing Program.RAND Paper,pdf
Donald Michie (1961).Trial and Error. Penguin Science Survey,pdf
Edward Feigenbaum,Herbert Simon (1962).A Theory of the Serial Position Effect.British Journal of Psychology, Vol. 53, 307-32,pdf
Earl B. Hunt (1962).Concept Learning: An Information Processing Problem. Wiley.google books
Frank Rosenblatt (1962).Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms. Spartan Books
Allen Newell (1963).Learning, Generality and Problem Solving.Memorandum RM-3285-1-PR pdf
Herbert Simon,Edward Feigenbaum (1964).An Information-processing Theory of Some Effects of Similarity, Familiarization, and Meaningfulness in Verbal Learning. Journal of Verbal Learning and Verbal Behavior, Vol. 3, No. 5,pdf

1965 ...

James R. Slagle (1965).A multipurpose Theorem Proving Heuristic Program that learns.IFIP Congress 65, Vol. 2
Donald Michie (1966).Game Playing and Game Learning Automata. Advances in Programming and Non-Numerical Computation,Leslie Fox (ed.), pp. 183-200. Oxford, Pergamon. » Includes Appendix:Rules of SOMAC byJohn Maynard Smith, introducesExpectiminimax tree^[14]
Thomas A. Throop (1966).Thoughts on the Development of Computer Learning Programs.Defense Technical Information Center
Arnold K. Griffith (1966).A new Machine-Learning Technique applied to the Game of Checkers.MIT,Project MAC, MAC-M-293
Arthur Samuel (1967).Some Studies in Machine Learning. Using the Game of Checkers. II-Recent Progress.pdf
Marvin Minsky,Seymour Papert (1969).Perceptrons.^[15]^[16]

1970 ...

Albert Zobrist (1970).A Pattern Recognition Program which uses a Geometry-Preserving Representation of Features. Technical Report #85,pdf
Vladimir Vapnik,Alexey Chervonenkis (1971).On the Uniform Convergence of Relative Frequencies of Events to Their Probabilities.Theory of Probability and its Applications, Vol. 16, No. 2
A. Harry Klopf (1972).Brain Function and Adaptive Systems - A Heterostatic Theory. Air Force Cambridge Research Laboratories, Special Reports, No. 133,pdf
Marvin Minsky,Seymour Papert (1972).Perceptrons: An Introduction to Computational Geometry.The MIT Press, 2nd edition with corrections
Herbert Simon,Kevin J. Gilmartin (1973).A Simulation of Memory for Chess Positions. Cognitive Psychology, Vol. 5, pp. 29-46.pdf
Arnold K. Griffith (1974).A Comparison and Evaluation of Three Machine Learning Procedures as Applied to the Game of Checkers.Artificial Intelligence, Vol. 5, No. 2 »Checkers

1975 ...

Jacques Pitrat (1976).A Program to Learn to Play Chess. Pattern Recognition and Artificial Intelligence, pp. 399-419. Academic Press Ltd. London, UK. ISBN 0-12-170950-7.
Jacques Pitrat (1976).Realization of a Program Learning to Find Combinations at Chess. Computer Oriented Learning Processes (ed. J. Simon). Noordhoff, Groningen, The Netherlands.
Pericles Negri (1977).Inductive Learning in a Hierarchical Model for Representing Knowledge in Chess End Games.pdf
Ryszard Michalski,Pericles Negri (1977).An experiment on inductive learning in chess endgames.Machine Intelligence 8,pdf
Boris Stilman (1977).The Computer Learns. in1976 US Computer Chess Championship, byDavid Levy, Computer Science Press, Woodland Hills, CA, pp. 83-90
Richard Sutton (1978).Single channel theory: A neuronal theory of learning. Brain Theory Newsletter 3, No. 3/4, pp. 72-75.
Ross Quinlan (1979).Discovering Rules by Induction from Large Collections of Examples. Expert Systems in the Micro-electronic Age, pp. 168-201. Edinburgh University Press (Introducing ID3)

1980 ...

Sarah E. Goldin,Philip Klahr (1981).Learning and Abstraction in Simulation.IJCAI 1981,pdf
Paul E. Utgoff,Tom Mitchell (1982).Acquisition of Appropriate Bias for Inductive Concept Learning.AAAI 1982,pdf
A. Harry Klopf (1982).The Hedonistic Neuron: A Theory of Memory, Learning, and Intelligence. Hemisphere Publishing Corporation,University of Michigan
Alen Shapiro,Tim Niblett (1982).Automatic Induction of Classification Rules for Chess End game.Advances in Computer Chess 3
Thomas Nitsche (1982).A Learning Chess Program.Advances in Computer Chess 3
Ryszard Michalski,Jaime Carbonell,Tom Mitchell (1983).Machine Learning: An Artificial Intelligence Approach.Springer
Ross Quinlan (1983).Learning efficient classification procedures and their application to chess end games. In Machine Learning: An Artificial Intelligence Approach, pages 463–482. Tioga, Palo Alto
Alen Shapiro (1983).The Role of Structured Induction in Expert Systems.University of Edinburgh, Machine Intelligence Research Unit (Ph.D. thesis)
Edward Feigenbaum,Herbert Simon (1984).EPAMlike models of recognition and learning.Cognitive Science, Vol. 8, 305-336,pdf
John E. Laird,Paul S. Rosenbloom,Allen Newell (1984).Towards Chunking as a General Learning Mechanism.AAAI 1984
Albrecht Heeffer (1984).Automated Acquisition on Concepts for the Description of Middle-game Positions in Chess.Turing Institute,Glasgow,Scotland, TIRM-84-005
Paul E. Utgoff (1984).Shift of Bias for Inductive Concept Learning. Ph.D. thesis,Rutgers University, New Brunswick
Leslie Valiant (1984).A Theory of the Learnable.Communications of the ACM, Vol. 27, No. 11,pdf

1985 ...

Tony Marsland (1985).Evaluation-Function Factors.ICCA Journal, Vol. 8, No. 2,pdf
Albrecht Heeffer (1985).Validating Concepts from Automated Acquisition Systems.IJCAI 1985,pdf
Hans Berliner (1985).Goals, Plans, and Mechanisms: Non-symbolically in an Evaluation Surface. Presentation at Evolution, Games, and Learning, Center for Nonlinear Studies,Los Alamos National Laboratory, May 21.
Ryszard Michalski,Jaime Carbonell,Tom Mitchell (1985, 2014).Learning: An Artificial Intelligence Approach, Volume I.Morgan Kaufmann
Igor Roizen,Judea Pearl (1985).Learning Link Probabilities in Causal Trees.Uncertainty in Artificial Intelligence 1

1986

Steven Skiena (1986).An Overview of Machine Learning in Chess.ICCA Journal, Vol. 9, No. 1
Jens Christensen,Richard Korf (1986).A Unified Theory of Heuristic Evaluation functions and Its Applications to Learning. Proceedings of the AAAI-86,pdf.
Ryszard Michalski,Jaime Carbonell,Tom Mitchell (1986).Machine Learning: An Artificial Intelligence Approach, Volume II.Morgan Kaufmann
Tom Mitchell,Jaime Carbonell,Ryszard Michalski (1986).Machine Learning: A Guide to Current Research.The Kluwer International Series in Engineering and Computer Science, Vol. 12
Ivan Bratko,Igor Kononenko (1986).Learning Rules from Incomplete and Noisy Data. Proceedings Unicom Seminar on the Scope of Artificial Intelligence in Statistics. Technical Press

1987

David Slate (1987).A Chess Program that uses its Transposition Table to Learn from Experience.ICCA Journal, Vol. 10, No. 2
Ronald L. Rivest (1987).Learning Decision Lists. Machine Learning 2,3,pdf 2001
Gerald Tesauro,Terrence J. Sejnowski (1987).A 'Neural' Network that Learns to Play Backgammon.NIPS 1987
Alen Shapiro (1987).Structured Induction in Expert Systems. Turing Institute Press in association with Addison-Wesley Publishing Company, Workingham, UK
Alberto Maria Segre (1987).On the Operationality/Generality Trade-off in Explanation-based Learning.IJCAI 1987,pdf
Alberto Maria Segre (1987).Explanation-Based Learning of Generalized Robot Assembly Plans. Ph.D. thesis,University of Illinois at Urbana-Champaign, Advisor:Gerald Francis DeJong, II
Eric B. Baum,Frank Wilczek (1987).Supervised Learning of Probability Distributions by Neural Networks.NIPS 1987

1988

Bruce Abramson (1988).Learning Expected-Outcome Evaluators in Chess. Proceedings of the 1988 AAAI Spring Symposium Series: Computer Game Playing, 26-28.
Richard Sutton (1988).Learning to Predict by the Methods of Temporal Differences.Machine Learning, Vol. 3, No. 1,pdf
David E. Goldberg,John H. Holland (1988).Genetic Algorithms and Machine Learning.Machine Learning, Vol. 3
Kenneth A. De Jong,Alan C. Schultz (1988).Using Experience-Based Learning in Game Playing. Proceedings of the Fifth International Machine Learning Conference,CiteSeerX »Othello
Kai-Fu Lee,Sanjoy Mahajan (1988).A Pattern Classification Approach to Evaluation Function Learning.Artificial Intelligence, Vol. 36, No. 1
Paul E. Utgoff (1988).ID5: An incremental ID3.ML 1988
Shaul Markovitch,Paul D. Scott (1988).The Role of Forgetting in Learning.ML 1988,pdf

1989

David E. Goldberg (1989).Genetic Algorithms in Search, Optimization and Machine Learning.Addison-Wesley
Robert Levinson (1989).A Self-Learning, Pattern-Oriented Chess Program.ICCA Journal, Vol. 12, No. 4
Bruce Abramson (1989).On Learning and Testing Evaluation Functions. Proceedings of the Sixth Israeli Conference on Artificial Intelligence, 1989, 7-16.
Eric Wefald,Stuart Russell (1989).Adaptive Learning of Decision-Theoretic Search Control Knowledge. In Proceedings of the Sixth International Workshop on Machine Learning. Ithaca, NY: Morgan Kaufmann
Stephen Muggleton,Michael Bain,Jean Hayes Michie,Donald Michie (1989).An Experimental Comparison of Human and Machine Learning Formalisms.6. ML 1989,pdf
Eric B. Baum (1989).A Proposal for More Powerful Learning Algorithms.Neural Computation, Vol. 1, No. 2
Susan L. Epstein (1989).The Intelligent Novice - Learning to Play Better.Heuristic Programming in Artificial Intelligence 1
Chris Watkins (1989).Learning from Delayed Rewards. Ph.D. thesis,Cambridge University,pdf

1990 ...

Richard Sutton,Andrew Barto (1990).Time Derivative Models of Pavlovian Reinforcement. Learning and Computational Neuroscience: Foundations of Adaptive Networks: 497-537.
Bruce Abramson (1990).On Learning and Testing Evaluation Functions. Journal of Experimental and Theoretical Artificial Intelligence 2: 241-251.
Tony Scherzer,Linda Scherzer,Dean Tjaden (1990).Learning in Bebe.Computers, Chess, and Cognition »Mephisto Best-Publication Award
Yves Kodratoff,Ryszard Michalski (1990, 2014).Machine Learning : An Artificial Intelligence Approach, Volume III.Morgan Kaufmann
Michèle Sebag (1990).A symbolic-numerical approach for supervised learning from examples and rules. Ph.D. thesis,Paris Dauphine University

1991

Robert Schapire (1991).The Design and Analysis of Efficient Learning Algorithms. Ph.D. thesis,Massachusetts Institute of Technology, supervisorRonald L. Rivest,pdf
Gerhard Mehlsam,Hermann Kaindl,Wilhelm Barth (1991).Feature Construction During Tree Learning.GWAI 1991
Alex van Tiggelen (1991).Neural Networks as a Guide to Optimization - The Chess Middle Game Explored.ICCA Journal, Vol. 14, No. 3
William Tunstall-Pedoe (1991).Genetic Algorithms Optimizing Evaluation Functions.ICCA Journal, Vol. 14, No. 3
Tony Scherzer,Linda Scherzer,Dean Tjaden (1991).Learning in Bebe.ICCA Journal, Vol. 14, No. 4
Steven Walczak (1991).Predicting Actions from Induction on Past Performance. Proceedings of the 8th International Workshop on Machine Learning
Paul E. Utgoff,Jeffery A. Clouse (1991).Two Kinds of Training Information for Evaluation Function Learning.University of Massachusetts, Amherst, Proceedings of theAAAI 1991
Byoung-Tak Zhang,Gerd Veenker (1991).Neural networks that teach themselves through genetic discovery of novel examples.IEEE IJCNN'91,pdf
Byoung-Tak Zhang,Gerd Veenker (1991).Focused incremental learning for improved generalization with reduced training sets. ICANN'91,pdf
Stephen Muggleton (1991).Inductive Logic Programming.New Generation Computing, Vol. 8, No. 4,pdf

1992

Miroslav Kubat (1992).Introduction to Machine Learning.Advanced Topics in Artificial Intelligence 1992
Michael Bain (1992).Learning optimal chess strategies. Proc. Intl. Workshop on Inductive Logic Programming (ed.Stephen Muggleton), Institute for New Generation Computer Technology, Tokyo, Japan.
Eduardo F. Morales (1992).First-Order Induction of Patterns in Chess. Ph.D. Thesis, The Turing Institute,University of Strathclyde,Glasgow
Eduardo F. Morales (1992).Learning Chess Patterns. Inductive Logic Programming (ed.Stephen Muggleton), Academic Press, The Apic Series, London, UK
Gerald Tesauro (1992).Temporal Difference Learning of Backgammon Strategy.ML 1992
Chris Watkins,Peter Dayan (1992).Q-learning.Machine Learning, Vol. 8, No. 2
Gerald Tesauro (1992).Practical Issues in Temporal Difference Learning.Machine Learning, Vol. 8, No. 3-4
Manuela Veloso (1992).Learning by Analogical Reasoning in General Purpose Problem Solving. Ph.D. thesis,Carnegie Mellon University, advisorJaime Carbonell
Chris J. Thornton (1992).Techniques in Computational Learning: An Introduction.Chapman & Hall

1993

Michael Gherrity (1993).A Game Learning Machine. Ph.D. Thesis,University of California, San Diego,zipped ps
Shaul Markovitch,Yaron Sella (1993).Learning of Resource Allocation Strategies for Game Playing.IJCAI 1993,pdf
David Carmel,Shaul Markovitch (1993).Learning Models of Opponent's Strategy in Game Playing.AAAI 1993, FS-93-02,pdf
Dan Geiger,Azaria Paz,Judea Pearl (1993).Learning simple causal structures.International Journal of Intelligent Systems, Vol. 8
Sebastian Thrun,Tom Mitchell (1993).Integrating Inductive Neural Network Learning and Explanation-Based Learning.IJCAI 1993,zipped ps
Alois Heinz,Christoph Hense (1993).Bootstrap learning of α-β-evaluation functions.ICCI 1993,pdf
Nicol N. Schraudolph,Peter Dayan,Terrence J. Sejnowski (1993).Temporal Difference Learning of Position Evaluation in the Game of Go.NIPS 1993

1994

Eduardo F. Morales (1994).Learning Patterns for Playing Strategies.ICCA Journal, Vol. 17, No. 1
Fernand Gobet,Peter Jansen (1994).Towards a chess program based on a model of human memory.Advances in Computer Chess 7 »CHUMP
Michael Bain (1994).Learning Logical Exceptions in Chess. Ph.D. thesis,University of Strathclyde,CitySeerX
Michael Bain,Stephen Muggleton (1994).Learning Optimal Chess Strategies. Machine Intelligence 13 (eds. K. Furukawa andDonald Michie), pp. 291-309. Oxford University Press, Oxford, UK. ISBN 0198538502.
Ryszard Michalski,George Tecuci (1994).Machine Learning: A Multistrategy Approach, Volume IV.Morgan Kaufmann
Gerald Tesauro (1994).TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play.Neural Computation Vol. 6, No. 2
Alberto Maria Segre,Charles Elkan (1994).A High-Performance Explanation-Based Learning Algorithm.Artificial Intelligence, Vol. 68, Nos. 1-2
David E. Moriarty,Risto Miikkulainen (1994).Evolving Neural Networks to focus Minimax Search.AAAI-94,pdf

1995 ...

Gerhard Mehlsam,Hermann Kaindl,Wilhelm Barth (1995).Feature Construction during Tree Learning.GOSLER Final Report 1995: 391-403
Chris McConnell (1995).Tuning Evaluation Functions for Search.ps orpdf fromCiteSeerX
David Heckerman,Dan Geiger,Max Chickering (1995).Learning Bayesian Networks: The Combination of Knowledge and Statistical Data.Machine Learning, Vol. 20,pdf
Tristan Cazenave (1995).Learning and Problem Solving in Gogol, a Go playing program.pdf
Gerald Tesauro (1995).Temporal Difference Learning and TD-Gammon.Communications of the ACM Vol. 38, No. 3
Sebastian Thrun (1995).Learning to Play the Game of Chess. inGerald Tesauro,David S. Touretzky,Todd K. Leen (eds.) Advances in Neural Information Processing Systems 7,MIT Press
Marco Wiering (1995).TD Learning of Game Evaluation Functions with Hierarchical Neural Architectures. Master's thesis,University of Amsterdam,pdf
Michael A. Arbib (ed.) (1995, 2002).The Handbook of Brain Theory and Neural Networks.The MIT Press
Nicol N. Schraudolph (1995).Optimization of Entropy with Neural Networks. Ph.D. thesis,University of California, San Diego
Robert W. Howard (1995).Learning and Memory: Major Ideas, Principles, Issues and Applications. Praeger,amazon.com

1996

Leemon C. Baird III,Mance E. Harmon,A. Harry Klopf (1996).Reinforcement Learning: An Alternative Approach to Machine Intelligence.pdf
Sebastian Thrun (1996).Explanation-Based Neural Network Learning: A Lifelong Learning Approach.Kluwer Academic Publishers
Leslie Pack Kaelbling,Michael L. Littman,Andrew W. Moore (1996).Reinforcement Learning: A Survey.JAIR Vol. 4,pdf
Eduardo F. Morales (1996).Learning Playing Strategies in Chess.Computational Intelligence, Vol. 12, No. 1,CiteSeerX
Wee Sun Lee (1996).Agnostic Learning and Single Hidden Layer Neural Networks. Ph.D. thesis,Australian National University,ps
Johannes Fürnkranz (1996).Machine Learning in Computer Chess: The Next Generation.ICCA Journal, Vol. 19, No. 3,zipped ps
Adriaan de Groot,Fernand Gobet (1996).Perception and memory in chess. Heuristics of the professional eye. Assen: Van Gorcum, The Netherlands. ISBN 90-232-2949-5. Chapter 9; A discussion: Two authors, two different views?word
Stuart Russell (1996).Machine Learning. Chapter 4 of M. A. Boden (Ed.), Artificial Intelligence, Academic Press. Part of the Handbook of Perception and Cognition,ps
Barney Pell,Susan L. Epstein,Robert Levinson (1996).Introduction to the special issue on games: Structure and Learning.Computational Intelligence, Vol. 12, No. 1,pdf
Robert Levinson (1996).General Game-Playing and Reinforcement Learning.Computational Intelligence, Vol. 12, No. 1
Tristan Cazenave (1996).Learning to forecast by explaining the consequences of actions.pdf
Tristan Cazenave (1996).Self fuzzy learning.pdf
Yoav Freund,Robert Schapire (1996).Game Theory, On-line Prediction and Boosting.COLT 1996,pdf
Christopher D. Rosin,Richard K. Belew (1996).A Competitive Approach in Game Learning.COLT 1996,pdf

1997

Yoav Freund,Robert Schapire (1997).A decision-theoretic generalization of on-line learning and an application to boosting.Journal of Computer and System Sciences, Vol. 55, No. 1,1996 pdf »AdaBoost
Sepp Hochreiter,Jürgen Schmidhuber (1997).Long short-term memory.Neural Computation, Vol. 9, No. 8,pdf^[17]
Eduardo F. Morales (1997).On Learning How to Play.Advances in Computer Chess 8,CiteSeerX
Don Beal,Martin C. Smith (1997).Learning Piece Values Using Temporal Differences.ICCA Journal, Vol. 20, No. 3
Kieran Greer,Piyush Ojha,David A. Bell (1997).Learning Search Heuristics from Examples: A Study in Computer Chess, Seventh Conference of the Spanish Association for Artificial Intelligence, CAEPIA’97, November, pp. 695-704.
Nir Friedman,Moises Goldszmidt,David Heckerman,Stuart Russell (1997).Where is the Impact of Bayesian Networks in Learning?IJCAI 1997,ps
Ronald Parr,Stuart Russell (1997).Reinforcement Learning with Hierarchies of Machines. In Advances in Neural Information Processing Systems 10, MIT Press,zipped ps
Tristan Cazenave (1997).Gogol (an Analytical Learning Program).IJCAI 1997,pdf
Tom Mitchell (1997).Machine Learning.McGraw Hill
Michèle Sebag (1997).Stochastic Heuristics for Machine Learning & Machine Learning for Stochastic Optimization. Habilitation,Paris-Sud 11 University
William Uther,Manuela M. Veloso (1997).Adversarial Reinforcement Learning.Carnegie Mellon University,ps
William Uther,Manuela M. Veloso (1997).Generalizing Adversarial Reinforcement Learning.Carnegie Mellon University,ps
Marco Wiering,Jürgen Schmidhuber (1997).HQ-learning.Adaptive Behavior, Vol. 6, No 2