- Notifications
You must be signed in to change notification settings - Fork81
Practical 6: LSTM language models
oxford-cs-ml-2015/practical6
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Machine Learning, spring 2015
In this practical, we train an LSTM for character-level language modelling. Since this is the last week for practicals, it will beextremely short and does not require writing code, and is due by the end of the Friday's session (regardless of whether you are from the Wednesday or Friday session).
See PDF for details.
Setup will be the same as last time in practical 1. Please refer to thepractical 1 repository, and run the script as instructed last time. If you get an error thatnngraph
is not installed, run:
luarocks install nngraph
Clone the practicaland download the associated data:
git clone https://github.com/oxford-cs-ml-2015/practical6.gitcd practical6wget http://www.cs.ox.ac.uk/people/brendan.shillingford/teaching/practical6-data.tar.gztar xvf practical6-data.tar.gz
and start training the model:
th train.lua -vocabfile vocab.t7 -datafile train.t7
Make note of the time at which you run thetrain.lua
script. Every several iterations, the training script will save the current model (including its parameters) to a file calledmodel_autosave.t7
. You can make snapshots of this file if you want, but this is not required for the practical.
Thepractical6-data.tar.gz
file is for 64-bit little-endian CPUs. For all other machines (i.e. if runninguname -m
doesn't print outx86_64
), then see this comment for instructions:https://github.com/oxford-cs-ml-2015/practical6/commit/96749c8d9bc93f864c94c048a3c8cd73f59f733b#commitcomment-11003337. This is the same data, but using ASCII serialization.You may also want to use this faster LSTM factory method, instead of the one in this repository:https://gist.github.com/karpathy/7bae8033dcf5ca2630ba which performs all the matrix multiplications at once followed by severalnn.Narrow
operations to extract out the gate values; read its comments for details.
https://www.cs.ox.ac.uk/people/nando.defreitas/machinelearning/
About
Practical 6: LSTM language models
Resources
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Releases
Packages0
Uh oh!
There was an error while loading.Please reload this page.