bentrevett/pytorch-pos-tagging


A tutorial on how to implement models for part-of-speech tagging using PyTorch and TorchText.

License

MIT license

PyTorch PoS Tagging

Note: This repo only works with torchtext 0.9 or above, which requires PyTorch 1.8 or above. If you are using torchtext 0.8, please use this branch.
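
A quick way to confirm that an environment meets the minimum versions in the note is sketched below. This is not part of the repo: the helper names `parse_version`, `meets_minimum`, and `check_environment` are hypothetical, and the comparison looks only at the leading numeric part of a version string (so a local build suffix such as `+cu111` is ignored).

```python
import re
from importlib import metadata

# Minimum versions taken from the note above.
MINIMUMS = {"torch": "1.8", "torchtext": "0.9"}


def parse_version(text):
    """Return the leading numeric components, e.g. '1.8.0+cu111' -> (1, 8, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError(f"no numeric version found in {text!r}")
    return tuple(int(part) for part in match.group().split("."))


def meets_minimum(installed, minimum):
    """True if the installed version is at least the minimum version."""
    a, b = parse_version(installed), parse_version(minimum)
    # Pad with zeros so that '0.9' and '0.9.0' compare as equal.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_environment(minimums=MINIMUMS):
    """Map each package name to a short status string."""
    report = {}
    for package, minimum in minimums.items():
        try:
            installed = metadata.version(package)
        except metadata.PackageNotFoundError:
            report[package] = "not installed"
            continue
        ok = meets_minimum(installed, minimum)
        report[package] = f"{installed} ({'ok' if ok else 'below ' + minimum})"
    return report


if __name__ == "__main__":
    for package, status in check_environment().items():
        print(f"{package}: {status}")
```

If `torchtext` is reported as below 0.9, the branch mentioned in the note is the one to use.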

This repo contains tutorials covering how to perform part-of-speech (PoS) tagging using PyTorch 1.8, torchtext 0.9, and spaCy 3.0, using Python 3.8.

These tutorials will cover getting started with the most common approach to PoS tagging: recurrent neural networks (RNNs). The first notebook introduces a bi-directional LSTM (BiLSTM) network. The second covers how to fine-tune a pretrained Transformer model.
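
To give a rough picture of the kind of model the first notebook is about, here is a minimal sketch of a multi-layer BiLSTM tagger in PyTorch. It is an illustration under assumptions, not the notebook's code: the class name `BiLSTMTagger`, the layer sizes, the padding indices, and the random toy batch are all made up for the example.

```python
import torch
import torch.nn as nn

TAG_PAD_IDX = 0  # assumed index of the padding tag, ignored by the loss


class BiLSTMTagger(nn.Module):
    """Embedding -> multi-layer bidirectional LSTM -> per-token linear layer."""

    def __init__(self, vocab_size, n_tags, embedding_dim=100, hidden_dim=128,
                 n_layers=2, dropout=0.25, pad_idx=0):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embedding_dim,
                                      padding_idx=pad_idx)
        self.lstm = nn.LSTM(embedding_dim, hidden_dim, num_layers=n_layers,
                            bidirectional=True,
                            dropout=dropout if n_layers > 1 else 0.0)
        # Forward and backward hidden states are concatenated, hence * 2.
        self.fc = nn.Linear(hidden_dim * 2, n_tags)
        self.dropout = nn.Dropout(dropout)

    def forward(self, tokens):
        # tokens: [seq_len, batch_size] of token indices
        embedded = self.dropout(self.embedding(tokens))
        outputs, _ = self.lstm(embedded)       # [seq_len, batch, hidden * 2]
        return self.fc(self.dropout(outputs))  # [seq_len, batch, n_tags]


# Toy batch: 3 sentences of 7 token indices each (random, for shape checks only).
torch.manual_seed(0)
vocab_size, n_tags = 50, 5
model = BiLSTMTagger(vocab_size, n_tags)
tokens = torch.randint(1, vocab_size, (7, 3))
tags = torch.randint(1, n_tags, (7, 3))

logits = model(tokens)
criterion = nn.CrossEntropyLoss(ignore_index=TAG_PAD_IDX)
# Flatten so that every token position is one classification example.
loss = criterion(logits.reshape(-1, n_tags), tags.reshape(-1))
predicted_tags = logits.argmax(dim=-1)  # [seq_len, batch] of tag indices
```

The model emits one score vector per token, so training reduces to ordinary cross-entropy over all non-padding positions, and inference is an `argmax` over the tag dimension.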

If you find any mistakes or disagree with any of the explanations, please do not hesitate to submit an issue. I welcome any feedback, positive or negative!

Getting Started

To install PyTorch, see the installation instructions on the PyTorch website.

To install TorchText:

pip install torchtext

To install the transformers library:

pip install transformers

We'll also make use of spaCy to tokenize our data. To install spaCy, follow the instructions here, making sure to install the English models:

python -m spacy download en_core_web_sm

Tutorials

1 - BiLSTM for PoS Tagging
This tutorial covers the workflow of a PoS tagging project with PyTorch and TorchText. We'll introduce the basic TorchText concepts, such as defining how data is processed, using TorchText's datasets, and using pre-trained embeddings. Using PyTorch, we build a strong baseline model: a multi-layer bi-directional LSTM. We also show how the model can be used for inference to tag any input text.
2 - Fine-tuning Pretrained Transformers for PoS Tagging
This tutorial covers how to fine-tune a pretrained Transformer model, provided by the transformers library, by integrating it with TorchText. We use a pretrained BERT model to provide the embeddings for our input text and feed these embeddings into a linear layer that predicts the tags.
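
The "embeddings into a linear layer" idea from the second tutorial can be sketched as follows. This is a hedged illustration rather than the notebook's implementation: the class name `LinearTagHead` is hypothetical, and a small `nn.Embedding` stands in for the pretrained encoder so the example runs without downloading any weights. With the transformers library, the encoder would instead be a thin wrapper that returns a pretrained BERT model's last hidden state.

```python
import torch
import torch.nn as nn


class LinearTagHead(nn.Module):
    """Per-token linear classifier on top of an encoder's hidden states."""

    def __init__(self, encoder, hidden_size, n_tags, dropout=0.25):
        super().__init__()
        # Assumption: encoder maps [batch, seq_len] token ids to
        # [batch, seq_len, hidden_size] contextual embeddings.
        self.encoder = encoder
        self.dropout = nn.Dropout(dropout)
        self.fc = nn.Linear(hidden_size, n_tags)

    def forward(self, token_ids):
        hidden = self.encoder(token_ids)       # [batch, seq_len, hidden_size]
        return self.fc(self.dropout(hidden))   # [batch, seq_len, n_tags]


# Stand-in encoder (NOT a Transformer): an embedding table with the same
# input/output shapes a pretrained encoder wrapper would have.
torch.manual_seed(0)
hidden_size, n_tags = 32, 4
stand_in_encoder = nn.Embedding(100, hidden_size)

tagger = LinearTagHead(stand_in_encoder, hidden_size, n_tags)
token_ids = torch.randint(0, 100, (2, 6))  # 2 sentences, 6 token ids each
tag_logits = tagger(token_ids)

# Fine-tuning updates the encoder and the head together, so both sets of
# parameters are handed to the optimizer.
optimizer = torch.optim.Adam(tagger.parameters(), lr=5e-5)
```

Because the head only sees a tensor of hidden states, swapping the stand-in for a real pretrained encoder changes `hidden_size` and the tokenization, but not the tagging logic.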
