Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

The source code for my bachelor's thesis "Abstractive Summarization of Meetings"

License

NotificationsYou must be signed in to change notification settings

Bastian/Abstractive-Summarization-of-Meetings

Repository files navigation

This project contains the source code for my bachelor's thesis "Abstractive Text Summarization of Meetings".

Requirements

This project was only tested with Python 3.6 but should also work with more recent version of Python.For dependency versions, take a look at therequirements.txt file.

Execution

Preparing the data

python prepare_data.py

reads thedata.[train|dev|test].tsv files and generates 3 TFRecord data filestrain.tf_record,eval.tf_record, andtest.tf_record.These files are used for training.

Training

python main.py --run_mode=train_and_evaluate

starts the training.

Testing

python main.py --run_mode=test

can be used to calculate BLEU and ROUGE scores on the test data.It will print the results into the console and write the three filestest-inputs.txt,test-predictions.txt,test-targets.txtin the/outputs folder. These files contain the sentences in a human readable format.

Predicting

python main.py --run_mode=predict

takes the content from the/data/predict.txt file and creates two files in the output-folder:predict-inputs.txt andpredict-predictions.txt.

Credits

Data

The data from thepredict.txt anddata.[train|dev|test].tsv files is taken from theAMI Corpusand processed using theNITE XML Toolkit. The code that parses the corpuscan be found atMeeting-Parser.

License

The AMI Corpus license can be found here:AMI Meeting Corpus License.

Code

Main parts of the code are taken from the Texar examples for BERT and Transformers. They can be found underthe following links:

These examples are licensed under theApache License 2.0.Copied files contain a link to their original version in the file header. Any of my modificationsare also licensed under the same license.

Inspiration

This project was inspired by the GitHub repositoryAbstractive Summarization With Transfer Learning.This project uses no source code of the repository, though. The repository is also based on the Texar examples and thushas similar code.

License

This project is licensed under theApache License 2.0.

About

The source code for my bachelor's thesis "Abstractive Summarization of Meetings"

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp