fdlm/listening-moods

Accompanying code for our ISMIR 2020 paper on mood estimation.


This repository contains the data and code to reproduce the results in the paper

**Filip Korzeniowski**, **Oriol Nieto**, Matthew C. McCallum, Minz Won, Sergio Oramas, Erik M. Schmidt. "Mood Classification Using Listening Data", 21st International Society for Music Information Retrieval Conference, Montréal, Canada, 2020 (PDF). (Authors in bold contributed equally.)

The AllMusic Mood Subset

We provide a list of track ids from the Million Song Dataset (MSD), together with train/val/test splits and a number of input features, in this repository. All files can be found in `data`.

Note: The data files are stored on Git LFS, but you can download them here if you get any quota errors.

Meta-Data

  • Track metadata (`metadata.csv`): MSD artist id, song id, and track id. Album ids are consecutive numbers and do not point to any database. Further, we provide artist names, album names, and track names. All rows in the NumPy files correspond to this ordering.
  • AllMusic Moods (`moods.txt`): Set of mood names used in this dataset. This is a subset of all moods available on AllMusic, selected by frequency of annotations. The original IDs of these moods can be found on the official Rovi website.
  • Data Splits (`{train,val,test}_idx.npy`): NumPy arrays containing the indices of tracks used in the respective set (see the loading sketch after this list).
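For illustration, here is a minimal sketch of how these files could be loaded and the splits applied, assuming NumPy and pandas are installed and the files sit in `data` as described above:

```python
import numpy as np
import pandas as pd

# Track metadata; every NumPy file in this repo follows this row order.
metadata = pd.read_csv("data/metadata.csv")

# Mood vocabulary, one mood name per line.
with open("data/moods.txt") as f:
    moods = [line.strip() for line in f]

# Row indices of the tracks in each split.
train_idx = np.load("data/train_idx.npy")
val_idx = np.load("data/val_idx.npy")
test_idx = np.load("data/test_idx.npy")

# Metadata rows of the training tracks.
train_metadata = metadata.iloc[train_idx]
```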

Features

We provide the following features (a loading sketch follows the list):

  • Taste Profile (`tp_source.npy`): Listening-based embeddings computed using weighted alternating least-squares on the complete Taste Profile dataset.
  • Musicnn-MSD (`mcn_msd_big_source.npy`): Audio-based embeddings given by the penultimate layer of the Musicnn model on the 30-second 7digital snippets from the MSD. Here, we used the large Musicnn model trained on the MSD.
  • Musicnn-MTT (`mcn_mtt_source.npy`): Same as before, but using a smaller Musicnn model trained on the MagnaTagATune dataset.
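Since the rows of every feature file follow the ordering of `data/metadata.csv`, a feature matrix can be combined with the split indices directly; a minimal sketch using the Taste Profile embeddings:

```python
import numpy as np

# Listening-based embeddings; rows follow the order of data/metadata.csv.
features = np.load("data/tp_source.npy")

# Restrict to the training tracks via the split indices.
train_idx = np.load("data/train_idx.npy")
train_features = features[train_idx]
print(train_features.shape)  # (number of training tracks, embedding dimension)
```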

Ground Truth

For legal reasons, we cannot provide the moods from AllMusic. However, the moods for an album can be obtained from allmusic.com, for example for this Bob Dylan album. We do not encourage the research community to collect and publish the data, but if they do, we accept pull requests.

After collecting the data, make sure to bring it into a multi-hot vector format (where 1 indicates the presence of a mood, and 0 its absence) and store it as `data/mood_target.npy`. Each row should represent the ground truth for the corresponding track found in `data/metadata.csv`.
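As a minimal sketch, assuming you have collected the mood names per track into a Python list with one entry per row of `data/metadata.csv` (the variable `collected_moods` and its example values below are hypothetical), the multi-hot matrix could be built like this:

```python
import numpy as np

# Mood vocabulary used in this dataset, one name per line.
with open("data/moods.txt") as f:
    moods = [line.strip() for line in f]
mood_index = {mood: i for i, mood in enumerate(moods)}

# Hypothetical input: one list of mood names per row of data/metadata.csv.
collected_moods = [["Sad", "Melancholy"], [], ["Energetic"]]

# Multi-hot encoding: 1 marks the presence of a mood, 0 its absence.
target = np.zeros((len(collected_moods), len(moods)), dtype=np.float32)
for row, track_moods in enumerate(collected_moods):
    for mood in track_moods:
        if mood in mood_index:  # skip moods outside the selected subset
            target[row, mood_index[mood]] = 1.0

np.save("data/mood_target.npy", target)
```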

Running the experiments

The `run.py` script trains a model, reports validation results, and computes test set predictions for further evaluation. It logs the training progress to the console and to Weights & Biases. You can either create a free account or disable the corresponding lines in the script. Make sure you have all requirements installed, see `requirements.txt`.

Model hyper-parameters can be set using command line arguments. The default values correspond to the best parameters found for Taste Profile embeddings. Here are the explicit CLI calls for the two types of embeddings (listening-based and audio-based). To run on a GPU, set its id by adding `--gpu_id <GPU_ID>`:

```bash
# listening-based embeddings, e.g. taste-profile
python run.py --n_layers 4 --n_units 3909 --lr 4e-4 --dropout 0.25 --weight_decay 0.0 --feature tp

# audio-based embeddings, e.g. musicnn msd-trained embeddings
python run.py --n_layers 4 --n_units 3933 --lr 5e-5 --dropout 0.25 --weight_decay 1e-6 --feature mcn_msd_big
```

We provide the following features in this repo:

  • Taste Profile (`--feature tp`)
  • Large Musicnn trained on the Million Song Dataset (`--feature mcn_msd_big`)
  • Regular Musicnn trained on the MagnaTagATune dataset (`--feature mcn_mtt`)

You can easily add your own features by storing a NumPy file called `yourfeature_source.npy` in the `data` directory and calling the script using `--feature yourfeature`. Make sure that the rows correspond to the MSD track ids found in `msd_track_ids.txt`.
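For example, a custom feature file could be created as below; the feature name `myfeature` and the random embeddings are placeholders for your own model's output, and we assume `msd_track_ids.txt` lives in `data` like the other files:

```python
import numpy as np

# One MSD track id per line; your feature rows must follow this order.
with open("data/msd_track_ids.txt") as f:
    track_ids = [line.strip() for line in f]

# Placeholder embeddings; replace with features computed by your own model.
my_features = np.random.rand(len(track_ids), 128).astype(np.float32)
np.save("data/myfeature_source.npy", my_features)
```

Afterwards, train on it with `python run.py --feature myfeature`.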
