Pytorch Project Template

Deep Learning project template for PyTorch (multi-gpu training is supported).

Features

  • TensorBoard / wandb support
  • A background generator is used for faster data loading (see the prefetch sketch after this list)
    • On Windows, the background generator may not be supported, so if an error occurs, set use_background_generator to false in the config
  • Saving and loading of the training state and network checkpoints
    • The training state includes not only the network weights but also the optimizer state, step, and epoch.
    • A checkpoint includes only the network weights; it can be used for inference.
  • Hydra and OmegaConf are supported
  • Distributed training using Distributed Data Parallel is supported
  • Config via yaml files / easy dot-style access to config values
  • Code lint / CI
  • Code testing with pytest
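
The background-generator feature is commonly implemented as a thin DataLoader wrapper. The sketch below assumes the prefetch_generator package; the DataLoaderX class name is illustrative, not necessarily the template's exact code:

```python
# A sketch of the background-generator pattern, assuming the
# prefetch_generator package is installed. DataLoaderX is an
# illustrative name, not necessarily the template's exact class.
from prefetch_generator import BackgroundGenerator
from torch.utils.data import DataLoader


class DataLoaderX(DataLoader):
    """DataLoader whose iterator prefetches batches in a background thread."""

    def __iter__(self):
        return BackgroundGenerator(super().__iter__())
```

Prefetching the next batch on a background thread keeps the GPU fed while the current batch is still being processed.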

Code Structure

  • assets dir: icon image of Pytorch Project Template. You can remove this directory.
  • config dir: directory for config files
  • dataset dir: dataloader and dataset code is here. Also, put datasets in the meta dir.
  • model dir: model.py is for wrapping the network architecture. model_arch.py is for coding the network architecture.
  • tests dir: directory for pytest testing code. You can check your network's flow of tensors by adapting tests/model/net_arch_test.py: just copy & paste the Net_arch.forward method into net_arch_test.py and add assert statements to check the tensors (see the shape-test sketch after this list).
  • utils dir:
    • train_model.py and test_model.py are for training and testing the model for a single step.
    • utils.py is for utilities: random seed setting, dot-access hyperparameters, getting the commit hash, etc. are here.
    • writer.py is for writing logs to tensorboard / wandb.
  • trainer.py file: this is for setting up and iterating over epochs.
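
The shape test described above might look like the following sketch; the tiny network and the shapes below are placeholders, not the template's actual Net_arch:

```python
# A sketch of a tensor-shape test in the spirit of
# tests/model/net_arch_test.py. TinyNet is a placeholder network,
# not the template's actual Net_arch.
import torch
import torch.nn as nn


class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 8, kernel_size=3, padding=1)

    def forward(self, x):
        return self.conv(x)  # (B, 3, H, W) -> (B, 8, H, W)


def test_net_arch():
    net = TinyNet()
    x = torch.rand(2, 3, 16, 16)        # dummy input batch
    y = net(x)
    assert y.shape == (2, 8, 16, 16)    # check the flow of tensors
```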

Setup

Install requirements

  • python3 (3.8, 3.9, 3.10, and 3.11 are tested)
  • Write the PyTorch version you want to requirements.txt (see https://pytorch.org/get-started/)
  • pip install -r requirements.txt (a quick sanity check follows this list)
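
After installing, a quick sanity check with standard PyTorch calls (nothing template-specific) confirms the install and GPU visibility:

```python
# Post-install sanity check using standard PyTorch calls.
import torch

print(torch.__version__)          # the version pinned in requirements.txt
print(torch.cuda.is_available())  # True if a CUDA device is visible
```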

Config

  • Config is written in yaml files
    • You can choose configs via config/default.yaml. Custom configs are under config/job/ (a sketch of a config of this shape follows this list)
  • name is the name of the run.
  • working_dir is the root directory for saving checkpoints and logs.
  • device is the device for running your model. You can choose cpu or cuda.
  • data field
    • Configs for the Dataloader.
    • train_dir / test_dir are globbed with file_format for the Dataloader.
    • If divide_dataset_per_gpu is true, the original dataset is divided into sub-datasets, one per GPU. This means the size of the original dataset should be a multiple of the number of GPUs in use. If this option is false, the dataset is not divided, but the epoch count goes up in multiples of the number of GPUs.
  • train/test field
    • Configs for training options.
    • random_seed sets the python, numpy, and pytorch random seeds.
    • num_epoch is the epoch at which training ends.
    • optimizer selects the optimizer. Only the adam optimizer is supported for now.
    • dist configures Distributed Data Parallel.
      • gpus is the number of GPUs you want to use with DDP (the gpus value is used as world_size in DDP). DDP is not used when gpus is 0; all GPUs are used when gpus is -1.
      • timeout is the timeout in seconds for process interaction in DDP. When this is set to ~, the default timeout (1800 seconds) is applied in gloo mode and the timeout is turned off in nccl mode.
  • model field
    • Configs for the network architecture and options for the model.
    • You can add configs in yaml format to configure your network.
  • log field
    • Configs for logging, including tensorboard / wandb logging.
    • summary_interval and checkpoint_interval are the step and epoch intervals between training logging and checkpoint saving.
    • Checkpoints and logs are saved under working_dir/chkpt_dir and working_dir/trainer.log. Tensorboard logs are saved under working_dir/outputs/tensorboard.
  • load field
    • Loading from the wandb server is supported.
    • wandb_load_path is the Run path shown in the overview of the run. If you don't want to use wandb loading, this field should be ~.
    • network_chkpt_path is the path to the network checkpoint file. If using wandb loading, this field should be the checkpoint file name of the wandb run.
    • resume_state_path is the path to the training state file. If using wandb loading, this field should be the training state file name of the wandb run.
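
Putting these fields together, a config of this shape can be built and inspected with OmegaConf. This is a sketch that mirrors the field list above; the exact nesting in config/default.yaml may differ:

```python
# A sketch of a config shaped like the fields described above.
# The YAML mirrors this README's field list; the real
# config/default.yaml may nest things differently.
from omegaconf import OmegaConf

yaml_cfg = """
name: my_run
working_dir: /tmp/my_run
device: cuda
train:
  random_seed: 42
  num_epoch: 100
  optimizer: adam
  dist:
    gpus: -1      # -1: use all GPUs, 0: do not use DDP
    timeout: ~    # ~: backend-default timeout behavior
log:
  summary_interval: 10
  checkpoint_interval: 1
load:
  wandb_load_path: ~
  network_chkpt_path: ~
  resume_state_path: ~
"""

cfg = OmegaConf.create(yaml_cfg)
print(cfg.train.dist.gpus)     # easy dot-style access: -1
print(OmegaConf.to_yaml(cfg))  # dump the whole config
```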

Code lint

  1. pip install -r requirements-dev.txt to install development dependencies (this requires Python 3.6 or above because of black)

  2. pre-commit install to add pre-commit to the git hooks

Train

  • python trainer.py working_dir=$(pwd)
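  • Configs are managed by Hydra, so other fields from the Config section can be overridden with the same key=value syntax, e.g. python trainer.py working_dir=$(pwd) device=cpu (an illustrative override using the fields documented above)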

Inspired by
