A pure PyTorch implementation of the loss described in "Online Segment to Segment Neural Transduction" (https://arxiv.org/abs/1609.08194).
There are two versions: a normal version and a memory-efficient version. They should give the same output; please let me know if they don't.
```python
def ssnt_loss_mem(
    log_probs: Tensor,
    targets: Tensor,
    source_lengths: Tensor,
    target_lengths: Tensor,
    emit_logits: Optional[Tensor] = None,
    emit_probs: Optional[Tensor] = None,
    neg_inf: float = -1e4,
    reduction="none",
    fastemit_lambda=0,
):
    """The memory efficient implementation concatenates along the targets
    dimension to reduce wasted computation on padding positions.

    N is the minibatch size
    T is the maximum number of output labels
    S is the maximum number of input frames
    V is the vocabulary of labels
    T_flat is the summation of lengths of all output labels

    Assuming the original tensor is of shape (N, T, ...), then it should be
    reduced to (T_flat, ...). This can be obtained by using a target mask.
    For example:
        >>> target_mask = targets.ne(pad)          # (N, T)
        >>> targets = targets[target_mask]         # (T_flat,)
        >>> log_probs = log_probs[target_mask]     # (T_flat, S, V)

    Args:
        log_probs (Tensor): (T_flat, S, V) Word prediction log-probs, should
            be output of log_softmax.
        targets (Tensor): (T_flat,) target labels for all samples in the minibatch.
        source_lengths (Tensor): (N,) Length of the source frames for each
            sample in the minibatch.
        target_lengths (Tensor): (N,) Length of the target labels for each
            sample in the minibatch.
        emit_logits, emit_probs (Tensor, optional): (T_flat, S) Emission logits
            (before sigmoid) or probs (after sigmoid). If both are provided,
            logits are used.
        neg_inf (float, optional): The constant representing -inf used for
            masking. Default: -1e4
        reduction (string, optional): Specifies the reduction; supports
            mean / sum. Default: "none".
        fastemit_lambda (float, optional): Scale the emission gradient of
            emission paths to encourage low latency.
            https://arxiv.org/pdf/2010.11148.pdf Default: 0
    """
```
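A minimal usage sketch under the shapes documented above. The import path and the exact call are assumptions based on the repository name and the docstring, and the random tensors stand in for real model outputs:

```python
import torch
from ssnt_loss import ssnt_loss_mem  # assumed import path for this repository

N, T, S, V, pad = 2, 5, 7, 32, 0      # batch, max targets, max frames, vocab, pad id
targets = torch.randint(1, V, (N, T))
target_lengths = torch.tensor([5, 3])
source_lengths = torch.tensor([7, 6])
targets[1, 3:] = pad                  # pad the shorter sample

# Flatten along the target dimension as described in the docstring.
target_mask = targets.ne(pad)         # (N, T)
flat_targets = targets[target_mask]   # (T_flat,)
log_probs = torch.randn(N, T, S, V).log_softmax(-1)[target_mask]  # (T_flat, S, V)
emit_logits = torch.randn(N, T, S)[target_mask]                   # (T_flat, S)

loss = ssnt_loss_mem(
    log_probs,
    flat_targets,
    source_lengths,
    target_lengths,
    emit_logits=emit_logits,
    reduction="sum",
)
print(loss)
```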
A runnable example is provided in `example.py`:

```bash
python example.py
```
ℹ️ This is a WIP project; the implementation is still being tested.
- This implementation is based on the parallelized `cumsum` and `cumprod` operations proposed in monotonic attention. Since the alignments in SSNT and monotonic attention are almost identical, we can infer that the forward variable alpha(i,j) of SSNT can be computed similarly (see the sketch after this list).
- Run the tests with `python test.py` (requires `pip install expecttest`).
- Feel free to contact me if there are bugs in the code.
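For intuition, here is a minimal probability-space sketch of the parallelized recurrence from monotonic attention that replaces the inner loop over source positions with `cumsum` and `cumprod`. It is not this repository's implementation; `alpha_step`, its arguments, and the probability-space formulation are illustrative assumptions:

```python
import torch

def alpha_step(alpha_prev, p_i, eps=1e-10):
    """One step of the forward recursion, vectorized over source positions.

    alpha_i[j] = p_i[j] * sum_{k<=j} alpha_{i-1}[k] * prod_{l=k}^{j-1} (1 - p_i[l])
               = p_i[j] * cumprod_excl[j] * cumsum(alpha_{i-1} / cumprod_excl)[j]

    alpha_prev: (S,) forward variable for target step i-1
    p_i:        (S,) emission probabilities at target step i
    """
    one_minus = (1 - p_i).clamp(min=eps)
    # Exclusive cumulative product: [1, (1-p_0), (1-p_0)(1-p_1), ...]
    cumprod_excl = torch.cat([one_minus.new_ones(1), one_minus[:-1]]).cumprod(0)
    return p_i * cumprod_excl * torch.cumsum(alpha_prev / cumprod_excl, dim=0)
```

The actual loss presumably folds the word prediction log-probs into this recurrence and carries it out in log space (with `neg_inf` used for masking), but the parallelized structure is the same.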