👨💻
postdoc in vgg at university of oxford. researcher of multi-modal machine learning
- University of Oxford
- Oxford, UK
- robots.ox.ac.uk/~vi
- @_iashin
PinnedLoading
- video_features
video_features PublicExtract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
- Synchformer
Synchformer PublicSource code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)
- SparseSync
SparseSync PublicSource code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)
Something went wrong, please refresh the page to try again.
If the problem persists, check theGitHub status page orcontact support.
If the problem persists, check theGitHub status page orcontact support.