cross-modal
Here are 57 public repositories matching this topic...
🪩 Create Disco Diffusion artworks in one line
- Updated
May 16, 2023 - Python
Represent, send, store and search multimodal data
- Updated
Jan 13, 2026 - Python
A collection of research on knowledge graphs
- Updated
Oct 7, 2022 - JavaScript
A curated list of different papers and datasets in various areas of audio-visual processing
- Updated
Jan 30, 2024
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
- Updated
May 18, 2023 - Python
Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.
- Updated
Feb 9, 2024 - Jupyter Notebook
Remote sensing SAR-optical land-use classification in PyTorch: high-resolution remote sensing semantic segmentation, land-cover segmentation, and land-cover classification
- Updated
May 6, 2024 - Python
Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities
- Updated
Aug 27, 2025 - Python
[CVPR 2023] Referring Image Matting
- Updated
Apr 17, 2023
[NAACL 2022] Mobile Text-to-Image search powered by multimodal semantic representation models (e.g., OpenAI's CLIP)
- Updated
May 11, 2023 - Swift
BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)
- Updated
Sep 14, 2024 - Python
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)
- Updated
Nov 24, 2023 - Python
Weakly Supervised 3D Object Detection from Point Clouds (VS3D), ACM MM 2020
- Updated
Mar 24, 2023 - Jupyter Notebook
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
- Updated
Dec 2, 2022 - Python
This repository provides a comprehensive collection of research papers on multimodal representation learning, all of which are cited and discussed in the recently accepted survey: https://dl.acm.org/doi/abs/10.1145/3617833
- Updated
Jun 16, 2025
Unofficial implementation of Google DeepMind's paper `Objects that Sound`
- Updated
May 7, 2018 - Python
Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
- Updated
Oct 25, 2022 - Python
[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
- Updated
Oct 15, 2024 - Python
Unleash the Potential of Image Branch for Cross-modal 3D Object Detection [NeurIPS2023]
- Updated
Jun 4, 2024 - Python
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
- Updated
Oct 21, 2022 - Python