Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

cross-modal

Here are 57 public repositories matching this topic...

discoartdocarray

A curated list of different papers and datasets in various areas of audio-visual processing

  • UpdatedJan 30, 2024

PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)

  • UpdatedMay 18, 2023
  • Python

Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

  • UpdatedFeb 9, 2024
  • Jupyter Notebook

Remote Sensing Sar-Optical Land-use Classfication Pytorch Pytorch高分辨率遥感语义分割/地物分割/地物分类

  • UpdatedMay 6, 2024
  • Python

Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities

  • UpdatedAug 27, 2025
  • Python

[CVPR 2023] Referring Image Matting

  • UpdatedApr 17, 2023

[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)

  • UpdatedMay 11, 2023
  • Swift

BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)

  • UpdatedSep 14, 2024
  • Python

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)

  • UpdatedNov 24, 2023
  • Python

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022

  • UpdatedDec 2, 2022
  • Python

This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have been cited and discussed in the survey just acceptedhttps://dl.acm.org/doi/abs/10.1145/3617833 .

  • UpdatedJun 16, 2025

Unofficial Implementation of Google Deepmind's paper `Objects that Sound`

  • UpdatedMay 7, 2018
  • Python

Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.

  • UpdatedOct 25, 2022
  • Python

[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"

  • UpdatedOct 15, 2024
  • Python

Unleash the Potential of Image Branch for Cross-modal 3D Object Detection [NeurIPS2023]

  • UpdatedJun 4, 2024
  • Python

Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

  • UpdatedOct 21, 2022
  • Python

Improve this page

Add a description, image, and links to thecross-modal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thecross-modal topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2026 Movatter.jp