Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

multimodal

Here are 1,588 public repositories matching this topic...

anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

  • UpdatedDec 17, 2025
  • JavaScript

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

  • UpdatedAug 12, 2024
  • Python

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

  • UpdatedDec 15, 2025
  • Python
serve

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

  • UpdatedDec 15, 2025
  • TypeScript

Janus-Series: Unified Multimodal Understanding and Generation Models

  • UpdatedFeb 1, 2025
  • Python

AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording

  • UpdatedDec 9, 2025
  • TypeScript

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).

  • UpdatedDec 17, 2025
  • Python
rerun

An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data

  • UpdatedDec 17, 2025
  • Rust

SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.

  • UpdatedDec 17, 2025
  • Java
BentoML

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

  • UpdatedDec 15, 2025
  • Python

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

  • UpdatedApr 22, 2024
  • Python

notes for software engineers getting up to speed on new AI developments. Serves as datastore forhttps://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.

  • UpdatedSep 15, 2025
  • HTML

Solve Visual Understanding with Reinforced VLMs

  • UpdatedOct 21, 2025
  • Python
pyspur

A visual playground for agentic workflows: Iterate over your agents 10x faster

  • UpdatedJul 20, 2025
  • TypeScript

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

  • UpdatedApr 24, 2025
  • Python

Open-source framework for building AI-powered apps in JavaScript, Go, and Python, built and used in production by Google

  • UpdatedDec 17, 2025
  • TypeScript

A Next-Generation Training Engine Built for Ultra-Large MoE Models

  • UpdatedDec 17, 2025
  • Python
Daft

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

  • UpdatedDec 17, 2025
  • Rust

Improve this page

Add a description, image, and links to themultimodal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with themultimodal topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp