Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

training-data

Here are 233 public repositories matching this topic...

A system for quickly generating training data with weak supervision

  • UpdatedMay 2, 2024
  • Python
diffgram

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

  • UpdatedNov 18, 2024
  • Python
ydata-synthetic

skweak: A software toolkit for weak supervision applied to NLP tasks

  • UpdatedSep 2, 2024
  • Python
myvisioncompose

A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.

  • UpdatedMar 31, 2025
  • Python
augraphy

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

  • UpdatedJul 20, 2025
  • Python

Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.

  • UpdatedDec 8, 2025
  • Python
page-agent

JavaScript in-page GUI agent. Control web interfaces with natural language.

  • UpdatedDec 17, 2025
  • TypeScript

A lightweight web application for brushing labels onto time series data; useful for building training sets.

  • UpdatedMar 4, 2023
  • JavaScript

Augmenty is an augmentation library based on spaCy for augmenting texts.

  • UpdatedMay 24, 2024
  • Python

Aubo i5 Dual Arm Collaborative Robot - RealSense D435 - 3D Object Pose Estimation - ROS

  • UpdatedJun 22, 2022
  • C++

Natural Language Data Augmentation Tool for Conversational Systems

  • UpdatedDec 26, 2022
  • Python

Generating training data from the Carla driving simulator in the KITTI dataset format

  • UpdatedMay 21, 2019
  • Python

Collection of casual conversations that can be used with the Rasa Stack

  • UpdatedMay 25, 2020
  • Python

SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask prompting.

  • UpdatedNov 13, 2023

Convert all files in git repository to .txt files. Useful for training LLMs on your codebase.

  • UpdatedDec 7, 2024
  • Python

COVID-19 Coughs files for training AI models

  • UpdatedOct 13, 2020
  • Python

Improve this page

Add a description, image, and links to thetraining-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thetraining-data topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp