Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

ai-testing

Here are 48 public repositories matching this topic...

giskard-oss

Agentic testing for agentic codebases

  • UpdatedOct 13, 2025
  • TypeScript

A Python library for verifying code properties using natural language assertions.

  • UpdatedMar 1, 2025
  • Python

👁 零代码零标注 CV AI 自动化测试工具 🚀 免除大量人工画框和打标签等,直接零代码快速自动化测试 CV 计算机视觉 AI 人工智能图像识别算法:行人检测、动植物分类、人脸识别、OCR 车牌识别、旋转校正、舞蹈姿态、抠图分割 等,还可一键 下载测试报告、导出训练和测试数据集

  • UpdatedSep 23, 2025
  • JavaScript

Open-source framework for stress-testing LLMs and conversational AI. Identify hallucinations, policy violations, and edge cases with scalable, realistic simulations. Join the discord:https://discord.gg/ssd4S37WNW

  • UpdatedSep 15, 2025
  • Python

Übungsaufgaben zum Buch "Basiswissen KI-Testen"

  • UpdatedDec 20, 2024
  • Jupyter Notebook

Agent testing library that uses an agent to test your agent, in Go.

  • UpdatedApr 21, 2025
  • Go
Prompture

Prompture is an API-first library for requesting structured JSON output from LLMs (or any structure), validating it against a schema, and running comparative tests between models.

  • UpdatedOct 12, 2025
  • Python

Agent testing library that uses an agent to test your agent, in Typescript.

  • UpdatedJun 20, 2025
  • UpdatedNov 16, 2023

Public whitepaper on AI testing strategies in healthcare using prompt engineering and LLMs.

  • UpdatedAug 6, 2025

Burro is a command-line interface (CLI) tool built with Deno for evaluating Large Language Model (LLM) outputs. It provides a straightforward way to run different types of evaluations with secure API key management.

  • UpdatedJan 17, 2025
  • TypeScript

🚀 ARM64 Browser Automation for Claude Code - SaaS testing on 80 Raspberry Pi budget. The first solution that works where Playwright/Puppeteer fail on ARM64. Autonomous testing without human debugging.

  • UpdatedAug 10, 2025
  • Python

An automated approach for exploring and testing conversational agents using large language models. TRACER discovers chatbot functionalities, generates user profiles, and creates comprehensive test suites for conversational AI systems.

  • UpdatedOct 13, 2025
  • Python

Improve this page

Add a description, image, and links to theai-testing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with theai-testing topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp