ai-testing
Here are 48 public repositories matching this topic...
Language:All
Sort:Most stars
🐢 Open-Source Evaluation & Testing library for LLM Agents
- Updated
Oct 10, 2025 - Python
Agentic testing for agentic codebases
- Updated
Oct 13, 2025 - TypeScript
Deliver safe & effective language models
- Updated
Sep 27, 2025 - Python
MIT-licensed Framework for LLMs, RAGs, Chatbots testing. Configurable via YAML and integrable into CI pipelines for automated testing.
- Updated
Dec 11, 2024 - Python
GPT4Go: AI-Powered Test Case Generation for Golang 🧪
- Updated
Apr 5, 2023 - Go
A Python library for verifying code properties using natural language assertions.
- Updated
Mar 1, 2025 - Python
👁 零代码零标注 CV AI 自动化测试工具 🚀 免除大量人工画框和打标签等,直接零代码快速自动化测试 CV 计算机视觉 AI 人工智能图像识别算法:行人检测、动植物分类、人脸识别、OCR 车牌识别、旋转校正、舞蹈姿态、抠图分割 等,还可一键 下载测试报告、导出训练和测试数据集
- Updated
Sep 23, 2025 - JavaScript
Open-source framework for stress-testing LLMs and conversational AI. Identify hallucinations, policy violations, and edge cases with scalable, realistic simulations. Join the discord:https://discord.gg/ssd4S37WNW
- Updated
Sep 15, 2025 - Python
Übungsaufgaben zum Buch "Basiswissen KI-Testen"
- Updated
Dec 20, 2024 - Jupyter Notebook
Agent testing library that uses an agent to test your agent, in Go.
- Updated
Apr 21, 2025 - Go
Prompture is an API-first library for requesting structured JSON output from LLMs (or any structure), validating it against a schema, and running comparative tests between models.
- Updated
Oct 12, 2025 - Python
A CLI for testing your UI. Easy
- Updated
Oct 13, 2025 - PowerShell
Agent testing library that uses an agent to test your agent, in Typescript.
- Updated
Jun 20, 2025
- Updated
Nov 16, 2023
Integration of OpenAI with Pytest to automate API test generation.
- Updated
Jun 11, 2025 - Python
Turn plain English into Robot Framework files with AI. No dependencies, no hassle — just validated, ready-to-run tests
- Updated
Oct 12, 2025 - HTML
Public whitepaper on AI testing strategies in healthcare using prompt engineering and LLMs.
- Updated
Aug 6, 2025
Burro is a command-line interface (CLI) tool built with Deno for evaluating Large Language Model (LLM) outputs. It provides a straightforward way to run different types of evaluations with secure API key management.
- Updated
Jan 17, 2025 - TypeScript
🚀 ARM64 Browser Automation for Claude Code - SaaS testing on 80 Raspberry Pi budget. The first solution that works where Playwright/Puppeteer fail on ARM64. Autonomous testing without human debugging.
- Updated
Aug 10, 2025 - Python
An automated approach for exploring and testing conversational agents using large language models. TRACER discovers chatbot functionalities, generates user profiles, and creates comprehensive test suites for conversational AI systems.
- Updated
Oct 13, 2025 - Python
Improve this page
Add a description, image, and links to theai-testing topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theai-testing topic, visit your repo's landing page and select "manage topics."