ai-testing

Open-source framework for stress-testing LLMs and conversational AI. Identify hallucinations, policy violations, and edge cases with scalable, realistic simulations. Join the discord:https://discord.gg/ssd4S37WNW

security ai simulation chatbot ai-agents ai-testing llm-testing chatbot-simulation

UpdatedSep 15, 2025
Python

KI-Testen /Uebungen

Star13

Übungsaufgaben zum Buch "Basiswissen KI-Testen"

artificial-intelligence exercises software-testing german-language hands-on ai-testing

UpdatedDec 20, 2024
Jupyter Notebook

langwatch /scenario-go

Star6

Agent testing library that uses an agent to test your agent, in Go.

testing ai agents qa-automation ai-qa ai-testing

UpdatedApr 21, 2025
Go

jhd3197 /Prompture

Sponsor

Star6

Prompture is an API-first library for requesting structured JSON output from LLMs (or any structure), validating it against a schema, and running comparative tests between models.

openai json-validation structured-output pydantic llm prompt-engineering ai-testing prompt-testing

UpdatedOct 12, 2025
Python

Bugsterapp /bugster-cli

Star5

A CLI for testing your UI. Easy

debugging automation nextjs ui-testing cli-tool e2e-testing playwright vercel ai-testing vibe-testing

UpdatedOct 13, 2025
PowerShell

langwatch /scenario-ts

Star4

Agent testing library that uses an agent to test your agent, in Typescript.

ai-testing agent-simulations

UpdatedJun 20, 2025

Sephrim-NightShade /Questions-you-want-answers-to

Star2

ai-testing automated-responses

UpdatedNov 16, 2023

taurus5650 /open_ai_with_pytest_simple_version

Star2

Integration of OpenAI with Pytest to automate API test generation.

artificial-intelligence pytest openai api-testing software-testing automated-testing open-ai automation-testing ai-testing llm-agents ai-test-case-generator

UpdatedJun 11, 2025
Python

monkscode /Natural-Language-to-Robot-Framework

Star2

Turn plain English into Robot Framework files with AI. No dependencies, no hassle — just validated, ready-to-run tests

python docker open-source natural-language-processing selenium test-automation quality-assurance robotframework automation-framework software-testing fastapi large-language-models generative-ai ai-testing agentic-framework llm-applications nlp-to-code

UpdatedOct 12, 2025
HTML

pavankumarinfo /ai-testing-healthcare

Star2

Public whitepaper on AI testing strategies in healthcare using prompt engineering and LLMs.

quality-assurance red-teaming shift-left healthcare-ai prompt-engineering ai-testing llmops

UpdatedAug 6, 2025

thisguymartin /burro

Star2

Burro is a command-line interface (CLI) tool built with Deno for evaluating Large Language Model (LLM) outputs. It provides a straightforward way to run different types of evaluations with secure API key management.

evaluation quality-assurance deno llm ai-testing

UpdatedJan 17, 2025
TypeScript

nfodor /mcp-chromium-arm64

Star2

🚀 ARM64 Browser Automation for Claude Code - SaaS testing on 80 Raspberry Pi budget. The first solution that works where Playwright/Puppeteer fail on ARM64. Autonomous testing without human debugging.

nodejs raspberry-pi mcp arm64 browser-automation startup-tools ai-testing claude-code saas-testing budget-ai

UpdatedAug 10, 2025
Python

Chatbot-TRACER /TRACER

Star2

An automated approach for exploring and testing conversational agents using large language models. TRACER discovers chatbot functionalities, generates user profiles, and creates comprehensive test suites for conversational AI systems.

test-automation software-testing automated-testing dialogue-systems conversational-ai chatbot-testing llm ai-testing

UpdatedOct 13, 2025
Python

Improve this page

Add a description, image, and links to theai-testing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with theai-testing topic, visit your repo's landing page and select "manage topics."

Learn more

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ai-testing

Here are 48 public repositories matching this topic...

Giskard-AI /giskard-oss

langwatch /scenario

Pacific-AI-Corp /langtest

Addepto /contextcheck

tianshanghong /GPT4Go

kdunee /intentguard

TommyLemon /CVAuto

onerun-ai /onerun

KI-Testen /Uebungen

langwatch /scenario-go

jhd3197 /Prompture

Bugsterapp /bugster-cli

langwatch /scenario-ts

Sephrim-NightShade /Questions-you-want-answers-to

taurus5650 /open_ai_with_pytest_simple_version

monkscode /Natural-Language-to-Robot-Framework

pavankumarinfo /ai-testing-healthcare

thisguymartin /burro

nfodor /mcp-chromium-arm64

Chatbot-TRACER /TRACER

Improve this page

Add this topic to your repo