Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
promptfoo/promptfoo
promptfoo is a developer-friendly local tool for testing LLM applications. Stop the trial-and-error approach - start shipping secure, reliable AI apps.
Website · Getting Started · Red Teaming · Documentation · Discord
    # Install and initialize project
    npx promptfoo@latest init

    # Run your first evaluation
    npx promptfoo eval

See Getting Started (evals) or Red Teaming (vulnerability scanning) for more.
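As a rough illustration of the "simple declarative configs" mentioned above, an evaluation config might look like the sketch below. The file name `promptfooconfig.yaml`, the prompt text, the provider ID, and the test values are all assumptions chosen for illustration; the Getting Started guide has the authoritative schema.

```yaml
# promptfooconfig.yaml (illustrative sketch; all values are assumptions)
description: Translation smoke test

# Prompt template; {{language}} and {{input}} are filled from each test's vars
prompts:
  - "Translate the following text to {{language}}: {{input}}"

# Models to compare side by side (provider IDs assumed for this example)
providers:
  - openai:gpt-4o-mini

# Each test supplies variables and assertions to check against the output
tests:
  - vars:
      language: French
      input: Hello world
    assert:
      - type: contains
        value: Bonjour
```

With a file like this in the working directory, `npx promptfoo eval` would run each prompt against each provider and check every assertion.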
- Test your prompts and models with automated evaluations
- Secure your LLM apps with red teaming and vulnerability scanning
- Compare models side-by-side (OpenAI, Anthropic, Azure, Bedrock, Ollama, and more)
- Automate checks in CI/CD
- Share results with your team
Here's what it looks like in action:
It works on the command line too:
It can also generate security vulnerability reports:
- 🚀 Developer-first: Fast, with features like live reload and caching
- 🔒 Private: LLM evals run 100% locally - your prompts never leave your machine
- 🔧 Flexible: Works with any LLM API or programming language
- 💪 Battle-tested: Powers LLM apps serving 10M+ users in production
- 📊 Data-driven: Make decisions based on metrics, not gut feel
- 🤝 Open source: MIT licensed, with an active community
We welcome contributions! Check out our contributing guide to get started.
Join our Discord community for help and discussion.