• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Llm Tools
    3. Promptfoo

    Promptfoo

    Open-source CLI and library for evaluating and red-teaming LLM applications with automated testing, security vulnerability scanning, and CI/CD integration. Recently acquired by OpenAI but remains open-source.

    🌐Visit Website

    About this tool

    Overview

    Promptfoo is a CLI and library for evaluating and red-teaming LLM applications. As of March 16, 2026, Promptfoo joined OpenAI while remaining open-source under the MIT license.

    Key Features

    • Comprehensive Testing: Test prompts, agents, and RAGs with automated vulnerability scanning
    • Provider Comparison: Compare outputs across 50+ LLM providers including GPT, Claude, Gemini, and Llama
    • Red Teaming: Systematic adversarial testing to detect content policy violations, information leakage, and API misuse
    • CI/CD Integration: Automatically evaluate prompts and test for security vulnerabilities before deployment
    • Developer Friendly: Fast with live reloads and caching
    • Language Agnostic: Define test cases without writing code

    Philosophy

    The goal: test-driven LLM development, not trial-and-error. Simple, declarative test cases enable automation without heavy notebooks or extensive coding.

    Assertions and Validation

    Use assertions to compare LLM output against expected values or conditions. Validate output through:

    • Equality checks
    • JSON structure validation
    • Similarity scoring
    • Custom functions

    Use Cases

    • Automated prompt regression testing
    • Security scanning for production deployments
    • Comparing LLM provider performance
    • RAG application evaluation
    • Agent behavior validation

    Pricing

    Free and open-source under MIT license.

    Surveys

    Loading more......

    Information

    Websitewww.promptfoo.dev
    PublishedMar 18, 2026

    Categories

    1 Item
    Llm Tools

    Tags

    3 Items
    #Testing#red-teaming#Evaluation

    Similar Products

    6 result(s)
    DeepEval

    Comprehensive LLM evaluation framework offering 50+ ready-to-use metrics for RAG, agents, and chatbots, featuring G-Eval for custom criteria and multi-turn conversation evaluation with human-like accuracy.

    Ragas

    RAG Assessment framework for Python providing reference-free evaluation of RAG pipelines using LLM-as-a-judge, measuring context relevancy, context recall, faithfulness, and answer relevancy with automatic test data generation.

    ARES

    Automatic RAG Evaluation System - a framework for assessing RAG system quality through automated evaluation of retrieval relevance and generation accuracy without human labels.

    RAGAS

    Retrieval Augmented Generation Assessment framework for reference-free evaluation of RAG pipelines. RAGAS provides automated metrics for retrieval quality, context relevance, and generation faithfulness.

    RAG Evaluation Frameworks

    Comprehensive overview of frameworks and tools for evaluating RAG systems including RAGAS, TruLens, LangSmith, and ARES with metrics for retrieval quality, generation accuracy, and end-to-end performance.

    TruLens

    Open-source evaluation and tracing library for AI agents and RAG systems, combining OpenTelemetry-based tracing with trustworthy evaluations including ground truth metrics and LLM-as-a-Judge feedback for production monitoring.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies