    Arize Phoenix

    Open-source LLM tracing and evaluation tool built on OpenTelemetry, with a focus on RAG evaluation. Its automated instrumentation records the execution path of an LLM request across multiple steps.

    About this tool

    Overview

    Arize Phoenix is an open-source observability and evaluation platform for LLM applications built on the OpenTelemetry standard. It provides comprehensive tracing and evaluation capabilities for RAG systems and AI agents.

    Features

    • OpenTelemetry-Based: Built on industry-standard observability framework
    • Automated Instrumentation: Automatic tracing of supported frameworks with minimal setup code
    • RAG Evaluation: Specialized metrics for RAG pipelines
    • Trace Visualization: Detailed execution flow diagrams
    • LLM Scorers: Multiple evaluation methods including LLM-as-judge
    • Real-Time Monitoring: Production-ready observability
    • Dataset Management: Track and version evaluation datasets
    • Integration: Works with LangChain, LlamaIndex, and other frameworks
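
    To illustrate what recording the execution path of a request means, here is a minimal standard-library sketch. It does not use Phoenix or the OpenTelemetry SDK; the `Span` class, the `traced` decorator, and the placeholder `retrieve` and `generate` steps are all hypothetical names chosen for this example. Each wrapped call is stored as a span with a parent, which is the kind of nested structure a trace viewer renders.

```python
import functools
import time
from contextvars import ContextVar
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Span:
    """One recorded step; a hypothetical stand-in for an OpenTelemetry span."""
    name: str
    parent: Optional[str]
    duration_ms: float = 0.0


# Finished spans in completion order (children finish before their parent).
RECORDED: List[Span] = []
# Name of the span that is currently open, if any.
_current: ContextVar[Optional[str]] = ContextVar("current_span", default=None)


def traced(name: str) -> Callable:
    """Wrap a function so that every call is recorded as a span."""
    def decorator(fn: Callable) -> Callable:
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            span = Span(name=name, parent=_current.get())
            token = _current.set(name)
            start = time.perf_counter()
            try:
                return fn(*args, **kwargs)
            finally:
                span.duration_ms = (time.perf_counter() - start) * 1000
                _current.reset(token)
                RECORDED.append(span)
        return wrapper
    return decorator


@traced("retrieve")
def retrieve(query: str) -> List[str]:
    # Placeholder retriever; a real pipeline would query a vector store.
    return ["Phoenix is built on OpenTelemetry."]


@traced("generate")
def generate(query: str, context: List[str]) -> str:
    # Placeholder generator; a real pipeline would call an LLM.
    return f"Answer based on {len(context)} document(s)."


@traced("rag_pipeline")
def rag_pipeline(query: str) -> str:
    return generate(query, retrieve(query))


answer = rag_pipeline("What is Phoenix built on?")
for span in RECORDED:
    print(f"{span.name} (parent={span.parent}) {span.duration_ms:.3f} ms")
```

    In this sketch the decorator still has to be applied by hand. Automated instrumentation generally works by applying equivalent wrappers to a framework's own functions, so that application code does not need to be decorated.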

    Evaluation Metrics

    • Context relevance
    • Answer correctness
    • Hallucination detection
    • Response quality
    • Custom metrics support
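
    As a rough sketch of how an LLM-as-judge metric such as hallucination detection can be structured, the example below builds a prompt per record, passes it to a judge callable, and maps the reply to a label and score. It does not use Phoenix's own evaluators: `JUDGE_TEMPLATE`, `evaluate_hallucination`, and `stub_judge` are hypothetical names, and the stub replaces a real model call with a simple word-overlap check so the example runs offline.

```python
import re
from typing import Callable, Dict, List

# Hypothetical prompt template; not a template shipped by Phoenix.
JUDGE_TEMPLATE = (
    "Context:\n{context}\n\nAnswer:\n{answer}\n\n"
    "Is the answer fully supported by the context? "
    "Reply with exactly one word: factual or hallucinated."
)

SCORES = {"factual": 1.0, "hallucinated": 0.0}


def evaluate_hallucination(
    records: List[Dict[str, str]],
    judge: Callable[[str], str],
) -> List[Dict[str, object]]:
    """Ask the judge about each record and normalise its reply to a label."""
    results = []
    for record in records:
        prompt = JUDGE_TEMPLATE.format(
            context=record["context"], answer=record["answer"]
        )
        reply = judge(prompt).strip().lower()
        label = reply if reply in SCORES else "unparseable"
        # Unparseable replies get no score rather than a guessed one.
        results.append({"label": label, "score": SCORES.get(label)})
    return results


def _words(text: str) -> set:
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def stub_judge(prompt: str) -> str:
    """Offline stand-in for an LLM call (assumption: a real judge is a model).

    Replies 'factual' only when every word of the answer occurs in the context.
    """
    context_part, rest = prompt.split("\n\nAnswer:\n", 1)
    answer_text = rest.split("\n\n", 1)[0]
    context_text = context_part[len("Context:\n"):]
    supported = _words(answer_text) <= _words(context_text)
    return "factual" if supported else "hallucinated"


records = [
    {"context": "Phoenix is built on OpenTelemetry.",
     "answer": "Phoenix is built on OpenTelemetry."},
    {"context": "Phoenix is built on OpenTelemetry.",
     "answer": "Phoenix is built on Kubernetes."},
]
results = evaluate_hallucination(records, stub_judge)
for record, result in zip(records, results):
    print(record["answer"], "->", result["label"], result["score"])
```

    Because the judge is passed in as a plain callable, the same loop can serve as a custom metric: swapping the template and the label set changes what is being measured without touching the evaluation logic.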

    Use Cases

    • Debugging LLM applications
    • RAG pipeline optimization
    • Production monitoring
    • Evaluation dataset curation
    • Performance benchmarking

    Integration

    MLflow's third-party scorer framework supports Phoenix alongside RAGAS and TruLens. The platform is part of an ecosystem with 32M+ monthly PyPI downloads.

    Pricing

    Free to self-host; the source is published under the Elastic License 2.0 (ELv2). Arize also offers a commercial observability platform.

    Information

    Website: phoenix.arize.com
    Published: Mar 11, 2026

    Categories

    LLM Tools

    Tags

    #Observability #Evaluation #Opentelemetry

    Similar Products

    TruLens

    Open-source solution for evaluating and tracing AI Agents and RAG applications using feedback functions to programmatically evaluate components of execution flow. Features the RAG Triad metrics for comprehensive evaluation.

    Langtrace

    Open-source LLM observability tool built on OpenTelemetry standards. Automatically captures traces from LLM APIs, vector databases, and frameworks with support for over 30 popular providers.

    RAGAS

    Research-backed RAG evaluation framework providing metrics for context precision, recall, faithfulness, and response relevancy to objectively measure LLM application performance.

    ARES

    RAG evaluation framework that trains lightweight judges for retrieval and generation scoring, refining evaluation by training specialized LLM judges on synthetic datasets to provide more reliable, confidence-aware judgments.

    DeepEval

    Simple open-source LLM evaluation framework similar to Pytest for unit testing LLM outputs. Provides 14+ targeted metrics for RAG and fine-tuning scenarios including hallucination, faithfulness, and contextual relevancy.

    Helicone

    Open-source observability layer designed to help developers monitor and understand how their applications interact with large language models. Acts as a lightweight proxy between applications and LLM providers.
