Arize Phoenix

Open-source LLM tracing and evaluation tool built on OpenTelemetry, with a focus on RAG evaluation. Its automated instrumentation records the execution path of each LLM request across multiple steps.

Overview

Arize Phoenix is an open-source observability and evaluation platform for LLM applications built on the OpenTelemetry standard. It provides comprehensive tracing and evaluation capabilities for RAG systems and AI agents.

Features

OpenTelemetry-Based: Built on an industry-standard observability framework
Automated Instrumentation: Automatic tracing of supported frameworks with minimal setup code
RAG Evaluation: Specialized metrics for RAG pipelines
Trace Visualization: Detailed execution flow diagrams
LLM Scorers: Multiple evaluation methods including LLM-as-judge
Real-Time Monitoring: Production-ready observability
Dataset Management: Track and version evaluation datasets
Integration: Works with LangChain, LlamaIndex, and other frameworks
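
To make "records the execution path of LLM requests through multiple steps" concrete, here is a minimal sketch of span-style tracing using only the Python standard library. It is not Phoenix's API: the `traced` decorator, the `Span` record, the `RECORDED` list, and the three pipeline functions are hypothetical names chosen for illustration. In Phoenix the equivalent work is done by OpenTelemetry instrumentation rather than a hand-written decorator.

```python
import contextvars
import time
import uuid
from dataclasses import dataclass
from functools import wraps
from typing import Callable, List, Optional


@dataclass
class Span:
    """One timed step in a request, linked to its parent step (hypothetical record)."""
    name: str
    span_id: str
    parent_id: Optional[str]
    start: float
    end: float = 0.0


# Finished spans, in the order they completed (children finish before parents).
RECORDED: List[Span] = []

# Tracks the span that is currently open so nested calls can find their parent.
_current = contextvars.ContextVar("current_span", default=None)


def traced(name: str) -> Callable:
    """Wrap a function so that every call is recorded as a span."""
    def decorator(fn):
        @wraps(fn)
        def wrapper(*args, **kwargs):
            parent = _current.get()
            span = Span(
                name=name,
                span_id=uuid.uuid4().hex,
                parent_id=parent.span_id if parent else None,
                start=time.perf_counter(),
            )
            token = _current.set(span)
            try:
                return fn(*args, **kwargs)
            finally:
                span.end = time.perf_counter()
                _current.reset(token)
                RECORDED.append(span)
        return wrapper
    return decorator


@traced("retrieve")
def retrieve(query: str) -> List[str]:
    # Stand-in for a vector-store lookup.
    return ["doc about " + query]


@traced("generate")
def generate(query: str, docs: List[str]) -> str:
    # Stand-in for an LLM call.
    return f"Answer to {query!r} using {len(docs)} document(s)"


@traced("rag_pipeline")
def rag_pipeline(query: str) -> str:
    docs = retrieve(query)
    return generate(query, docs)


if __name__ == "__main__":
    rag_pipeline("tracing")
    for span in RECORDED:
        kind = "root" if span.parent_id is None else "child"
        print(f"{span.name:<14}{kind:<6}{(span.end - span.start) * 1000:.3f} ms")
```

The parent links are what allow a trace viewer to draw the execution flow as a tree: `retrieve` and `generate` both point at the `rag_pipeline` span, which has no parent.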

Evaluation Metrics

Context relevance
Answer correctness
Hallucination detection
Response quality
Custom metrics support
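
As a rough illustration of two of the metric styles listed above, the sketch below shows a code-based context relevance score and an LLM-as-judge hallucination check. It assumes nothing about Phoenix's own evaluator classes: `context_relevance`, `hallucination_label`, `JUDGE_PROMPT`, and the `judge` callable are hypothetical, and the judge is a stub standing in for a real model call.

```python
from typing import Callable, Sequence

# Hypothetical prompt template; a real judge prompt would be tuned and versioned.
JUDGE_PROMPT = (
    "Context:\n{context}\n\n"
    "Answer:\n{answer}\n\n"
    "Is the answer fully supported by the context? "
    "Reply with exactly one word: factual or hallucinated."
)

VALID_LABELS = {"factual", "hallucinated"}


def context_relevance(retrieved_ids: Sequence[str], relevant_ids: Sequence[str]) -> float:
    """Fraction of retrieved documents that appear in the labelled relevant set."""
    if not retrieved_ids:
        return 0.0
    relevant = set(relevant_ids)
    hits = sum(1 for doc_id in retrieved_ids if doc_id in relevant)
    return hits / len(retrieved_ids)


def hallucination_label(answer: str, context: str, judge: Callable[[str], str]) -> str:
    """Ask a judge model for a label and normalise its reply.

    `judge` is any callable that takes a prompt and returns text, so a real
    LLM client or a test stub can be passed in.
    """
    reply = judge(JUDGE_PROMPT.format(context=context, answer=answer))
    label = reply.strip().lower()
    # Guard against free-form replies instead of trusting the model's output.
    return label if label in VALID_LABELS else "unparseable"


if __name__ == "__main__":
    score = context_relevance(["d1", "d2", "d3", "d4"], ["d2", "d4", "d9"])
    print(f"context relevance: {score:.2f}")

    def stub_judge(prompt: str) -> str:
        # Stand-in for an LLM call; always returns the same label.
        return " Hallucinated\n"

    print(hallucination_label("Paris is in Spain.", "Paris is in France.", stub_judge))
```

Passing the judge in as a callable keeps the metric testable without network access, and the "unparseable" fallback keeps malformed model replies from being counted as either label.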

Use Cases

Debugging LLM applications
RAG pipeline optimization
Production monitoring
Evaluation dataset curation
Performance benchmarking

Integration

MLflow's third-party scorer framework supports Phoenix alongside RAGAS and TruLens. The platform is part of an ecosystem with 32M+ monthly PyPI downloads.

Pricing

Free to self-host. The Phoenix repository is distributed under the Elastic License 2.0 (ELv2) rather than Apache 2.0. Arize also offers a commercial observability platform.

Information

Website: phoenix.arize.com

Published: Mar 11, 2026

Tags

#observability #evaluation #opentelemetry

Similar Products

TruLens

Open-source evaluation and tracing library for AI agents and RAG systems, combining OpenTelemetry-based tracing with trustworthy evaluations including ground truth metrics and LLM-as-a-Judge feedback for production monitoring.

Galileo

An AI observability and evaluation platform that helps monitor and evaluate LLM outputs, RAG pipelines, and data quality, with tools for detecting hallucinations and measuring retrieval quality.

TruLens

An evaluation framework for LLM applications including RAG systems, providing observability, debugging, and guardrails. TruLens tracks retrieval quality, LLM performance, and hallucinations with detailed tracing.

Langtrace

Open-source LLM observability tool built on OpenTelemetry standards. Automatically captures traces from LLM APIs, vector databases, and frameworks with support for over 30 popular providers.

OpenLLMetry

Open-source observability for GenAI and LLM applications based on OpenTelemetry, providing AI-aware instrumentation for vector databases, LLM frameworks, and model providers.

Promptfoo

Open-source CLI and library for evaluating and red-teaming LLM applications with automated testing, security vulnerability scanning, and CI/CD integration. Recently acquired by OpenAI but remains open-source.
