Home Knowledge Base Phoenix (Arize AI)

Phoenix (Arize AI) is an open-source ML observability and LLM evaluation platform that combines embedding visualization, RAG retrieval analysis, and LLM tracing — enabling data scientists and ML engineers to diagnose why their AI systems are failing by visualizing high-dimensional data, analyzing retrieval quality, and tracing complex multi-step LLM pipelines in a unified interface.

What Is Phoenix?

Why Phoenix Matters

Core Phoenix Capabilities

Embedding Analysis:

LLM Tracing:

import phoenix as px
from phoenix.otel import register

tracer_provider = register(project_name="my-rag-app")
# Now LangChain, LlamaIndex calls are automatically traced

Evaluation Framework:

from phoenix.evals import OpenAIModel, HallucinationEvaluator

model = OpenAIModel(model="gpt-4o")
evaluator = HallucinationEvaluator(model)

results = evaluator.evaluate(
    output=response_text,
    reference=retrieved_context
)
# Returns: {"label": "hallucinated"/"grounded", "score": 0.92, "explanation": "..."}

RAG Retrieval Debugging Workflow

1. Ingest embeddings: Send query and document embeddings to Phoenix during evaluation runs. 2. Identify failing queries: Filter by low quality scores or user complaints. 3. Visualize in UMAP: Select the failing queries — if they cluster far from the relevant documents, the retriever is failing semantically. 4. Diagnose root cause: Too-large chunks? Wrong embedding model? Missing content in the knowledge base? 5. Validate fix: Re-run after the fix — embedding clusters should converge.

Phoenix vs Alternatives

FeaturePhoenixLangfuseWeights & BiasesArize (Commercial)
Embedding visualizationExcellentNoGoodExcellent
RAG debuggingExcellentGoodLimitedExcellent
LLM tracingGoodExcellentGoodExcellent
Open sourceYesYesNoNo
Local runYesYesNoNo
Eval frameworkStrongStrongLimitedStrong

Getting Started

pip install arize-phoenix
phoenix serve  # Launches UI at http://localhost:6006
import phoenix as px
px.launch_app()  # Or connect to running server

# Import your traces and embeddings for analysis
ds = px.Dataset.from_dataframe(df, schema=px.Schema(
    prediction_id_column_name="id",
    prompt_column_names=px.EmbeddingColumnNames(
        vector_column_name="query_embedding",
        raw_data_column_name="query_text"
    )
))

Phoenix is the ML observability tool that makes invisible embedding-level problems visible — by projecting high-dimensional retrieval and semantic data into inspectable visualizations, Phoenix enables AI teams to diagnose RAG failures, embedding drift, and retrieval quality issues that would otherwise require days of manual analysis to understand.

phoenixarizeobservability

Explore 500+ Semiconductor & AI Topics

From EUV lithography to CUDA optimization — search the full knowledge base or chat with our AI assistant.