Architecture — Agentic BI Portal

Overview

The Agentic BI system is organized into five distinct layers. Each layer has a single responsibility and communicates only with its immediate neighbors. This separation ensures that security policies, business logic, and UI concerns never bleed into each other.

The Golden Rule

Treat the VS Code chat participant as just the front end/orchestrator, not the place where all business logic lives. VS Code chat participants own user interaction. Tool-like capabilities and MCP servers handle reusable, agent-invokable actions.

System Diagram

Layer 1 — Chat UI

The VS Code chat participant is purely a front-end orchestrator. It owns the user interaction — collecting natural-language prompts, displaying responses, and managing conversation context.

✓Respects the model the user selected in the chat UI (never hardcodes a model)
✓Uses the model passed through the request object per VS Code LM docs
✓Delegates all business logic to the orchestration layer
✓Supports conversation memory per session
✓Can be replaced by a web UI or Teams bot without changing backend

Layer 2 — Orchestration Service

A backend API service that sits between the chat UI and all data systems. This is the single enforcement point for security, governance, and business rules.

Intent Classification

Parse the user's question to understand what they're asking: KPI lookup, trend analysis, data exploration, report generation, etc.

Source Routing

Decide whether to query Power BI semantic model or Snowflake based on the nature of the question and available metadata.

Prompt Building

Construct source-specific prompts with relevant metadata subset, few-shot examples, and business glossary context.

Validation & Policy

Validate generated queries before execution. Enforce row limits, deny dangerous operations, apply role-based restrictions.

Layer 3 — Tool Layer (MCP)

Discrete, reusable tools exposed via the Model Context Protocol. Each tool does one thing well.

Tool	Action	Returns
`list-semantic-models`	Enumerate available Power BI datasets	Model IDs, names, workspace
`get-measures`	Retrieve measures from a semantic model	Measure definitions, descriptions
`generate-dax`	Produce validated DAX skeleton	DAX query string
`generate-sql`	Produce validated SQL query	SQL query string
`execute-preview`	Run query with row limit	Result rows (capped)
`save-export`	Export to file or create report link	Export URI / report URL

Layer 4 — Data Layer

Power BI

Access via the Execute Queries REST API against semantic models. Requires Azure AD / Entra app registration with proper scopes.

✓DAX query execution
✓Row-level security preservation
✓Business measure reuse

Snowflake

Access via the SQL API — submit, poll, cancel, fetch statements. Always through a policy layer with warehouse/role mapping.

✓Raw detail-level queries
✓Exploratory analysis
✓Schema whitelisting

Layer 5 — Presentation

Every response includes multiple output types, giving the user actionable results:

✓Plain-English interpretation — what the data means
✓Executed query — the actual SQL or DAX that ran
✓Result preview — first N rows in a formatted table
✓Recommended visual — chart type suggestion with config
✓Export artifact — CSV, embedded report link, or report config

End-to-End Data Flow

Two example scenarios showing how the pipeline works in practice:

Scenario 1: KPI Query via Power BIDAX

// User asks:
"Show top 20 clients by PMPM trend"

// Step 1 — Intent Classification
→ type: "kpi_trend", entities: ["clients", "PMPM"]

// Step 2 — Source Routing
→ source: "power_bi"  // PMPM is a defined measure

// Step 3 — Prompt + Validate
→ DAX query generated and validated against policy

// Step 4 — Execute + Present
→ Table of 20 clients with PMPM values, trend chart suggestion

Scenario 2: Exploration via SnowflakeSQL

// User asks:
"How many claims were denied in Q3 by denial reason?"

// Step 1 — Intent Classification
→ type: "exploration", entities: ["claims", "denied", "Q3"]

// Step 2 — Source Routing
→ source: "snowflake"  // detail-level, no PBI measure

// Step 3 — Prompt + Validate
→ SQL query generated, validated, schema whitelisted

// Step 4 — Execute + Present
→ Table of denial reasons with counts, bar chart suggestion