What makes Vanna AI different from other Text-to-SQL tools?

MIT licensed, works with local LLMs via Ollama (no cloud API required), connects to any database, ships with production-ready web UI. Privacy + accuracy solved.

What problem does Vanna solve?

Data teams bottlenecked by SQL knowledge gap. Business stakeholders file tickets, analysts queue requests, reports take days. Ask in plain English, get SQL that runs.

How does Vanna's training/indexing work?

Train on your schema and example queries. System learns your database structure, column names, common patterns. Accuracy improves with usage. Works offline with local models.

Vanna AI: The Open-Source Text-to-SQL That Actually Works (Locally)

By Prahlad Menon Published 2026-03-04 8 min read

Updated: March 2026 — Added detailed explanation of how Vanna’s training/indexing works and comparison to Databricks Genie.

After years of half-baked demos and cloud-only solutions, we finally have a Text-to-SQL framework that delivers on all fronts: Vanna AI 2.0. It’s open-source (MIT license), works with local LLMs via Ollama, connects to virtually any database, and includes a production-ready web interface out of the box.

Why This Matters

Every data team has the same bottleneck: business stakeholders need answers, but they can’t write SQL. They file tickets. Analysts queue requests. Reports take days. Text-to-SQL promises to change this — ask a question in plain English, get SQL that runs against your database.

The problem? Most solutions either require sending your data to cloud APIs (privacy nightmare) or have such poor accuracy they’re useless in production. Vanna 2.0 solves both.

What Makes Vanna Different

1. True Local LLM Support

Vanna works with any LLM — including local models via Ollama. This means your data never leaves your infrastructure:

from vanna.integrations.ollama import OllamaLlmService

llm = OllamaLlmService(model="llama3.2")  # or codellama, mistral, etc.

Want better SQL-specific performance? Pair it with SQLCoder, Defog’s open-source model that outperforms GPT-4 on text-to-SQL benchmarks. SQLCoder-7b-2 achieves 91.4% accuracy on complex ratio queries where GPT-4 scores 80%.

2. Works With Any Database

Postgres, MySQL, SQLite, Snowflake, BigQuery, Oracle, DuckDB, ClickHouse — Vanna connects to them all. The framework abstracts database connections so you can swap backends without changing your application code:

from vanna.integrations.postgres import PostgresRunner
from vanna.integrations.sqlite import SqliteRunner

# Switch between databases with one line
sql_runner = PostgresRunner(connection_string="postgresql://...")
# or
sql_runner = SqliteRunner("./local.db")

3. Production-Ready Web UI

Unlike research projects that dump you into notebooks, Vanna 2.0 ships with a pre-built <vanna-chat> web component. Drop it into any page:

<script src="https://img.vanna.ai/vanna-components.js"></script>
<vanna-chat 
  sse-endpoint="https://your-api.com/chat"
  theme="dark">
</vanna-chat>

The UI streams responses in real-time — not just text, but interactive data tables and Plotly charts. It works on mobile, supports dark/light themes, and integrates with React, Vue, or plain HTML.

4. Enterprise Security Built-In

This is where Vanna 2.0 really shines. User identity flows through the entire system:

Row-level security: Queries automatically filtered by user permissions
Audit logs: Every query tracked per user for compliance
Rate limiting: Per-user quotas via lifecycle hooks
Your auth system: Bring your own cookies, JWTs, or OAuth tokens

class MyUserResolver(UserResolver):
    async def resolve_user(self, request_context: RequestContext) -> User:
        token = request_context.get_header('Authorization')
        user_data = self.decode_jwt(token)
        return User(
            id=user_data['id'],
            group_memberships=user_data['groups']  # Controls permissions
        )

Getting Started in 5 Minutes

Here’s a complete local setup with Ollama and SQLite:

# Install Vanna
pip install vanna

# Install Ollama (if not already)
brew install ollama
ollama pull llama3.2

from fastapi import FastAPI
from vanna import Agent
from vanna.servers.fastapi.routes import register_chat_routes
from vanna.servers.base import ChatHandler
from vanna.integrations.ollama import OllamaLlmService
from vanna.tools import RunSqlTool
from vanna.integrations.sqlite import SqliteRunner
from vanna.core.registry import ToolRegistry

app = FastAPI()

# Configure local LLM
llm = OllamaLlmService(model="llama3.2")

# Set up database connection
tools = ToolRegistry()
tools.register(RunSqlTool(sql_runner=SqliteRunner("./chinook.db")))

# Create agent
agent = Agent(llm_service=llm, tool_registry=tools)

# Add routes
chat_handler = ChatHandler(agent)
register_chat_routes(app, chat_handler)

Run with uvicorn main:app --reload, open localhost:8000, and start asking questions.

How Vanna Actually Learns Your Database

This is the critical part most tutorials gloss over: Vanna doesn’t index your actual data — it indexes metadata about your database.

When you “train” Vanna, you’re teaching it three things:

1. Schema Structure (DDL)

Table names, column names, data types, and relationships:

vn.train(ddl="""
    CREATE TABLE customers (
        id INT PRIMARY KEY,
        name VARCHAR(100),
        region VARCHAR(50),
        created_at TIMESTAMP
    )
""")

vn.train(ddl="""
    CREATE TABLE orders (
        id INT PRIMARY KEY,
        customer_id INT REFERENCES customers(id),
        total DECIMAL(10,2),
        status VARCHAR(20)
    )
""")

2. Business Context (Documentation)

Domain-specific terminology that isn’t obvious from column names:

vn.train(documentation="Region codes: NA=North America, EU=Europe, APAC=Asia Pacific")
vn.train(documentation="OTIF = On Time In Full, the percentage of orders delivered on schedule")
vn.train(documentation="A 'churned' customer has no orders in the last 90 days")

3. Example Question→SQL Pairs (Few-Shot Learning)

Real examples of how questions map to queries in your specific context:

vn.train(
    question="How many customers do we have in Europe?",
    sql="SELECT COUNT(*) FROM customers WHERE region = 'EU'"
)

vn.train(
    question="What's our OTIF rate this month?",
    sql="""SELECT 
        COUNT(CASE WHEN delivered_on_time AND delivered_complete THEN 1 END) * 100.0 / COUNT(*) 
        FROM orders WHERE order_date >= DATE_TRUNC('month', CURRENT_DATE)"""
)

The RAG Flow

Here’s what happens under the hood:

Training data gets embedded → stored in a vector database (ChromaDB, Qdrant, Milvus, etc.)
User asks a question → Vanna retrieves the 10 most relevant training items (DDL, docs, examples)
Context + question go to the LLM → generates SQL using retrieved context
Your actual row data never leaves your database — only the schema metadata

Auto-Training from INFORMATION_SCHEMA

Don’t want to manually write DDL? Vanna can extract it automatically:

# Connect to your database first
vn.connect_to_postgres(host='localhost', dbname='mydb', user='user', password='pass')

# Pull schema from INFORMATION_SCHEMA
df = vn.run_sql("SELECT * FROM INFORMATION_SCHEMA.COLUMNS")

# Generate a training plan
plan = vn.get_training_plan_generic(df)
print(plan)  # Review what it found

# Execute the training
vn.train(plan=plan)

This extracts table structures automatically, but you’ll still want to add business documentation and example queries for best results.

How This Compares to Databricks Genie

If you’re on Databricks, you might wonder: why not just use AI/BI Genie? It’s a fair question — Genie takes a fundamentally different approach.

Databricks Genie: Metadata at the Source

Genie leverages Unity Catalog, where metadata lives alongside your data:

Column descriptions are defined at table creation time
Table comments explain business context
PK/FK relationships are declared in the catalog
Synonyms and business terms can be added in a “knowledge store”

When you point Genie at a Unity Catalog table, it already knows what the columns mean — because that metadata was captured when the table was created. There’s no separate training step.

-- In Databricks, metadata is part of table definition
CREATE TABLE sales.customers (
    customer_id INT COMMENT 'Unique customer identifier',
    region STRING COMMENT 'Geographic region: NA, EU, APAC',
    ltv DECIMAL(10,2) COMMENT 'Lifetime value in USD'
) COMMENT 'Master customer table, updated nightly from CRM';

Vanna: Metadata as a Separate Layer

Vanna works with any database, but that means it can’t assume metadata exists. You build the semantic layer yourself:

Aspect	Databricks Genie	Vanna
Metadata source	Unity Catalog (built-in)	Manual training or INFORMATION_SCHEMA extraction
Setup effort	Low if catalog is well-documented	Medium — requires explicit training
Database support	Databricks only	Any SQL database
Business terms	Knowledge store (UI-based)	Documentation strings in code
Example queries	SQL examples in Genie space	`vn.train(question=..., sql=...)`
Self-improving	Learns from user feedback	Tool Memory learns from successful queries
Cost	Included in Databricks	Free (MIT license)

When to Use Which

Choose Genie if:

You’re already on Databricks
Your Unity Catalog has good column/table descriptions
You want zero-code setup for business users

Choose Vanna if:

You need to support multiple databases (Postgres, MySQL, Snowflake, etc.)
You want full control over the semantic layer
You need to run everything locally (privacy/compliance)
You’re building a custom application, not just ad-hoc analysis

The Honest Trade-Off: Setup Cost

Let’s be direct about this: Vanna requires real work that Genie doesn’t.

With Databricks Genie, if your Unity Catalog tables have good column descriptions and table comments, you’re essentially done. Point Genie at the table, and it understands what “region” means, what “LTV” stands for, and how tables relate to each other. The metadata was captured when the data was modeled.

With Vanna, you’re recreating that semantic layer from scratch:

Task	Genie	Vanna
Schema structure	Already in Unity Catalog	Extract from INFORMATION_SCHEMA or write DDL manually
Column descriptions	Already in Unity Catalog	Write documentation strings yourself
Business terminology	Add to knowledge store (UI)	Write documentation strings yourself
Example queries	Add SQL examples (UI)	Write `vn.train(question=..., sql=...)` calls
Table relationships	PK/FK in Unity Catalog	Describe in DDL or documentation

For a data warehouse with 50+ tables, this isn’t trivial. You’re looking at:

Hours of writing documentation strings
Curating dozens of example question→SQL pairs
Maintaining this as your schema evolves

This is the real cost of database flexibility. Vanna works with any database precisely because it doesn’t assume a rich metadata layer exists. You build that layer yourself.

When the Trade-Off Makes Sense

Despite the setup cost, Vanna wins in specific scenarios:

You’re not on Databricks — Postgres, MySQL, Snowflake, BigQuery, etc. don’t have Unity Catalog. Vanna is your only option for a Genie-like experience.
Compliance/privacy requirements — You need everything local. Vanna + Ollama means no data or queries leave your infrastructure.
Building a product — You’re embedding Text-to-SQL into a customer-facing application. Vanna’s MIT license and API-first design make this possible.
You already have the metadata — If you’ve built internal documentation, a data dictionary, or similar tools, you can script the training. The work isn’t wasted.

The Takeaway

Genie’s advantage is metadata at the source — if you’ve invested in documenting your Unity Catalog, you get Text-to-SQL almost for free. Vanna’s advantage is flexibility — it works anywhere, with any LLM, and you own the entire stack. But that flexibility comes with a real setup cost.

For teams not on Databricks, Vanna is the closest you’ll get to a Genie-like experience with open-source tools — just know that you’re signing up to build the semantic layer yourself.

Benchmarks: How Does It Compare?

On the BIRD benchmark, which tests real-world SQL generation:

System	Execution Accuracy
GPT-4o (API)	~72%
Contextual-SQL (local, Qwen-32B)	~73%
Vanna + SQLCoder-7b (local)	~65-70%*
Vanna + GPT-4 (API)	~70-75%*

*Varies by training data and schema complexity

The key insight: local models with good context (RAG training) approach cloud API performance. Contextual AI’s research shows that inference-time scaling — generating multiple candidate queries and selecting the best one — helps local models compete with larger API models.

Alternative: DataLine for Quick Analysis

If you want a simpler, GUI-first approach, check out DataLine. It’s a standalone app (Docker or binary) that provides chat-based data analysis with automatic chart generation. Privacy-focused by default — it can hide your actual data from LLMs while still generating accurate SQL.

docker run -p 7377:7377 -v dataline:/home/.dataline ramiawar/dataline:latest

DataLine is better for ad-hoc exploration; Vanna is better for building production applications.

The Bottom Line

Text-to-SQL has crossed the threshold from “interesting demo” to “actually useful.” Vanna 2.0 gives you:

Privacy: Run everything locally with Ollama
Flexibility: Any LLM, any database
Production-ready: Real UI, real security, real scalability
MIT license: Use it however you want

For teams drowning in SQL tickets, this is the escape hatch. Your analysts become force multipliers instead of query machines, and business users get self-service access to data insights.

The code is on GitHub: github.com/vanna-ai/vanna

References: