Any Scale, One Engine¶
Alopex DB provides a spectrum of deployment topologies, from a single embedded library to a globally distributed cluster. Start small and grow to any scale without changing your data model or application code.
Deployment Spectrum¶
graph LR
W[WASM Viewer] -.->|Read-only| E[Embedded]
E --> S[Single-Node]
S --> R[Replicated]
R --> D[Distributed]
style W fill:#152C4A,color:#fff
style E fill:#1E3A5F,color:#fff
style S fill:#2E5077,color:#fff
style R fill:#3D6A99,color:#fff
style D fill:#5FB4C9,color:#fff

| Topology | Scaling | HA | Writes | Use Case |
|---|---|---|---|---|
| WASM Viewer | - | No | No | Browser data exploration |
| Embedded | Vertical | No | Yes | Edge, Mobile, CLI |
| Single-Node | Vertical | No | Yes | Dev, Small workloads |
| Replicated | Vertical | Yes | Yes | HA without sharding |
| Distributed | Horizontal | Yes | Yes | Large-scale production |
Embedded Mode¶
The simplest deployment—Alopex runs as a library within your application process.
graph LR
subgraph "Your Application Process"
APP[Application Code]
ALOPEX[Alopex Library]
DATA[(Data Files)]
end
APP <--> ALOPEX
ALOPEX <--> DATA
Characteristics¶
- Zero network overhead: Direct function calls
- Single-directory storage: All data files live under one directory
- No separate process: Embedded in your application
- ACID transactions: Full durability guarantees
Use Cases¶
- Mobile Applications: Local-first data storage
- Local RAG Systems: On-device embeddings
- Edge Devices: IoT and edge computing
- CLI Tools: Self-contained utilities
Example¶
use alopex_embedded::{Config, Database};

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let db = Database::open(Config::default().path("./data"))?;
    // Use like SQLite, but with vectors!
    db.execute("CREATE TABLE docs (id TEXT, vec VECTOR(384))")?;
    Ok(())
}
Single-Node Mode¶
A standalone server that accepts connections from multiple clients.
graph LR
subgraph "Clients"
C1[Client 1]
C2[Client 2]
C3[Client 3]
end
subgraph "Server"
SRV[Alopex Server]
DATA[(Data Files)]
end
C1 <-->|HTTP/gRPC| SRV
C2 <-->|HTTP/gRPC| SRV
C3 <-->|HTTP/gRPC| SRV
SRV <--> DATA
Characteristics¶
- Network protocol: HTTP REST and gRPC
- Multi-client: Concurrent connections
- Postgres wire protocol: Compatible with existing tools
- Connection pooling: Efficient resource usage
Use Cases¶
- Microservices: Backend data store
- Development: Local development environment
- Testing: Integration test databases
- Small Production: Single-server workloads
Example¶
# Start the server
alopex-server --port 5432 --data ./data
# Connect with any Postgres client
psql -h localhost -p 5432 -d alopex
-- Works with standard SQL clients
CREATE TABLE products (
id SERIAL PRIMARY KEY,
name TEXT,
embedding VECTOR(1536)
);
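For the vector side over the same connection, a hedged sketch: the pgvector-style `<->` distance operator below is an assumption about Alopex's SQL surface, not a documented operator.

```sql
-- Assumption: pgvector-style <-> distance operator.
-- $1 is a client-bound 1536-dimension query vector.
SELECT id, name
FROM products
ORDER BY embedding <-> $1
LIMIT 5;
```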
Replicated Mode¶
Primary-replica topology for high availability without horizontal sharding.
graph TB
subgraph "Clients"
C1[Client 1]
C2[Client 2]
end
subgraph "Cluster"
P[Primary]
R1[Replica 1]
R2[Replica 2]
end
C1 -->|Write| P
C2 -->|Read| R1
P -.->|Replicate| R1
P -.->|Replicate| R2
Characteristics¶
- Primary-replica: Single writer, multiple readers
- Automatic failover: Replica promotion on primary failure
- Read scaling: Distribute read load across replicas
- Simpler operations: No sharding complexity
Use Cases¶
- High Availability: Fault tolerance without sharding
- Read-heavy workloads: Scale reads horizontally
- Low-latency reads: Local replicas near users
- Stepping stone: Before full distribution
Replication Modes¶
| Mode | Durability | Latency |
|---|---|---|
| Sync | Strong (no data loss) | Higher |
| Semi-sync | At least 1 replica | Balanced |
| Async | Eventual | Lowest |
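A configuration sketch for picking a mode; only --mode and --primary appear elsewhere in these docs, so the --replication flag below is a hypothetical name:

```bash
# Hypothetical flag (assumption): choose the durability/latency
# trade-off from the table above when starting the primary.
alopex-server --mode primary --port 5432 --data ./primary \
  --replication semi-sync
```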
Example¶
# Start primary
alopex-server --mode primary --port 5432 --data ./primary
# Start replicas
alopex-server --mode replica --primary localhost:5432 --port 5433
alopex-server --mode replica --primary localhost:5432 --port 5434
# Check replication status
alopex-cli replication status
-- Route reads to replicas
SET alopex.read_from = 'replica';
SELECT * FROM products WHERE id = 123;
-- Writes always go to primary
INSERT INTO products (name) VALUES ('New Product');
Distributed Mode¶
A horizontally scalable cluster with range sharding and Raft consensus.
graph TB
subgraph "Clients"
C1[Client 1]
C2[Client 2]
end
subgraph "Cluster"
subgraph "Range A"
N1[Node 1 - Leader]
N2[Node 2 - Follower]
end
subgraph "Range B"
N3[Node 3 - Leader]
N4[Node 4 - Follower]
end
end
C1 <--> N1
C2 <--> N3
N1 <-.->|Raft| N2
N3 <-.->|Raft| N4
N1 <-.->|Chirps| N3
Characteristics¶
- Horizontal scaling: Add nodes for more capacity
- High availability: Automatic failover with Raft
- Range sharding: Data distributed across nodes
- Chirps mesh: QUIC-based cluster communication
Use Cases¶
- High-Availability Production: Mission-critical workloads
- Global Distribution: Multi-region deployments
- Massive Scale: Petabyte-scale data
Architecture¶
Range Sharding¶
Data is partitioned into ranges and distributed:
Range A: [a-m) → Node 1 (Leader), Node 2 (Follower)
Range B: [m-z) → Node 3 (Leader), Node 4 (Follower)
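To make routing concrete, here is an illustrative sketch (not Alopex's actual code) of how a client or gateway might map a key to its range leader by binary-searching the sorted range start keys:

```rust
/// Illustrative only, not Alopex's routing code: `ranges` holds
/// (start_key, leader_addr) pairs sorted by start_key.
fn route<'a>(ranges: &'a [(&'a str, &'a str)], key: &str) -> &'a str {
    // Find the last range whose start key is <= the lookup key.
    let idx = ranges
        .partition_point(|(start, _)| *start <= key)
        .saturating_sub(1);
    ranges[idx].1
}

fn main() {
    let ranges = [("a", "node1:7000"), ("m", "node3:7000")];
    assert_eq!(route(&ranges, "fox"), "node1:7000"); // [a-m) -> Node 1
    assert_eq!(route(&ranges, "wolf"), "node3:7000"); // [m-z) -> Node 3
}
```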
Raft Consensus¶
Each range has a Raft group for replication:
sequenceDiagram
participant C as Client
participant L as Leader
participant F1 as Follower 1
participant F2 as Follower 2
C->>L: Write Request
L->>F1: Replicate
L->>F2: Replicate
F1-->>L: ACK
F2-->>L: ACK
L-->>C: Success (majority confirmed)
Example¶
# Start a 3-node cluster
alopex-cluster init --nodes 3 --data ./cluster
# Join nodes
alopex-cluster join --seed node1:7000
# Check cluster status
alopex-cluster status
Migration Between Topologies¶
One of Alopex's key features is seamless migration between deployment topologies.
Embedded → Single-Node¶
# Export embedded data
alopex-cli export --from ./embedded_data --to ./export.parquet
# Import into server
alopex-server import --from ./export.parquet
Single-Node → Replicated¶
# Add replicas to existing server
alopex-server --mode replica --primary localhost:5432 --port 5433
# Promote to HA mode
alopex-cli replication enable --replicas 2
Replicated → Distributed¶
# Initialize sharding on existing replicated cluster
alopex-cluster init --from primary:5432 --shards 4
# Enable range distribution
alopex-cli sharding enable
Zero-Downtime Migration
For production migrations, use the --live flag to replicate data to the new topology while the current deployment keeps serving traffic.
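A sketch of what that might look like; combining --live with the sharding migration shown above is an assumption, not a documented invocation:

```bash
# Hypothetical (assumption): shard the existing cluster while it keeps
# serving traffic; only the --live flag itself is mentioned above.
alopex-cluster init --from primary:5432 --shards 4 --live
```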
WASM Mode (Read-Only Viewer)¶
A browser-based read-only viewer for exploring pre-built database snapshots.
graph LR
subgraph "Browser Environment"
APP[JavaScript App]
WASM[Alopex WASM Module]
IDB[(IndexedDB Cache)]
end
subgraph "Server"
SNAP[DB Snapshot]
end
APP <--> WASM
WASM <--> IDB
SNAP -->|Fetch| WASM
Characteristics¶
- Read-only: SELECT queries only, no writes
- Pre-built snapshots: Loads server-generated SSTable files
- Offline capable: IndexedDB caching for offline access
- Lightweight: < 1MB WASM binary (gzipped)
Use Cases¶
- Data Exploration: Browse datasets in the browser
- Documentation: Interactive examples
- Offline Viewing: Cached snapshots work offline
- Vector Search Demo: Client-side similarity search
Example¶
import { AlopexViewer } from '@alopex-db/wasm';
// Load snapshot from URL
const viewer = await AlopexViewer.loadSnapshot(
'https://example.com/data/snapshot.alopex'
);
// SQL SELECT only
const results = await viewer.query(
'SELECT * FROM products WHERE category = ?',
['electronics']
);
// Vector search (Flat algorithm)
const similar = await viewer.vectorSearch({
table: 'documents',
queryVector: embedding,
similarity: 'cosine',
limit: 10
});
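Because snapshots cache in IndexedDB, a viewer can degrade gracefully when offline. A sketch using only the `loadSnapshot` call shown above plus standard browser APIs; the assumption that a retry is served from the cache is not confirmed by these docs:

```typescript
import { AlopexViewer } from '@alopex-db/wasm';

// Sketch: retry on fetch failure, assuming (unconfirmed) that the
// IndexedDB-cached copy satisfies the retry when offline.
async function openViewer(url: string) {
  try {
    return await AlopexViewer.loadSnapshot(url);
  } catch (err) {
    if (!navigator.onLine) {
      console.warn('Offline; retrying from cached snapshot', err);
      return await AlopexViewer.loadSnapshot(url);
    }
    throw err;
  }
}
```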
Limitations¶
| Feature | Supported |
|---|---|
| SELECT queries | Yes |
| INSERT/UPDATE/DELETE | No |
| Transactions | No |
| Vector Search (Flat) | Yes |
| Vector Search (HNSW) | No |
| IndexedDB caching | Yes |
Choosing the Right Topology¶
graph TD
START[Start Here] --> Q1{Need network access?}
Q1 -->|No| EMBEDDED[Embedded]
Q1 -->|Yes| Q2{Need high availability?}
Q2 -->|No| SINGLE[Single-Node]
Q2 -->|Yes| Q3{Data fits on one node?}
Q3 -->|Yes| REPLICATED[Replicated]
Q3 -->|No| DISTRIBUTED[Distributed]
EMBEDDED --> USE1[Mobile, Edge, CLI]
SINGLE --> USE2[Dev, Small Prod]
REPLICATED --> USE3[HA, Read-heavy]
DISTRIBUTED --> USE4[Scale, Multi-region]
Decision Matrix¶
| Requirement | WASM | Embedded | Single | Replicated | Distributed |
|---|---|---|---|---|---|
| Browser runtime | Yes | No | No | No | No |
| Write operations | No | Yes | Yes | Yes | Yes |
| No network | Partial | Yes | No | No | No |
| Multi-client | No | No | Yes | Yes | Yes |
| High availability | No | No | No | Yes | Yes |
| Read scaling | No | No | No | Yes | Yes |
| Write scaling | No | No | No | No | Yes |
| Data > 1 node | No | No | No | No | Yes |
Next Steps¶
- Chirps - Cluster messaging layer
- Vector Search - Learn about vector operations
- Architecture - Deep dive into internals