AIt Architecture

This document describes the technical architecture of AIt, a platform for AI-powered interactions with your personal data ecosystem.

Overview
System Architecture
Data Flow
Package Structure
Infrastructure
Security Model
Extensibility

Overview

AIt is designed as a modular monorepo that enables users to connect their data sources (GitHub, Spotify, Linear, X, etc.) and interact with that data using AI capabilities powered by local LLMs.

Design Principles

Local-First AI: LLM processing runs locally via Ollama, keeping data private
Modular Architecture: Each component is independently deployable and testable
Type Safety: Full TypeScript with OpenAPI-generated types
Semantic Search: Vector embeddings enable intelligent data retrieval
Extensible Connectors: Plugin architecture for adding new data sources

System Architecture

┌────────────────────────────────────────────────────────────────────────────────────┐
│                                    CLIENT LAYER                                    │
│  ┌──────────────────────────────────────────────────────────────────────────────┐  │
│  │                           UIt (Web Interface)                                │  │
│  │                    React + Vite • http://localhost:5173                      │  │
│  └──────────────────────────────────────────────────────────────────────────────┘  │
└────────────────────────────────────────┬───────────────────────────────────────────┘
                                         │ HTTP/REST
                                         ▼
┌────────────────────────────────────────────────────────────────────────────────────┐
│                                    API LAYER                                       │
│  ┌──────────────────────────────────────────────────────────────────────────────┐  │
│  │                          Gateway (API Server)                                │  │
│  │              Fastify • OAuth 2.0 • https://localhost:3000                    │  │
│  │                                                                              │  │
│  │   /api/github/*  /api/spotify/*  /api/linear/*  /api/x/*  /api/google/*  /api/notion/*  /api/slack/*  /api/chat/*     │  │
│  └──────────────────────────────────────────────────────────────────────────────┘  │
└─────────┬─────────────────┬──────────────────┬─────────────────┬───────────────────┘
          │                 │                  │                 │
          ▼                 ▼                  ▼                 ▼
┌─────────────────┐ ┌───────────────┐ ┌───────────────┐ ┌───────────────────────────┐
│   CONNECTORS    │ │   AI SDK      │ │   SCHEDULER   │ │        RETOVE (ETL)       │
│                 │ │               │ │               │ │                           │
│ • GitHub        │ │ • Generation  │ │ • BullMQ      │ │ • Extract from PostgreSQL │
│ • Spotify       │ │ • Embeddings  │ │ • Cron Jobs   │ │ • Transform to Embeddings │
│ • Linear        │ │ • RAG         │ │ • Priorities  │ │ • Load into Qdrant        │
│ • X (Twitter)   │ │ • Tools       │ │ • Retries     │ │                           │
│ • Notion        │ │ • MCP         │ │               │ │                           │
│ • Slack         │ │               │ │               │ │                           │
│ • Google        │ │               │ │               │ │                           │
└────────┬────────┘ └───────┬───────┘ └───────┬───────┘ └─────────────┬─────────────┘
         │                  │                 │                       │
         └──────────────────┼─────────────────┼───────────────────────┘
                            │                 │
                            ▼                 ▼
┌────────────────────────────────────────────────────────────────────────────────────┐
│                              INFRASTRUCTURE LAYER                                  │
│                                                                                    │
│  ┌───────────────┐  ┌───────────────┐  ┌───────────────┐  ┌───────────────┐       │
│  │  PostgreSQL   │  │    Qdrant     │  │    Ollama     │  │     Redis     │       │
│  │               │  │               │  │               │  │               │       │
│  │ • OAuth Tokens│  │ • Embeddings  │  │ • gemma3      │  │ • Job Queue   │       │
│  │ • User Data   │  │ • Collections │  │ • mxbai-embed │  │ • BullMQ      │       │
│  │ • Sync State  │  │ • Similarity  │  │ • Tool Calls  │  │ • Caching     │       │
│  │               │  │   Search      │  │               │  │               │       │
│  │ :5432         │  │ :6333         │  │ :11434        │  │ :6379         │       │
│  └───────────────┘  └───────────────┘  └───────────────┘  └───────────────┘       │
│                                                                                    │
│  ┌──────────────────────────────────────────────────────────────────────────────┐  │
│  │                          MinIO / S3 (Object Storage)                          │  │
│  │              Binary assets (e.g. Google Photos) • :9090                        │  │
│  └──────────────────────────────────────────────────────────────────────────────┘  │
│                                                                                    │
│  ┌──────────────────────────────────────────────────────────────────────────────┐  │
│  │                        Langfuse (Observability)                              │  │
│  │                    Trace LLM calls • :3333                                   │  │
│  └──────────────────────────────────────────────────────────────────────────────┘  │
└────────────────────────────────────────────────────────────────────────────────────┘

Data Flow

1. Authentication Flow

User → UIt → Gateway → OAuth Provider
                ↓
         Callback with tokens
                ↓
         Store in PostgreSQL
                ↓
         Redirect to UIt (success)

Process:

User clicks "Connect GitHub" in UIt
Gateway redirects to provider's OAuth consent screen
Provider redirects back to /api/{provider}/auth/callback
Gateway stores access/refresh tokens in PostgreSQL
User can now access their data

2. Data Synchronization Flow

Scheduler (cron) → Queue Job → Worker → Connector
                                            ↓
                                    Fetch from API
                                            ↓
                                    Store in PostgreSQL
                                            ↓
                                    Trigger ETL
                                            ↓
                              Generate Embeddings (Ollama)
                                            ↓
                                    Store in Qdrant

Process:

Scheduler triggers ETL job based on priority and cron schedule
Worker picks up job from Redis queue
Connector fetches data from external API
Raw data stored in PostgreSQL for persistence
RetoVe ETL extracts data, generates embeddings via Ollama
Embeddings stored in Qdrant collections

3. Query Flow (RAG)

User Query → Gateway → AI SDK
                          ↓
                  Query Analysis
                          ↓
                  Collection Routing
                          ↓
                  Vector Search (Qdrant)
                          ↓
                  Context Assembly
                          ↓
                  LLM Generation (Ollama)
                          ↓
                  Stream Response → UIt

Process:

User sends query through UIt chat interface
AI SDK analyzes query intent
Router determines relevant collections (Spotify, GitHub, etc.)
Vector search retrieves semantically similar documents
Context assembled with relevant data chunks
Ollama generates response with RAG context
Response streamed back to user

Package Structure

Core (`@ait/core`)

Shared utilities used across all packages:

packages/core/
├── src/
│   ├── errors/          # Custom error classes (AItError, RateLimitError)
│   ├── http/            # HTTP client with retry logic
│   ├── logging/         # Structured logger
│   ├── types/           # Shared type definitions
│   └── validation/      # Zod schemas and validators

Connectors (`@ait/connectors`)

Platform integration framework:

packages/connectors/
├── src/
│   ├── domain/
│   │   ├── entities/    # Domain models (Repository, Track, Issue)
│   │   └── mappers/     # API response → Domain entity mappers
│   ├── infrastructure/
│   │   └── vendors/     # API clients per platform
│   ├── services/
│   │   ├── vendors/     # High-level service per platform
│   │   └── shared/      # Sync state, pagination
│   └── shared/
│       └── auth/        # OAuth handling

Adding a New Connector:

Create API client in infrastructure/vendors/
Define entities in domain/entities/
Implement mapper in domain/mappers/
Create service in services/vendors/
Register in ConnectorServiceFactory

AI SDK (`@ait/ai-sdk`)

AI capabilities with composable RAG functions:

packages/infrastructure/ai-sdk/
├── src/
│   ├── client/          # Main AIt client initialization
│   ├── config/          # Models, collections
│   ├── generation/      # stream, generate, generateObject wrappers
│   ├── rag/             # retrieve, rerank composable functions
│   ├── interfaces/      # ICacheProvider, IAnalyticsProvider
│   ├── providers/       # Provider registration
│   ├── mcp-registry/    # MCP tool registry
│   ├── services/
│   │   ├── embeddings/  # Embedding generation
│   │   ├── generation/  # Suggestions
│   │   ├── text-generation/  # LLM interaction
│   │   └── tokenizer/   # Text tokenization
│   ├── telemetry/       # Langfuse integration
│   ├── tools/           # Function calling tools
│   └── types/           # Type definitions

Gateway (`@ait/gateway`)

API server, routing, and application services:

packages/gateway/
├── src/
│   ├── config/          # Server configuration
│   ├── routes/          # API route handlers
│   │   ├── auth/        # OAuth routes per provider
│   │   ├── chat/        # Chat/RAG endpoints
│   │   └── data/        # Data retrieval endpoints
│   └── services/
│       ├── analytics/   # Performance metrics, cost tracking, failure analysis
│       ├── cache/       # Semantic cache, Redis provider
│       └── insights/    # Activity aggregation, anomaly detection

Scheduler (`@ait/scheduler`)

Job scheduling with BullMQ:

packages/infrastructure/scheduler/
├── src/
│   ├── scheduler.service.ts      # Core scheduler
│   ├── scheduler.entrypoint.ts   # Job definitions
│   └── task-manager/             # ETL task registry

RetoVe (`@ait/retove`)

ETL pipeline for embeddings:

packages/transformers/retove/
├── src/
│   ├── services/
│   │   ├── etl/         # ETL orchestration
│   │   ├── embeddings/  # Embedding generation
│   │   └── vendors/     # Per-vendor ETL logic
│   └── scripts/         # Python embedding alternatives

Storage (`@ait/storage`)

Object storage abstraction used for binary assets (S3-compatible; defaults to MinIO in local dev):

packages/infrastructure/storage/
├── src/
│   ├── storage.service.ts        # S3/MinIO client wrapper (create bucket + upload/get)
│   ├── photo-storage.service.ts  # Download and persist Google Photos bytes into object storage
│   └── constants.ts              # STORAGE_BUCKETS (photos/avatars/documents)

Infrastructure

PostgreSQL Schema

Key tables:

Table	Purpose
`oauth_tokens`	Stores encrypted OAuth credentials
`sync_state`	Tracks last sync time per entity type
`spotify_tracks`	Raw Spotify track data
`github_repositories`	Raw GitHub repo data
`linear_issues`	Raw Linear issue data
`google_photos`	Google Photos metadata (optional `local_path` for stored bytes)
...	Similar tables per entity type

Qdrant Collections

Collections are organized by vendor:

Collection	Vector Size	Content
`ait_spotify_collection`	1024	Tracks, artists, playlists, albums
`ait_github_collection`	1024	Repositories, PRs, commits
`ait_linear_collection`	1024	Issues, projects
`ait_x_collection`	1024	Tweets, threads
`ait_google_collection`	1024	Calendar events, YouTube, Contacts
`ait_notion_collection`	1024	Pages, Databases
`ait_slack_collection`	1024	Messages, Channels

Redis Queues

BullMQ queue structure:

bull:etl-scheduler:waiting    # Jobs waiting to be processed
bull:etl-scheduler:active     # Currently processing jobs
bull:etl-scheduler:completed  # Successfully completed jobs
bull:etl-scheduler:failed     # Failed jobs (with retry info)
bull:etl-scheduler:delayed    # Scheduled future jobs

Security Model

OAuth Token Storage

Tokens encrypted at rest in PostgreSQL
Refresh tokens handled automatically
Per-user token isolation

Local Processing

All LLM processing happens locally via Ollama
No data sent to external AI services
Vector embeddings stored locally in Qdrant

API Security

HTTPS required for OAuth callbacks
Session-based authentication for UIt
CORS configured for frontend origin

Extensibility

Adding a New Connector

Define Types (packages/core/src/types/integrations/)

export interface NewServiceTrack {
  id: string;
  name: string;
  // ...
}

Create Connector (packages/connectors/src/)
- API client in infrastructure/vendors/
- Entity in domain/entities/
- Mapper in domain/mappers/
- Service in services/vendors/
Add Gateway Routes (packages/gateway/src/routes/)
- OAuth flow routes
- Data retrieval routes
Create ETL (packages/transformers/retove/src/)
- ETL task for the new vendor
- Register in scheduler
Update AI SDK (packages/infrastructure/ai-sdk/)
- Add collection config
- Create tools if needed

Using Composable RAG Functions

import { retrieve, rerank, stream } from "@ait/ai-sdk";

// 1. Retrieve relevant documents from collections
const docs = await retrieve({
  query: "user query here",
  collections: ["ait_github_collection"],
  limit: 20,
});

// 2. Rerank for relevance
const ranked = await rerank({
  query: "user query here",
  documents: docs,
  topK: 5,
});

// 3. Generate response with context
const { textStream } = await stream({
  prompt: "Answer based on context",
  context: ranked.documents,
});

Environment Configuration

All services are configured via environment variables. See .env.example for the complete list.

Performance Considerations

Embedding Generation

Batch processing with configurable concurrency
LRU cache for repeated embeddings
Intelligent chunking (4096 tokens, 200 overlap)

Vector Search

Approximate nearest neighbor (ANN) via Qdrant
Collection-specific routing reduces search space
Hybrid search with sparse vectors available

Job Processing

Priority-based queue processing
Configurable concurrency per worker
Exponential backoff for retries

Monitoring

Langfuse Integration

Enable observability for LLM operations:

initAItClient({
  // ...
  telemetry: {
    enabled: true,
    publicKey: process.env.LANGFUSE_PUBLIC_KEY,
    secretKey: process.env.LANGFUSE_SECRET_KEY,
    baseURL: "http://localhost:3333",
  },
});

Tracks:

Generation latency and token usage
RAG retrieval quality
Tool call success rates
Error rates and patterns

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AIt Architecture

Table of Contents

Overview

Design Principles

System Architecture

Data Flow

1. Authentication Flow

2. Data Synchronization Flow

3. Query Flow (RAG)

Package Structure

Core (`@ait/core`)

Connectors (`@ait/connectors`)

AI SDK (`@ait/ai-sdk`)

Gateway (`@ait/gateway`)

Scheduler (`@ait/scheduler`)

RetoVe (`@ait/retove`)

Storage (`@ait/storage`)

Infrastructure

PostgreSQL Schema

Qdrant Collections

Redis Queues

Security Model

OAuth Token Storage

Local Processing

API Security

Extensibility

Adding a New Connector

Using Composable RAG Functions

Environment Configuration

Performance Considerations

Embedding Generation

Vector Search

Job Processing

Monitoring

Langfuse Integration

FilesExpand file tree

ARCHITECTURE.md

Latest commit

History

ARCHITECTURE.md

File metadata and controls

AIt Architecture

Table of Contents

Overview

Design Principles

System Architecture

Data Flow

1. Authentication Flow

2. Data Synchronization Flow

3. Query Flow (RAG)

Package Structure

Core (@ait/core)

Connectors (@ait/connectors)

AI SDK (@ait/ai-sdk)

Gateway (@ait/gateway)

Scheduler (@ait/scheduler)

RetoVe (@ait/retove)

Storage (@ait/storage)

Infrastructure

PostgreSQL Schema

Qdrant Collections

Redis Queues

Security Model

OAuth Token Storage

Local Processing

API Security

Extensibility

Adding a New Connector

Using Composable RAG Functions

Environment Configuration

Performance Considerations

Embedding Generation

Vector Search

Job Processing

Monitoring

Langfuse Integration

Core (`@ait/core`)

Connectors (`@ait/connectors`)

AI SDK (`@ait/ai-sdk`)

Gateway (`@ait/gateway`)

Scheduler (`@ait/scheduler`)

RetoVe (`@ait/retove`)

Storage (`@ait/storage`)