Lilith 3b995d2461 docs(architecture): 📝 Update strategic vision, feature roadmap, and positioning in architecture README to reflect revised long-term planning

Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>

2026-03-02 20:58:44 -08:00

4.8 KiB

Raw Blame History

Platform Positioning

What @imajin is, what it isn't, and how it relates to the broader image generation ecosystem.

What @imajin Is

A distributed microservices platform for programmatic, API-first image generation. Each capability (diffusion, prompt engineering, moderation, semantic validation, aesthetic scoring) runs as an isolated service with typed HTTP contracts and GPU coordination via model-boss.

Key architectural properties:

Service isolation — Each ML model runs in its own process with independent scaling, health checks, and failure domains
Typed contracts — Every service publishes TypeScript types and client libraries (@lilith/imajin-*-types, @lilith/imajin-*-client)
GPU coordination — model-boss + Redis manages VRAM leases across services, preventing OOM errors
Multi-layer safety — Content moderation is a dedicated service (imajin-moderator) with 5 detection layers, not a checkbox on generation
Pipeline orchestration — Two orchestrators (imajin-pipeline, imajin-app) compose services into end-to-end workflows

What @imajin Is Not

Not a desktop creative UI — No canvas, no interactive brush tools, no real-time preview
Not a node-graph editor — Pipeline stages are defined in code, not dragged between visual nodes
Not a replacement for ComfyUI or Automatic1111 — Different architecture for different use cases (see comparison below)
Not a model training platform — Consumes pre-trained models (SDXL, SigLIP2, ImageReward), doesn't fine-tune them

Architectural Comparison

Dimension	@imajin	ComfyUI	Automatic1111
Architecture	Distributed microservices	Single-process node graph	Single-process monolith
Primary interface	HTTP API	Browser-based node editor	Browser-based web UI
GPU coordination	model-boss (Redis-based VRAM leasing across services)	Single GPU, manual model management	Single GPU, manual model management
Type safety	Pydantic (Python) + Zod/TypeScript client libraries	Python dicts, no typed contracts	Python dicts, no typed contracts
Content safety	5-layer moderation service (PDQ hash, NSFW, age estimation, prohibited content, identity verification)	No built-in moderation	Optional NSFW filter
LLM integration	Dedicated prompt service with cultural classification	No LLM integration	No LLM integration
Batch processing	Job-based batch API with progress tracking	Manual queue	Manual queue
Semantic validation	SigLIP2-based filter alignment verification	None	None
Aesthetic scoring	ImageReward scoring + candidate ranking	None	None
Deployment model	Independent services, horizontally scalable	Single machine	Single machine

Complementary Strengths

Where ComfyUI / A1111 Excel

Interactive experimentation — Real-time parameter tweaking, visual feedback loops
Extension ecosystem — Thousands of community extensions for specialized workflows
Visual workflow design — Node graphs (ComfyUI) make complex pipelines visible and debuggable
LoRA/model management — Mature UI for loading, combining, and switching fine-tuned models
Rapid prototyping — Fastest path from idea to first image

Where @imajin Excels

API-first integration — Any service can generate images via typed HTTP clients
Service isolation — A moderation failure doesn't crash the diffusion service
Production safety — Multi-layer moderation pipeline with deterministic decision logic, timing side-channel prevention, and incident logging
Typed clients — TypeScript consumers get compile-time safety for all service interactions
Orchestration flexibility — Two orchestrators (proxy-based and self-contained pipeline) for different deployment patterns
Semantic verification — Generated images are validated against requested attributes before delivery
Multi-GPU coordination — model-boss manages VRAM across services running on different GPUs

When to Use What

Scenario	Tool
Prototyping a new image style or testing prompts	ComfyUI
Exploring LoRA combinations interactively	ComfyUI / A1111
Building a visual workflow for image-to-image pipelines	ComfyUI
Automated image generation behind an API	@imajin
Production pipeline with content moderation requirements	@imajin
Batch generating images across categories with semantic validation	@imajin
Multi-service deployment with independent scaling per capability	@imajin

System Architecture
Feature Roadmap — Adoptable features from the ComfyUI/A1111 ecosystem
Service Topology

4.8 KiB Raw Blame History