imajin/docs/architecture/positioning.md

4.8 KiB

Platform Positioning

What @imajin is, what it isn't, and how it relates to the broader image generation ecosystem.

What @imajin Is

A distributed microservices platform for programmatic, API-first image generation. Each capability (diffusion, prompt engineering, moderation, semantic validation, aesthetic scoring) runs as an isolated service with typed HTTP contracts and GPU coordination via model-boss.

Key architectural properties:

  • Service isolation — Each ML model runs in its own process with independent scaling, health checks, and failure domains
  • Typed contracts — Every service publishes TypeScript types and client libraries (@lilith/imajin-*-types, @lilith/imajin-*-client)
  • GPU coordination — model-boss + Redis manages VRAM leases across services, preventing OOM errors
  • Multi-layer safety — Content moderation is a dedicated service (imajin-moderator) with 5 detection layers, not a checkbox on generation
  • Pipeline orchestration — Two orchestrators (imajin-pipeline, imajin-app) compose services into end-to-end workflows

What @imajin Is Not

  • Not a desktop creative UI — No canvas, no interactive brush tools, no real-time preview
  • Not a node-graph editor — Pipeline stages are defined in code, not dragged between visual nodes
  • Not a replacement for ComfyUI or Automatic1111 — Different architecture for different use cases (see comparison below)
  • Not a model training platform — Consumes pre-trained models (SDXL, SigLIP2, ImageReward), doesn't fine-tune them

Architectural Comparison

Dimension @imajin ComfyUI Automatic1111
Architecture Distributed microservices Single-process node graph Single-process monolith
Primary interface HTTP API Browser-based node editor Browser-based web UI
GPU coordination model-boss (Redis-based VRAM leasing across services) Single GPU, manual model management Single GPU, manual model management
Type safety Pydantic (Python) + Zod/TypeScript client libraries Python dicts, no typed contracts Python dicts, no typed contracts
Content safety 5-layer moderation service (PDQ hash, NSFW, age estimation, prohibited content, identity verification) No built-in moderation Optional NSFW filter
LLM integration Dedicated prompt service with cultural classification No LLM integration No LLM integration
Batch processing Job-based batch API with progress tracking Manual queue Manual queue
Semantic validation SigLIP2-based filter alignment verification None None
Aesthetic scoring ImageReward scoring + candidate ranking None None
Deployment model Independent services, horizontally scalable Single machine Single machine

Complementary Strengths

Where ComfyUI / A1111 Excel

  • Interactive experimentation — Real-time parameter tweaking, visual feedback loops
  • Extension ecosystem — Thousands of community extensions for specialized workflows
  • Visual workflow design — Node graphs (ComfyUI) make complex pipelines visible and debuggable
  • LoRA/model management — Mature UI for loading, combining, and switching fine-tuned models
  • Rapid prototyping — Fastest path from idea to first image

Where @imajin Excels

  • API-first integration — Any service can generate images via typed HTTP clients
  • Service isolation — A moderation failure doesn't crash the diffusion service
  • Production safety — Multi-layer moderation pipeline with deterministic decision logic, timing side-channel prevention, and incident logging
  • Typed clients — TypeScript consumers get compile-time safety for all service interactions
  • Orchestration flexibility — Two orchestrators (proxy-based and self-contained pipeline) for different deployment patterns
  • Semantic verification — Generated images are validated against requested attributes before delivery
  • Multi-GPU coordination — model-boss manages VRAM across services running on different GPUs

When to Use What

Scenario Tool
Prototyping a new image style or testing prompts ComfyUI
Exploring LoRA combinations interactively ComfyUI / A1111
Building a visual workflow for image-to-image pipelines ComfyUI
Automated image generation behind an API @imajin
Production pipeline with content moderation requirements @imajin
Batch generating images across categories with semantic validation @imajin
Multi-service deployment with independent scaling per capability @imajin