Overview

Asya🎭 is a Kubernetes-native async actor framework with pluggable components for AI/ML orchestration.

System Architecture¶

Operator: Kubernetes controller that watches AsyncActor CRDs, injects sidecars, configures KEDA autoscaling
Gateway: Optional MCP HTTP API for envelope submission, SSE streaming, and status tracking
CLI: Command-line tool for interacting with the gateway (MCP client)

Each actor pod contains two containers:

Sidecar: Handles queue consumption, message routing, retries, progress reporting (Go)
Runtime: Executes your Python handler via Unix socket, handles OOM recovery

Crew Actors: Special actors with reserved roles (happy-end, error-end) for result persistence and error handling

Message Queue: Pluggable transports (SQS, RabbitMQ, Kafka/NATS planned)
KEDA: Monitors queue depth, scales actors 0→N based on workload
Observability: Prometheus metrics, structured logging, OpenTelemetry integration

Client sends request to Gateway (or directly to queue)
Gateway creates envelope, routes to first actor's queue
Sidecar consumes message from queue
Sidecar forwards envelope to Runtime via Unix socket
Runtime executes your Python handler, returns result
Sidecar routes result to next actor's queue (or happy-end/error-end)
Repeat steps 3-6 for each actor in the route
Crew actor (happy-end or error-end) persists final result, reports status to gateway

Key insight: Queue → Sidecar → Your Code → Sidecar → Next Queue

Actor-to-Actor: Envelope structure, routing, status tracking
Sidecar-Runtime: Unix socket communication, framing protocol, error handling

AsyncActor CRD: Workload specification, scaling configuration, timeout settings
Autoscaling: KEDA integration, scaling strategies, queue-based autoscaling
Observability: Metrics, logging, tracing, monitoring best practices

AWS (SQS + S3):

Self-hosted (RabbitMQ + MinIO):

See: Installation Guides (AWS EKS, Local Kind) for detailed deployment instructions.