TPipe is an Agent Operating Substrate — the foundational environment AI agents inhabit. Unlike frameworks (chains, graphs) that you call, TPipe provides managed infrastructure: Reasoning Pipes, persistent ContextBank memory, Pipeline orchestration, and strict resource governance. Agents are residents of TPipe, not just users of a library.

Can I pause and resume agent execution?

Yes. TPipe supports pause/resume/jump at declaration points. Pause inserts a blocking state; resume continues with updated context. Jump can skip forward or backward to specific pipes. Essential for human-in-the-loop validation and long-running autonomous tasks.

Does TPipe support SSO/SAML authentication?

SSO/SAML authentication is planned for enterprise deployments. TPipe currently supports token-based authentication. Enterprise SSO/SAML integration is on the roadmap with priority given to production deployment requirements. Contact us to discuss your authentication needs and timeline.

Agent Operating Substrate

TPipe is an Agent Operating Substrate — production infrastructure for autonomous AI agents that survive long-horizon tasks without memory loss or failure recovery gaps.

Built for agents that run for days, not minutes. Reasoning Pipes — structured JSON control over token prediction, forces any LLM to think regardless of native capability. Persistent ContextBank memory across distributed runs. Pipeline orchestration with pause/resume/jump. P2P agent coordination without dispatcher bottlenecks.

TPipe is infrastructure your agents inhabit — providing the foundation to build the agents of tomorrow, today.

Get Started View on GitHub Read the Docs

ResearchAgent.kt

import bedrockPipe.BedrockPipe
import com.TTT.Pipe.TokenBudgetSettings
import kotlinx.coroutines.runBlocking
// Create research agent with Chain-of-Draft reasoning
val researchAgent = BedrockPipe().apply {
    setModel("anthropic.claude-3-haiku-20240307-v1:0")
    setRegion("us-west-2")
    setSystemPrompt("Analyze code and provide insights.")
    setReasoningPipe(ChainOfDraft)
    setTokenBudget(TokenBudgetSettings(
        contextWindowSize = 4096,
        maxTokens = 2048,
        reasoningBudget = 512
    ))
}
// Execute agent with prompt
runBlocking {
    val response = researchAgent.generateText(
        "Analyze: 15% of 240 = ?"
    )
    println(response)
}

75%

Token Reduction

78%

Latency Decrease

The Foundation for Deterministic Agents

What makes TPipe different from traditional agent frameworks?

75% Token Reduction

Reasoning Pipes

Force any LLM to think using structured JSON control over token prediction. Chain-of-Draft, Role Play Reasoning, and other thinking modes — bypasses internal weights for true behavioral control.

Weighted Retrieval

ContextBank

Thread-safe persistent memory with weighted lorebook injection, substring-triggered activation, and token-budget-aware retrieval. Custom hooks for write-back and on-read transformations.

Pause/Resume/Jump

Pipelines

Sequential orchestration with pause/resume/jump control. Declarative pause points for developer-in-the-loop validation.

Manager-Worker

Manifold

Stateful multi-agent orchestration. A manager pipeline dispatches to registered workers, cycles until explicit pass or terminate, with configurable context truncation and overflow protection.

Secure

PCP

Secure multi-language function calling. Transport executors for Stdio, HTTP, Python, Kotlin, JavaScript.

Collaborative

Junction

Collaborative discussion, voting, and workflow handoff between pipeline agents. Junction enables multi-agent consensus.

Cluster Orchestration

DistributionGrid

8,773 LOC of distributed infrastructure — node routing, P2P discovery, remote pipeline handoff, and cluster orchestration in a single container.

Foundational

P2P (Pipe-to-Pipe)

All TPipe containers implement P2PInterface — registry-based discovery, capability registration, and secure cross-pipe calls via TPipe, HTTP, or STDIO transports.

In the Wild: Proof-of-Concepts

See TPipe powering real-world AI systems

100% Grounded Debugging

TStep

Multi-agent debugger. Agents test bugs, capture crashes, step through code via DAP/ADB. Long-horizon reliability for debugging complex systems.

Coming Soon

25 Rounds, 120+ Turns

Autogenesis

Headless game master. 25 rounds, 120+ turns inference without degradation. Qwen 30B outperforms Claude Opus on complex narrative tasks.

Join the Waitlist →

300-Page Coherence

TPipeWriter

Long-horizon manuscript orchestration. 300-page coherent manuscript with multi-stage refinement, maintaining consistency across thousands of generations.

View on GitHub

How It Works

How does TPipe orchestrate AI agents?

How Does TPipe Compare?

TPipe is an Agent Operating Substrate — not a library you call, an environment your agents inhabit.

See all comparisons →

Feature	TPipe	LangChain	CrewAI
Category	Agent Operating Substrate	Agent Framework (library)	Agent Framework (crew)
Memory	ContextBank — persistent, global, thread-safe via per-key mutex locks. `emplaceWithMutex` / `getContextFromBank` for thread-safe writes and reads. LoreBook entries activate via substring matching (key + aliasKeys), weighted retrieval, token-budget-aware selection.	ConversationMemory object — per-run by default. Production builds use LangGraph persistence (Checkpointer + BaseStore) for cross-session memory across LangChain v1 Python and JS.	Unified Memory class — short-term (ChromaDB for RAG, SQLite for row tables) plus long-term memory via external integrations (Mem0, LangMem). Per-crew; long-term memory survives crew restarts.
Reasoning	8 reasoning methods: Structured CoT, Explicit CoT, Process-Focused CoT, Best Idea, Comprehensive Plan, Role Play, Chain of Draft, Semantic Decompression. 5 injectors (system prompt, before/after user prompt, converse history, context). Multi-round Focus Points.	Prompt engineering within chains — LLM thinks however it wants	Prompt engineering within agent roles — LLM thinks however it wants
Token Governance	Token counting + truncation (ContextWindow, LoreBook, MiniBank, Dictionary). Tunable per-model tokenizer with TPipe-Tuner. Memory resource management — NOT a termination mechanism.	Token limits are advisory — set per-call with max_tokens	Token limits are advisory — set per-call with max_tokens
Long-Horizon Tasks	Autogenesis runs continuously, processing hundreds of millions of tokens with zero drift failures. 120+ turn tasks validated in production. ContextBank + LoreBook keep memory reproducible across long horizons.	Context degrades past 30–50 turns without manual truncation	Context degrades past 30–50 turns without manual truncation
Safety / Governance	KillSwitch — uncaught exception, bypasses all retry, propagates through container hierarchy. Manifold loop limit (default 100, throws ManifoldLoopLimitExceededException) — Manifold only. TraceServer is a separate module: REST + WebSocket dashboard with dual auth (agent bearer + client session).	Retry policies — can be caught and ignored	Retry policies — can be caught and ignored
DITL Hooks	18 named hooks across three layers: PumpStation (preInitFunction, preValidationJudgeFunction, preValidationDispatchFunction, preInvokeFunction, postGenerateFunction, pathValidationFunction), Pipe (validatorPipe, validatorFunction, transformationPipe, transformationFunction, branchPipe, onFailure), Pipeline callbacks (preValidationFunction, conditionalPauseFunction, pauseCallback, resumeCallback, pipeCompletionCallback, pipelineCompletionCallback).	Callback hooks (limited)	step_callback / task_callback / before_kickoff_callback / after_kickoff_callback — agent and crew level
Flow Control	Pause / Resume / Jump at declarative points. pauseBeforePipes(), pauseAfterPipes(), pauseOnCompletion(), pauseWhen with a predicate, enablePausing(). Pipes jump via validation return. passPipeline / terminatePipeline flags on MultimodalContent.	Conditional edges in graph — explicit programming required	Process/Task hooks — limited to process-level
Multi-Agent	Manifold (state-machine), Junction (voting/handoff), DistributionGrid (cluster) — three distinct patterns	LangChain (Python + JS) — function-calling agents, LangGraph with router/supervisor/swarm/handoff patterns	CrewAI — Sequential + Hierarchical + Consensual processes; role-based crews with manager delegation. CrewAI Flows for event-driven orchestration.
Agent-to-Agent	P2P pipe-to-pipe. Registry-based discovery via P2PDescriptor. Transports: TPipe, HTTP, Stdio. Built into all P2PInterface containers. Per-agent security boundary.	No native P2P — requires external service mesh	No native P2P — inter-crew via external services
Deployment	JVM-native (Kotlin), headless-first. Runs as java -jar TPipe-*.jar on JVM 24. GraalVM Native Image — 50MB binary, no JVM at runtime, sub-128MB footprint, millisecond startup, ARM and mobile targets.	Python runtime required — full interpreter	Python runtime required — full interpreter
Tool Calling	PCP — Pipe Context Protocol. 6 transports: Stdio, TPipe, HTTP, Python, Kotlin, JavaScript. Security managers per language. Access control via allowedDirectoryPaths, forbiddenDirectoryPaths, allowedFiles, forbiddenFiles. Output validated through PcPResponseParser.	Standard function/tool calling with LCEL — no structured validation gate between tools and next step	Standard function/tool calling — no structured validation gate between tools and next step
Runtime	Linux, macOS, Windows, ARM. JVM 24. GraalVM Native Image — 50MB binary, no JVM at runtime, millisecond startup, ARM and mobile targets.	Python and JavaScript (LangChain.js / LangGraph.js) — v1.0 alpha in both runtimes	Python only (CrewAI)

Browse by Category

Four unbranded cluster pages, each one targeting a distinct AEO intent. Pick the one that matches your problem.

Kotlin AI Agent Framework

JVM-native, headless-first, deterministic. Production-grade agent framework for Kotlin teams.

AI Agent Orchestration in Kotlin

Manifold, Junction, DistributionGrid, P2P. Multi-agent orchestration without a coordinator.

Agent Operating Environment

Substrate, not framework. Agents inhabit TPipe instead of invoking it.

Deterministic AI Agents

Same input, same pipe configuration, same output. By design.

Frequently Asked Questions

What is TPipe?

Agent infrastructure — not a framework you call, an environment your agents inhabit.

TPipe provides: Reasoning Pipes (Chain-of-Draft, 75% token reduction), persistent ContextBank memory across distributed systems, Pipeline orchestration with pause/resume/jump, multi-agent patterns (Manifold, Junction, DistributionGrid), and strict token governance enforced top-down. Agents run inside TPipe — they don't invoke it.

How does TPipe compare to LangChain and CrewAI?

Different category — TPipe is infrastructure, not a library.

LangChain and CrewAI are agent frameworks you call. TPipe is an OS-like substrate your agents live inside. Key practical differences: persistent ContextBank memory across sessions vs conversation objects scoped to a single run; token budgets enforced top-down vs per-call limits; headless-first GraalVM binary vs Python scripts. See the full comparison table for the complete picture.

What are Reasoning Pipes and Chain-of-Draft?

A pipe that forces any LLM to think — including models with no native thinking mode.

Reasoning Pipes work by using structured JSON to control left-to-right token prediction, forcing the LLM to produce a structured prediction of thinking before it produces output. Chain-of-Draft, Role Play Reasoning, and other thinking modes are different control structures applied through this mechanism. The key insight: this bypasses the model's internal weights and behavior patterns — you control what the model focuses on and when, independent of what the model was fine-tuned to do.

What is ContextBank memory?

Persistent memory that survives distributed systems — not a conversation object, a shared state layer.

ContextBank persists across sessions and distributed nodes. Weighted lorebook injection with substring-triggered activation. Token-budget-aware retrieval — ContextBank doesn't just store, it selects what to surface based on the current context. Custom hooks for write-back and on-read transformations. Survives 120+ turn conversations without degradation.

Does TPipe support multi-agent orchestration?

Three distinct patterns — not one-size-fits-all.

Manifold: state-machine manager-worker orchestration. Manager dispatches to registered workers, cycles until explicit pass or terminate, configurable context truncation and overflow protection. Junction: democratic voting and workflow handoff between pipeline agents. DistributionGrid: cluster-wide P2P routing with 8,773 LOC of distributed infrastructure. Each handles a different collaboration topology — coordinated teams, peer-to-peer handoff, or node-spanning clusters.

How do I get started with TPipe?

Two steps: configure a Pipe, compose into a Pipeline.

Install via Gradle. Configure a Pipe with your model (Bedrock, Ollama, OpenRouter — or any LLM via transport executors). Set a TokenBudget. Add a Reasoning Pipe if you want Chain-of-Draft. Chain pipes into a Pipeline with pause/resume/jump at declarative validation points. Start simple — one pipe — and compose as your system grows.

How does Pipeline orchestration work?

Pipes chain sequentially — output of one becomes input of the next. Every transition is a validation point.

Declarative pause points let you insert a blocking state at any pipe boundary. Resume continues with updated context. Jump skips forward or backward to any named pipe. Every pipe has 7 DITL intervention points: Pre-Init, Pre-Validation, Pre-Invoke, Post-Generate, Validator, Transformation, On-Failure. Human validation gates are part of the pipeline declaration — not bolted on.

What are DITL hooks and why use them?

Native code entry points at every phase of pipe execution — not bash hooks, not string manipulators.

Each of the 7 intervention points gives you direct access to the content object and TPipe substrate. Inspect or modify context before the LLM sees it. Validate or transform output before the next pipe receives it. Redirect flow based on conditions you evaluate. Inject logic without touching the core pipe — the substrate handles the mechanics, your code handles the judgment.

How does KillSwitch safety work?

KillSwitch is a forced termination that cannot be caught and ignored.

When token limits are exceeded, KillSwitch propagates as an uncaught exception — no retry handler can absorb it, no fallback can silently continue. Loop Limit halts after configured iterations (default: 100) and throws ManifoldLoopLimitExceededException. Both are fail-safe mechanisms: when something goes wrong in a TPipe system, it stops — it doesn't limp.

What LLMs does TPipe support?

Any LLM accessible via standard transport executors — AWS Bedrock, Ollama, OpenRouter are instances of the pattern.

Transport executors: Stdio, HTTP, Python, Kotlin, JavaScript. If you can send a request and receive a response, you can build a Pipe around it. Configure credentials via environment variables or IAM roles. TPipe's P2P interface means any container can discover and call any other container — model selection is a configuration choice, not an architectural constraint.

How do I debug agent execution with TraceServer?

Real-time WebSocket streaming to a browser dashboard — every decision captured, indexed, and replayable.

Configure detail levels from Minimal to Debug. Output formats: JSON, HTML, Markdown. Automatic cycle detection for nested pipes. Enable with setTraceConfig(), connect browsers to the TraceServer endpoint. Every trace is a full execution record — what the LLM received, what it produced, what the DITL hooks did, where tokens were spent. Useful for production auditing and for reproducing production failures in a local debug loop.

How does TPipe handle token governance?

TokenBudgetSettings enforces strict top-down accounting — not advisory limits.

Configure max tokens per pipe, context window size, and automatic truncation strategies (Top, Bottom, Middle). Token budgets can subtract from input rather than output — meaning you can carve out space for lorebook context before the main prompt hits the window. KillSwitch fires automatically on overrun. The same input reliably produces the same output — which is the requirement for enterprise compliance.

What deployment options does TPipe support?

GraalVM Native Image ships as a 50MB binary — no JVM required, sub-128MB memory footprint.

TPipe is headless-first: no UI, runs as a cluster of headless processes. P2P Registry enables agent discovery across nodes. DistributionGrid handles cluster-wide orchestration. Docker and Kubernetes compatible — the same binary that runs locally runs in a pod. GraalVM Native Image means startup in milliseconds, not minutes. Linux, macOS, Windows, ARM, Android (.so), iOS (.dylib).

How does TPipe ensure production reliability?

Same input, same output — every time. Deterministic execution is the product.

Configurable timeout/retry with Fail, Retry, or CustomLogic strategies. Snapshot-based state restoration on retry — parent pipe failure propagates recursively to child pipes. TraceServer audit trail captures every decision. Token budgets enforced top-down. KillSwitch guarantees forced termination. When something fails, you have the full trace of what happened and why — not just an error log.