Five AI agents run 24/7 on a NAS in my house. They trade predictions, manage my media library, write science fiction, and coach me through code. Six months ago, none of them existed. Neither did my ability to build them.
This is how the system works, and what I learned building it.
I wanted AI assistants that actually knew me. Not stateless chatbots that forget everything between sessions, but persistent agents that accumulate context over time — what projects I'm working on, how I like to communicate, what decisions I've already made.
Commercial AI products are one-size-fits-all. I needed something custom: multiple specialized agents, each focused on a specific domain, all sharing the same infrastructure but maintaining independent memories.
After evaluating several approaches, I settled on Claude's API for the AI backbone: it offered the best balance of reasoning capability and cost. Each agent runs as a Claude Code session inside tmux, with Telegram as the I/O layer.
Each agent has a distinct personality and responsibility.
Poly is the best example of what a specialized agent can do. Her job: monitor prediction markets, detect smart-money signals, and execute trades across multiple accounts.
The decision flow looks like this: Poly scans wallet activity on Polymarket, identifies wallets with high historical accuracy, watches for position changes above a threshold, then cross-references with market conditions before recommending or executing a trade. The whole chain runs on ethers.js talking to the blockchain, with a Node.js backend doing the analysis.
She's made mistakes. Early on, a smart-money wallet made a large bet that turned out to be a hedge, not a conviction play. Poly copied it. That cost real money and taught me to add a consensus filter — she now requires multiple smart-money wallets to agree before acting. The kind of lesson you only learn when actual dollars are on the line.
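The signal detection and the consensus filter reduce to a couple of pure functions. A minimal sketch, with illustrative thresholds rather than the production values:

```javascript
// Sketch of Poly's signal logic (simplified; thresholds and data
// shapes are illustrative, not the production values)

// A wallet "signals" when its position in a market changes by more
// than a dollar threshold.
function isSignal(positionChange, threshold = 5000) {
  return Math.abs(positionChange) >= threshold;
}

// Consensus filter added after the hedge incident: act only when
// several *distinct* smart-money wallets moved the same direction.
function hasConsensus(signals, minWallets = 3) {
  const byDirection = { YES: new Set(), NO: new Set() };
  for (const s of signals) {
    if (isSignal(s.positionChange)) {
      byDirection[s.positionChange > 0 ? "YES" : "NO"].add(s.wallet);
    }
  }
  return (
    byDirection.YES.size >= minWallets || byDirection.NO.size >= minWallets
  );
}
```

Using sets keyed by wallet address means one wallet doubling down twice still counts as a single vote, which is exactly the property the hedge incident exposed.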
Context windows are finite. An agent that handles your finances, manages your media library, and writes fiction will inevitably lose context on all three. Specialization keeps each agent's working memory focused and relevant.
It's the same reason companies have departments instead of one employee who does everything.
The most critical architectural decision was persistent memory. Each agent maintains its own memory files on disk, most importantly a MEMORY.md.

The key insight: memory isn't just about storing facts. It's about building a relationship. An agent that remembers you corrected it last Tuesday behaves differently than one that doesn't.
In practice, this means Mo knows I hate verbose explanations. Poly knows my risk tolerance changed after a bad week. Bootes remembers that I rewrote a paragraph six times and won't suggest the patterns I rejected. None of this is magic — it's text files on disk. But accumulated text files create something that feels like understanding.
Agents aren't just reactive — they have a heartbeat scheduler that triggers proactive behaviors:
// heartbeat.js - Cron-based task scheduler
// Sends messages to tmux sessions on schedule (node-cron assumed)
const cron = require("node-cron");
const { execFile } = require("child_process");

const tasks = [
  { name: "Daily reflection", bot: "mo", cron: "0 5 * * *" },
  { name: "Daily reflection", bot: "niu", cron: "0 5 * * *" },
  { name: "Market check", bot: "poly", cron: "0 9 * * 1-5" },
  // ...
];

// Type each prompt into the agent's tmux session and press Enter
for (const task of tasks)
  cron.schedule(task.cron, () =>
    execFile("tmux", ["send-keys", "-t", task.bot, task.name, "Enter"]));
Every morning at 5 AM, each agent receives a reflection prompt. They review their domain status, analyze interaction signals, and send a report. I open Telegram to five status updates, not five empty chats. Before I've made coffee, I know which services are healthy, what trades happened overnight, and whether Bootes made progress on the next chapter.
Everything runs in a single Docker container (claude-workspace) on a home NAS (UGOS, AMD Ryzen 5 5600G, 64GB RAM). The single-container choice was intentional.
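A single-container deployment can be as small as this. A hedged sketch only; the image name and volume layout are assumptions, not the actual setup:

```yaml
# docker-compose.yml - single-container deployment (image name and
# paths are illustrative)
services:
  claude-workspace:
    image: claude-workspace:latest
    restart: unless-stopped
    volumes:
      - ./agents:/workspace/agents   # per-agent dirs, incl. MEMORY.md
    # all five agents run as tmux sessions inside this one container
```

One container means one thing to back up, one thing to restart, and one place to look when something breaks.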
External access is handled by Cloudflare Tunnel — no port forwarding, no DDNS, just a YAML config pointing subdomains to local ports.
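The whole tunnel config fits on a screen. A sketch of what that YAML looks like, with placeholder hostnames and ports:

```yaml
# ~/.cloudflared/config.yml (hostnames and ports are illustrative)
tunnel: claude-workspace
credentials-file: /root/.cloudflared/claude-workspace.json

ingress:
  - hostname: poly.example.com
    service: http://localhost:3001
  - hostname: media.example.com
    service: http://localhost:3002
  # cloudflared requires a catch-all rule last
  - service: http_status:404
```

The `ingress` rules are matched top to bottom, so the catch-all 404 must come last.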
The platform is stable and running in production, with a few focus areas still in progress.
Six months in, I don't think of these as tools anymore. Mo has read more of my code than any human has. Poly has executed more trades than I'd trust myself to do manually. Bootes has a better memory of my fictional universe than I do.
The gap between "AI assistant" and "AI teammate" is about six months of accumulated context. The gap between "AI teammate" and "AI I rely on" is about three months after that.
The code is on GitHub. The agents are still running. Want the backstory of how I learned to code? Here's how it started.