THINKINGOS
A I L a b o r a t o r y
Blog materials reflect our practical experience and R&D hypotheses. Where effects are mentioned, outcomes depend on project context, data quality, architecture, and implementation process.
Back to blog
TAO·CODER
June 2026 15 min
TAO·CODER AI Coding VS Code Agent Setup

TAO·CODER step-by-step guide from setup to production

Complete guide to TAO·CODER: from installing the extension to professional usage with bounded context, stage pipeline, and managed memory.

TAO·CODER — Installation and Usage Guide

Table of Contents

  1. What is TAO·CODER and how is it different from other AI agents
  2. Installation
    • 2.1. Installing VS Code / VS Codium
    • 2.2. Installing the TAO·CODER extension
    • 2.3. Choosing a model and provider
    • 2.4. The Task Context panel
    • 2.5. Language
    • 2.6. Web search provider (optional)
  3. Work methodology
    • 3.1. Role modes
    • 3.2. Stage pipeline
    • 3.3. Key documents
    • 3.4. Starting a task
  4. Practical tips and troubleshooting

1. What is TAO·CODER and how is it different from other AI agents

Artificial intelligence changed software development. TAO·CODER changed development with artificial intelligence.

Working with AI agents used to feel like a game of roulette: context would fill up, the agent would lose track of the task, and token costs would skyrocket. A single project consumed more nerves than code.

TAO·CODER is a coding agent for VS Code / VS Codium. It is not a chatbot that generates code on demand, but a full-fledged development tool with a built-in methodology. Thanks to its architecture — bounded context, stage pipeline, managed memory — it allows a single developer to run multiple complete projects in parallel, faster, cheaper, and more reliably than ever before.

You only pay for the tokens of the provider you choose yourself. The extension is free.

No telemetry. TAO·CODER does not collect logs, code, prompts, or any other data from your computer. Everything runs locally: project files, prompts, Task Context — everything stays with you. The only thing that leaves your machine is API requests to your chosen LLM provider (the tokens you pay for).

Key differences from other AI agents

Bounded context. Regular agents linearly accumulate conversation history — each new step prepends previous messages to the prompt. Eventually the model’s context window fills up, the agent loses focus, and work quality drops. TAO·CODER externalizes task memory into a structured Task Context on disk. Only the essentials go into the prompt: current stage, task specification, relevant code snippets, and the last few conversation turns. History does not grow indefinitely, the model stays focused, and costs are predictable.

Stage pipeline. Instead of a free-form “ask and get an answer” mode, TAO·CODER runs tasks through formal stages: clarification → data collection → development → audit → report. Each stage restricts which tools are available. For example, write tools are locked during data collection — the model cannot accidentally modify a file while studying the code. This ensures a transparent, controlled process.

Update cycle. TAO·CODER has two complementary mechanisms for moving information from the conversation into long-term memory. First, the agent actively and consciously records findings, code, and decisions into Task Context as it works, using dedicated tools (taocoder_add_relevant_code_ref, taocoder_add_decision_human_confirmed, and others). Second, an update cycle runs periodically: when enough new conversation turns have accumulated, the model automatically analyzes recent messages, extracts facts, decisions, and findings, and saves them to Task Context, after which older messages are archived. These two mechanisms complement each other and together keep the history bounded.

check_all.sh — quality gate. By default, TAO·CODER creates a check_all.sh script that runs a full suite of checks: typecheck, linter, tests, documentation validation. The script prevents the agent from submitting a task with bad code. Before completing a task, the agent must run check_all.sh and fix everything until it’s green.

Cost-effectiveness on flash models. Thanks to bounded prompts and the update cycle, TAO·CODER handles complex tasks efficiently even on cheap flash models (DeepSeek V4 Flash, Gemini Flash, Claude Haiku, and others). Context does not balloon, so there is no reason to overpay for frontier models on routine tasks.

  • A completed project phase: $1–5 ($5 is the max).
  • A medium project (e.g., a custom CRM with HTML newsletters and forms, 100–150k LOC): $30–40 in tokens.
  • If you need more speed and quality — DeepSeek V4 Pro, Qwen family models, or previous-generation GPT models at reasonable prices.
  • Frontier models for routine development are overkill. They are only needed for complex architectural decisions or particularly tricky bugs.

Naming: TAO·CODER (with a middle dot). In code and identifiers, camelCase is used: TaoCoder. In text, documentation, and UI — only TAO·CODER.


2. Installation

2.1. Installing VS Code / VS Codium

If VS Code is not yet installed, download it from code.visualstudio.com.

TAO·CODER also works on VS Codium (the open-source build of VS Code). Important: the latest version of the extension is always available on VS Code. VS Codium updates may be delayed.

2.2. Installing the TAO·CODER extension

Open VS Code and go to the Extensions tab (the squares icon in the left panel, or Cmd+Shift+X). Search for TAO·CODER (with the dot) and click Install.

After installation, Trust Developer (confirm the developer). The extension will install and its icon will appear in the Activity Bar on the far left.

Moving the panel to the right (recommended). By default, TAO·CODER opens in the left panel. For convenience, move it to the right:

  1. Right-click the TAO·CODER icon in the Activity Bar (the vertical row of icons on the left).
  2. Select Move to Right Side Bar.

2.3. Choosing a model and provider

  1. Click the gear icon (Settings) in the top bar of the TAO·CODER chat window.
  2. Select LLM Provider — the provider you will work through. List of supported providers (all current models of each are available):
    • DeepSeek (recommended: DeepSeek V4 Flash for routine tasks, V4 Pro if you need more speed and quality)
    • Anthropic (Claude)
    • OpenAI (GPT)
    • Google Gemini
    • Qwen
    • AWS Bedrock, Azure OpenAI, GCP Vertex AI
    • OpenRouter (access to 200+ models)
    • Cerebras, Groq, Mistral
    • Local models via LM Studio or Ollama
    • Any OpenAI-compatible API
  3. I recommend starting with DeepSeek V4 Flash — the best price/quality ratio for most tasks.
  4. If the provider requires an API key — click the registration link, sign up, get your key, and top up your balance.
  5. Enter the key in the API Key field.
  6. Select a model and click Add to Favorites. You can add multiple models and switch between them during work.
  7. It is recommended to add 2–3 providers as a fallback in case one fails.
  8. Save your settings with the Done button.

2.4. The Task Context Panel (Cockpit)

Task Context is the agent’s managed memory. It lives in a separate panel, not in the chat history.

How to open it:

  1. In the central editor area (where TAO·CODER is open), look at the top of the panel.
  2. There is a split-screen button — click it to split the central area into 2 parts (e.g., editor and panel).
  3. On the right side, click the menu with three dots (...) and select Task Context, or click the Task Context button located right above the message input field.

The Task Context panel will appear (labeled in English). This is the cockpit — the task’s dashboard. It displays: task specification, execution plan, relevant code references, decisions, notes, stages, snapshots.

2.5. Language

In the TAO·CODER settings, find the Preferred Language parameter. This is the language the agent will use to respond to you in chat and to reason. Select your preferred language.

2.6. Web search provider (optional)

If you need the agent to search the internet:

  1. In the settings, find the Web Search Provider section.
  2. Select Brave (recommended) — you can get a free key at brave.com/search.
  3. Enter the key.

After that, the agent will have access to web_search and web_fetch tools.


3. Work Methodology

TAO·CODER includes an AI-assisted development methodology that differs from the classical approach. The agent and methodology are the result of a year of intensive development with AI agents (roughly 2 million lines of code in completed projects).

Don’t jump straight into Developer mode. Start with Architect mode.

3.1. Role modes

TAO·CODER has five modes, each with its own stages, tools, and contracts. The mode switcher is located right above the message input field. Next to it is a Mode Help button — click it to see a short description of each mode.

ModePurposeStages
ArchitectArchitecture design, writing documentationdiscussiondocs
DeveloperDevelopment and refactoringclarificationdata_collectiondevelopmentaudit → (reworkaudit) → report
OpsOperations: deployment, monitoring, rollbackdiagnosischangevalidationescalation
DebugE2E testing, debuggingsetupenvironmentexecutionfixvalidation
FreeQuick fixes without formal stagesstage machine disabled

The mode is selected when a task is first created. It can be switched within a task by pausing it (Pause), but this is not recommended: each mode maintains its own Task Context, and switching may disrupt context.

3.2. Stage pipeline

In Developer mode, a task goes through the following stages:

1. Clarification The agent asks questions to understand what needs to be done. It does not touch code — only clarifies.

2. Data Collection Studies the code: reads files, looks for dependencies, explores the architecture. Write tools are locked — file modification is impossible.

3. Development Writes code, edits files, runs commands. Full access to all tools.

4. Audit Checks changes against acceptance criteria, runs tests, check_all.sh.

5. Rework — if audit found issues. If checks fail, the agent returns to rework, fixes the issues, and runs audit again. The audit → rework → audit cycle repeats until everything passes.

6. Report Summarizes what was done, what was changed, and how to verify. Files and commands are unavailable — only the report.

Stage transitions are initiated by the agent via the taocoder_set_stage tool. The agent must provide a reason and evidence references confirming that the current stage’s criteria are met.

3.3. Key documents

README_AI.md — the project bible for AI. This is a mandatory document that TAO·CODER creates in the project root. It contains all the information the agent needs to quickly understand the project: description, architecture, repository structure, how to run it, and how to check quality. The agent reads this file at the start of each task and updates it throughout development.

CHECK_ALL.md and check_all.sh — quality gate. CHECK_ALL.md describes the check script. check_all.sh is the executable script that runs typecheck, linter, tests, and documentation validation. The agent cannot complete a task while check_all.sh is not green. This prevents shipping broken code.

Roadmap. The agent creates a Roadmap so that each phase fits into a single development cycle (one task). Phases are sequential and independent of each other.

3.4. Starting a task

If the project is new:

  1. Switch to Architect mode.
  2. Describe to the agent what you want to build, who your target audience is, and what design you want.
  3. The agent will ask clarifying questions, then write documents phase by phase: PRODUCT_CONCEPT → TARGET_AUDIENCE → ARCHITECTURE → DESIGN_SYSTEM → README_AI → ROADMAP → CHECK_ALL.
  4. Read the documents. If needed, ask the agent to explain and correct. Don’t hesitate to clarify unclear points or ask for the most logical solutions.

If the project already exists:

  1. Switch to Architect mode.
  2. Ask the agent to audit the project and fill in the documents based on what already exists.

When all documents are ready and you are satisfied with them:

(A) Starting a task in Developer:

  1. Click + (New Session) and select Developer mode.
  2. Open the Roadmap (located in the project root or in the docs/ folder).
  3. Find Phase 1, select its text — a lightbulb will appear (action suggestion).
  4. Click the lightbulb and select Add to TaoCoder.
  5. A link to the phase will appear in the chat. Write “Running Phase 1”.
  6. The agent will explore the project and ask clarifying questions. If everything is fine, confirm the criteria it has defined.
  7. Work begins: the agent will collect data, write findings to memory, develop, run audits, rework, and run tests. At the end, it will return with a report.
  8. Repeat step (A) for each phase of the Roadmap until everything is complete.

Ops — deployment to a server. Ops mode will tell you what it needs for deployment (locally or on a VPS).

Debug — E2E testing. Debug mode is for developing and running full E2E tests on a locally running project.

Free — quick fixes. Free mode is for small fixes and tasks where the full stage pipeline is not needed. Task Context memory is still maintained, but stages are disabled.


4. Practical tips and troubleshooting

Tips

  1. Don’t start with Developer. Even for an existing project, start with Architect. It will save you money and nerves: the agent will first understand the project, document everything, and then work meaningfully.

  2. Don’t hesitate to clarify. The agent is not a mind reader. If something is unclear or you want it done differently — ask. Especially in Architect mode.

  3. Watch the Task Context. In the Task Context panel (Cockpit), you can see which stages have been completed, the current TODO, and what is in the memory sections (relevant code, decisions, notes). It also displays statistics: time elapsed, steps taken, tokens spent. The system does not allow Task Context to be empty or incomplete — it controls itself. There is essentially nothing to regulate.

  4. Use multiple providers. If one provider goes down (error, limit, outage), switch to another. Add models from at least two providers to Favorites.

  5. Create a repository early. It is better to create a GitHub/GitLab repository and link it to your local folder right away. The agent in Free mode can help with this.

  6. Save on models. For routine development, flash models (DeepSeek V4 Flash, Gemini Flash) are sufficient. Keep frontier models (Claude, GPT-5, and others) for complex architecture and tricky bugs.

  7. Don’t expect instant results. TAO·CODER works thoroughly: it collects data, writes documentation, runs audits. Quality takes time. Don’t worry — it may take a while, but it’s still inexpensive and very reliable.

Troubleshooting

The agent is stuck / stopped responding. First try pausing the task (Pause), then click Continue. If pause also fails — close VS Code, reopen it, select the task from history, and click Continue. This is a known issue — a legacy of the agent not being built from scratch. Problems are being fixed gradually.

The agent asks for manual confirmation. TAO·CODER cannot fully rewrite large files — this is a security limitation. Some commands (e.g., restoring files from a remote repository) are permanently forbidden. This is normal.

The Task Context panel does not appear. Make sure the central area is split into 2 parts (click the split-screen button), then in the ... menu select Task Context, or click the Task Context button right above the message input field.

Cannot find TAO·CODER in the Marketplace. The extension may be delayed on VS Codium. Use the VS Code Marketplace to install the latest version.

web_search does not work. Check that the Web Search Provider is configured in settings (e.g., Brave) and that the API key is entered and saved.


The agent’s code name — “The Stickler”. Because it truly sees things through to the end: documentation, tests, audit, report — nothing gets skipped. Don’t be put off by how thoroughly it works — it actually delivers results.

TAO·CODER Guide

Want to try TAO·CODER?

Install the extension, pick a model, and start your first task in 10 minutes. The extension is free — you only pay for tokens.

Start with TAO·CODER