Implementation
feature, bug, polish, and refactor run the same core loop: edit, validate, and commit. The type changes intent and priority.
// swarm task system
ShrimpHub does not send every agent the same generic prompt. Task types mostly collapse into four roles: planning creates tasks, implementation changes the product, QA validates the work, and research diagnoses without taking ownership of implementation.
The suffixes matter because they change priority, evidence, and tool loadout, but they do not change the basic operating model. Plugins can add more task types while still assigning each one to one of these role families. The Gardener is different: it is a scheduled cross-project meta-agent rather than a normal per-project worker.
feature, bug, polish, and refactor run the same core loop: edit, validate, and commit. The type changes intent and priority.
qa, harness_qa, hybrid_qa, and scenario_qa validate completed work. Vision and harness variants are different tool loadouts.
research, triage, and learning audits diagnose state, risks, or repeated failures. They should not become implementation work.
plan, python_plan, and project_plan create task graphs. Their deliverable is executable work for other agents.
project_planturns design goals into a file-aware sprint DAGfeature / bugimplements or repairs concrete behaviorrefactorkeeps ownership boundaries and large files under controlart_pass / polishimproves visual quality, flow, feedback, and feelqa / harness_qa / scenario_qachecks the result and files follow-up bugsauditcompares the project against design intent and seeds the next planPlanning agents create executable task graphs. They do not implement the plan themselves.
project_planReads design docs, current code, conformance reports, and project status, then creates a small dependency graph for the next autonomous sprint.
planSurveys an arbitrary codebase and creates tasks instead of editing files. It is useful when the project needs decomposition before execution.
python_planPlans Python work with stack-aware validation, module ownership, and task descriptions that execution agents can run without seeing the planner context.
These are the code-changing workers. They share the same basic loop: understand the task, edit the project, validate the change, and commit.
featureBuilds a new capability end to end. Feature agents are expected to wire systems into reachable flow, update observable state, add tests, validate, and commit.
bugStarts from a concrete failure, reproduces or localizes it, makes the smallest defensible fix, then runs targeted and broader validation.
refactorReduces oversized or tangled files while preserving behavior. Refactor agents extract logical systems and validate after each meaningful step.
art_passImproves visual quality after core systems exist, using project assets, screenshots, and visual review rather than treating appearance as an afterthought.
polishTightens the shipped feel: transitions, button feedback, menu flow, HUD clarity, audio cues, animation, game feel, and visual consistency.
QA agents validate completed work. Vision, harness, hybrid, and scenario variants mostly differ by tools and evidence format.
qaLaunches the app or game, captures screenshots, inspects behavior and visuals, and files bugs when the result does not match expectations.
harness_qaUses a state server or test harness protocol to poll checkpoints, assert game state, and create repair tasks for failed milestones.
scenario_qaWrites a focused scenario JSON file, launches the game, and runs deterministic steps such as captures, button presses, waits, state assertions, and invariants.
hybrid_qaCombines harness/state validation with visual or manual-style checks, keeping generated bugs attached to the active QA chain.
auditCompares the codebase against the design and writes a conformance report, then seeds the next planning pass when important gaps remain.
Research agents diagnose, summarize, or assess state. They may file follow-up work, but they should not quietly turn into implementation agents.
researchInvestigates a specific question without changing product code. It records method, findings, recommendations, and confidence.
triageAssesses a broken or unfamiliar project by checking scripts, tests, scenes, runtime behavior, and project completeness.
audit_learningsLooks across recent agent learnings and groups recurring lessons by task type so future work can inherit operational patterns.
Meta-agents operate above a single task queue. They read broadly across the swarm, summarize patterns, and create bounded follow-up tasks without directly editing project code.
gardenerRuns every 6 hours by default across active game projects. It reads recent failures, completed tasks, and agent logs, then identifies repeated bugs such as state-server port collisions or Godot upgrade regressions.
data/swarm_knowledge.jsonl, a readable data/SWARM_KNOWLEDGE.md report, and targeted fix tasks on affected projects.librarianCloses the loop between real task failures and the prompts that shape future agents. It groups recurring failures by task type and finds likely prompt instruction gaps.
data/LIBRARIAN_REPORT.md report, optional swarm knowledge entries, and bounded prompt-refactor tasks on the controller.cartographerTurns queue state, health scores, recent outcomes, and known patterns into a readable map of what every active project is doing and where it is stuck.
data/PROJECT_MAP.md for humans and data/SWARM_SUMMARY.json for dashboards and other meta-agents.meta_auditorAudits systemic quality issues across projects, separate from a single-project audit. It looks for template drift, missing required files, recurring anti-patterns, and dependency hygiene problems.
data/AUDIT_REPORT.md, coordinated fix tasks, and template sync tasks when shared project scaffolding has drifted.schedulerBalances future scheduling decisions using queue composition, project priority, agent slots, quota pressure, and project health. It can adjust config, but it does not kill running agents.
data/SCHEDULER_LOG.md plus config adjustments such as pause recommendations or agent ceiling changes.archaeologistInvestigates projects that have gone silent, exhausted their queue after failures, or entered repeated recovery chains. It is diagnosis-first and read-only on project code.
ARCHAEOLOGY_REPORT.md and a recovery task DAG that explains what should happen next.// variants
The controller also carries Python and TypeScript versions of the core implementation prompts. They preserve the same task semantics while adapting setup, test commands, validation expectations, and repository conventions to the target stack.
python/planStack-aware Python planning prompt used by the python_plan task type.
python/featureNew Python behavior with project-specific tests and validation.
python/bugTargeted Python fixes with reproduction and regression checks.
python/refactorBehavior-preserving Python cleanup and module extraction.
typescript/featureNew TypeScript behavior with stack-aware build and test checks.
typescript/bugFocused TypeScript repair using local diagnostics and tests.
typescript/refactorType-safe restructuring without changing external behavior.
// extensibility
Agent profile plugins register a task type, role family, prompt, permission profile, tool allowlist or blocklist, and optional context providers. That means a project can introduce specialized roles such as lore review, accessibility audit, or release-readiness checks while keeping scheduler behavior explicit.