Part 9: Frontier Practices
Latest hot topics: Computer Use, Agentic Coding, Background Agents, Tiered Model Strategy
Chapter List
| Chapter | Title | Core Question | Shannon Reference |
|---|---|---|---|
| 27 | Computer Use | How do Agents operate browsers and desktops? | config/models.yaml multimodal |
| 28 | Agentic Coding | How to build code generation Agents? | file_ops.py, wasi_sandbox.rs |
| 29 | Background Agents | How to implement async long-running tasks? | schedules/manager.go |
| 30 | Tiered Model Strategy | How to optimize 50-70% of costs? | config/models.yaml, manager.py |
Chapter Summaries
Chapter 27: Computer Use
When Agents get "eyes" and "hands": From calling APIs to operating real interfaces
Core Content:
- Perceive-Decide-Execute Loop: Screenshot understanding -> Coordinate calculation -> Click/Input -> Result verification
- Multimodal Model Integration: Visual understanding is the key capability for Computer Use
- Coordinate Calibration: Handling different resolutions and DPI scaling differences
- Safety Protection: Dangerous zone detection, input content filtering, OPA policy extensions
- Verification Loop: Screenshot to verify results after each operation, auto-retry on failure
Shannon Code: config/models.yaml (multimodal_models), suggested tool extension patterns
Chapter 28: Agentic Coding
Making Agents your programming partner: From code generation to complete development workflows
Core Content:
- Safe File Operations: Whitelist directories, path validation, symlink protection
- WASI Sandbox Execution: Fuel/Epoch limits, memory isolation, timeout control
- Code Reflection Loop: Generate -> Review -> Improve iteration process
- Multi-file Edit Coordination: Atomic changes, backup rollback mechanism
- Git Integration: Branch management, auto-commit, PR creation
Shannon Code: python/llm-service/llm_service/tools/builtin/file_ops.py, rust/agent-core/src/wasi_sandbox.rs, go/orchestrator/internal/workflows/patterns/reflection.go
Chapter 29: Background Agents
Keep tasks running in the background: Temporal scheduling and scheduled task systems
Core Content:
- Temporal Schedule API: Native Cron scheduling, pause/resume, timezone support
- Resource Limits: MaxPerUser (50), MinCronInterval (60min), MaxBudgetPerRunUSD ($10)
- ScheduledTaskWorkflow: Wrapper workflow, records execution metadata (model, tokens, cost)
- Orphan Detection: Periodically detect Temporal and database state inconsistencies, auto-cleanup
- Budget Injection: Cost tracking and limits per execution
Shannon Code: schedules/manager.go, scheduled_task_workflow.go
Chapter 30: Tiered Model Strategy
Smart routing to achieve 50-70% cost reduction: Not every task needs the strongest model
Core Content:
- Three-tier Architecture: Small (50%) / Medium (40%) / Large (10%) target distribution
- Priority Routing: Multiple providers per tier selected by priority, auto-Fallback
- Complexity Analysis: Auto-select model tier based on task characteristics
- Capability Matrix: multimodal, thinking, coding, long_context capability markers
- Circuit Breaking Degradation: Circuit Breaker + auto-degradation to backup providers
- Cost Tracking: Centralized pricing configuration, real-time cost monitoring
Shannon Code: config/models.yaml, llm_provider/manager.py
Learning Objectives
After completing this Part, you will be able to:
- Understand Computer Use's perceive-decide-execute loop
- Design safe Agentic Coding workflows (sandbox + reflection)
- Use Temporal Schedule API to implement scheduled background tasks
- Configure three-tier model strategy to achieve 50-70% cost reduction
- Add frontier capabilities to Research Agent (v0.9)
Shannon Code Guide
Shannon/
├── config/
│ └── models.yaml # Three-tier model config, pricing, capability matrix
├── go/orchestrator/
│ └── internal/
│ ├── schedules/
│ │ └── manager.go # Scheduled task manager (CRUD, resource limits)
│ └── workflows/scheduled/
│ └── scheduled_task_workflow.go # Wrapper workflow
├── python/llm-service/
│ ├── llm_provider/
│ │ └── manager.py # LLM manager (routing, circuit breaking, Fallback)
│ └── llm_service/tools/builtin/
│ ├── file_ops.py # Safe file read/write tools
│ └── python_wasi_executor.py # Python sandbox execution
└── rust/agent-core/src/sandbox/
└── wasi_sandbox.rs # WASI sandbox implementation
Hot Topic Correlations
| Topic | Representative Products | Shannon Implementation | Chapter |
|---|---|---|---|
| Computer Use | Claude Computer Use, Manus | Multimodal + Tool extensions | Ch27 |
| Agentic Coding | Claude Code, Cursor, Windsurf | WASI sandbox + File tools | Ch28 |
| Background Agents | Claude Code Ctrl+B | Temporal Schedule API | Ch29 |
| Cost Optimization | Enterprise cost reduction needs | Three-tier model strategy | Ch30 |
Cost Optimization Example
Without tiering (all Large):
1M requests x $0.09/request = $90,000/month
With tiered strategy (50/40/10):
Small: 500K x $0.0006 = $300
Medium: 400K x $0.0018 = $720
Large: 100K x $0.09 = $9,000
Total: $10,020/month
Savings: $79,980/month (89%)
Prerequisites
- Part 1-8 completed (especially Part 7-8 production architecture and enterprise features)
- Browser automation basics (Playwright/Puppeteer) - optional
- Cron expression basics - optional
- Multi-model API experience - optional
Research Agent v0.9
Frontier capability modules covered in this Part:
| Module | Chapter | Capability |
|---|---|---|
| Computer Use | Chapter 27 | Web browsing, content extraction |
| Agentic Coding | Chapter 28 | Analysis script generation |
| Background Agents | Chapter 29 | Scheduled research reports |
| Tiered Models | Chapter 30 | Smart model selection |
Final Form:
User: "Generate an AI industry daily report at 9 AM every day"
Research Agent v0.9:
1. [Schedule] Create Cron scheduled task (0 9 * * *)
2. [Tiered] Use Small model for complexity assessment
3. [Multi-Agent] Parallel search/analysis/writing
4. [Browser] Access websites without APIs for content extraction
5. [Coding] Generate data visualization scripts
6. [Budget] Control per-execution cost < $2
7. [Output] Send structured report