v1.2
Statement of Work
Vertex AI APAC Scale-Up — Phase 1
SOW-2026-NEON-001 · Generated by CVMO Co-Creation Engine
Executive Summary
This Statement of Work covers the deployment of a 30-person engineering team to support the scaling of Google Vertex AI inference workloads across the APAC region. The engagement addresses a 340% surge in demand identified by internal demand-signal analysis.
Phase 1 delivers ML engineering, platform architecture, and MLOps capabilities over a 12-month engagement window. BigQuery data pipeline integration is deferred to Phase 2 pending Q3 FY26 review.
Scope of Work
- Deploy and optimize Vertex AI inference endpoints across 3 APAC regions
- Scale ML model serving infrastructure to handle 340% workload growth
- Implement autoscaling and cost-optimization for inference pipelines
- Establish platform architecture for multi-region model deployment
- MLOps pipeline: model monitoring, drift detection, automated retraining (added in v1.1)
- BigQuery data pipeline integration (deferred in v1.2; moved to Phase 2)
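For sizing discussions, the 340% growth figure can be turned into a serving-capacity target. The sketch below is illustrative only: it assumes the 340% is growth on top of the current baseline (so total load is 4.4x today's), that capacity scales linearly with replica count, and the baseline replica counts are hypothetical values not taken from this SOW.

```python
def target_replicas(baseline_replicas: int, growth_pct: int) -> int:
    """Replicas needed after growth, assuming load scales linearly with replicas.

    growth_pct is growth on top of the baseline (assumption): 340 means
    the new load is (100 + 340)% = 4.4x the current load.
    """
    scaled = baseline_replicas * (100 + growth_pct)
    # Integer ceiling division avoids floating-point rounding surprises.
    return (scaled + 99) // 100


if __name__ == "__main__":
    # Hypothetical baselines, for illustration only.
    for baseline in (7, 10):
        print(baseline, "->", target_replicas(baseline, 340))
```

If the 340% figure instead means that total demand is 340% of the baseline, the multiplier is 3.4x and the targets drop accordingly, so the intended reading should be confirmed before it feeds any autoscaling limits.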
Team Composition
| Role | Count | Location | Rate |
|---|---|---|---|
| ML Engineer | 20 | Singapore / Bangalore | $145/hr |
| Platform Architect | 5 | Singapore | $175/hr |
| Data Engineer | 3 | Bangalore | $130/hr |
| MLOps Engineer (new) | 2 | Bangalore | $155/hr |
| Total | 30 | 3 locations | — |
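The table's totals can be cross-checked with a few lines of arithmetic. The sketch below uses only the counts and rates listed above. The combined hourly figure assumes all 30 people bill at the same time at their listed rates, which this SOW does not state, and the function names are illustrative.

```python
# (role, headcount, hourly rate in USD) as listed in the Team Composition table
TEAM = [
    ("ML Engineer", 20, 145),
    ("Platform Architect", 5, 175),
    ("Data Engineer", 3, 130),
    ("MLOps Engineer", 2, 155),
]


def headcount(team) -> int:
    """Total number of people across all roles."""
    return sum(count for _, count, _ in team)


def hourly_run_rate(team) -> int:
    """Combined USD per hour if every team member bills concurrently (assumption)."""
    return sum(count * rate for _, count, rate in team)


def blended_rate(team) -> float:
    """Headcount-weighted average hourly rate in USD."""
    return hourly_run_rate(team) / headcount(team)


if __name__ == "__main__":
    print("Headcount:", headcount(TEAM))
    print("Hourly run rate: $%d" % hourly_run_rate(TEAM))
    print("Blended rate: $%.2f/hr" % blended_rate(TEAM))
```

With the listed figures this gives a headcount of 30, a combined rate of $4,475 per hour, and a blended rate of about $149.17 per hour. Total engagement cost is not derived here because billable hours are not specified in this document.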
Timeline & Milestones
- Day 0-30: Team mobilization & onboarding
- Month 1-3: Infrastructure assessment & initial scaling
- Month 4-8: Full deployment & optimization
- Month 9-12: Stabilization, MLOps handover & Phase 2 scoping