VIOS — Vendor Intelligence Operating System

v1.2 — 3 revisions in 27 seconds

Statement of Work

Vertex AI APAC Scale-Up — Phase 1

SOW-2026-NEON-001 · Generated by CVMO Co-Creation Engine

Executive Summary

This Statement of Work covers the deployment of a 30-person engineering team to support the scaling of Google Vertex AI inference workloads across the APAC region. The engagement addresses a 340% surge in demand identified by internal demand-signal analysis.

Phase 1 delivers ML engineering, platform architecture, and MLOps capabilities over a 12-month engagement window. BigQuery data pipeline integration is deferred to Phase 2 pending Q3 FY26 review.

Scope of Work

Deploy and optimize Vertex AI inference endpoints across 3 APAC regions
Scale ML model serving infrastructure to handle 340% workload growth
Implement autoscaling and cost-optimization for inference pipelines
Establish platform architecture for multi-region model deployment
MLOps pipeline: model monitoring, drift detection, automated retraining (added in v1.1)
BigQuery data pipeline integration (deferred in v1.2; moved to Phase 2)
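
The 340% figure above can be read as growth on top of the current baseline, which would put target load at roughly 4.4x today's level. The sketch below sizes serving replicas under that reading. The function name and the baseline replica counts are hypothetical and are not taken from this SOW; real sizing would also depend on per-replica throughput and headroom targets.

```python
def required_replicas(baseline_replicas: int, growth_pct: int) -> int:
    """Replicas needed if load grows by growth_pct percent over baseline.

    Assumes load scales linearly with replica count (an assumption,
    not a measured property of the workloads in this engagement).
    """
    # Target load as a percentage of baseline, e.g. 340% growth -> 440%.
    target_pct = 100 + growth_pct
    # Integer ceiling division avoids floating-point rounding surprises.
    return (baseline_replicas * target_pct + 99) // 100


# Hypothetical baselines, for illustration only.
print(required_replicas(10, 340))  # 44
print(required_replicas(7, 340))   # 31
```

If the 340% figure is instead meant as the new load relative to baseline (3.4x), pass `growth_pct=240`; the two readings differ by a full baseline's worth of capacity, so the intended meaning is worth confirming before sizing.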

Team Composition

Role	Count	Location	Rate
ML Engineer	20	Singapore / Bangalore	$145/hr
Platform Architect	5	Singapore	$175/hr
Data Engineer	3	Bangalore	$130/hr
MLOps Engineer (new)	2	Bangalore	$155/hr
Total	30	3 locations	—
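
The table lists no rate in the Total row. As a quick cross-check, the following sketch recomputes the headcount and derives a headcount-weighted blended hourly rate from the counts and rates above. The variable names are illustrative, and no assumption is made about billable hours, so no contract value is computed.

```python
# (role, count, hourly rate in USD), copied from the Team Composition table.
team = [
    ("ML Engineer", 20, 145),
    ("Platform Architect", 5, 175),
    ("Data Engineer", 3, 130),
    ("MLOps Engineer", 2, 155),
]

# Total people across all roles.
headcount = sum(count for _, count, _ in team)

# Cost of one hour of the whole team working at list rates.
total_hourly = sum(count * rate for _, count, rate in team)

# Headcount-weighted average rate per person-hour.
blended_rate = total_hourly / headcount

print(f"Headcount: {headcount}")
print(f"Full-team hourly cost: ${total_hourly}")
print(f"Blended rate: ${blended_rate:.2f}/hr")
```

Under the table's figures this gives a headcount of 30, a full-team cost of $4,475 per hour, and a blended rate of about $149.17/hr.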

Timeline & Milestones

Day 0-30

Team mobilization & onboarding

Month 1-3

Infrastructure assessment & initial scaling

Month 4-8

Full deployment & optimization

Month 9-12

Stabilization, MLOps handover & Phase 2 scoping