Staff/Senior Staff Engineer, Public Cloud Operations
TLDR
Lead the full lifecycle management of enterprise-grade hybrid cloud infrastructure across AWS and Alibaba Cloud, driving high availability, cost optimization, security, and cross-team collaboration.
Who We Are
About The Opportunity
We are currently seeking a Public Cloud Operations Expert to join our Singapore team. You be will responsible for the full lifecycle management of enterprise-level hybrid cloud infrastructure, leading unified orchestration, operations, and cost optimisation of AWS and Alibaba Cloud resources, ensuring high availability, high performance, and compliance.
What You’ll Be Doing
-
Cloud Platform Architecture & Operations
-
Plan, deploy, monitor, and maintain AWS services (EC2, S3, VPC, Lambda, EKS, etc.) and Alibaba Cloud services (ECS, OSS, VPC, Function Compute, ACK, etc.).
-
Design highly available, auto-scaling cloud architectures, optimizing network (e.g., Alibaba Cloud CEN, AWS Direct Connect), storage, and compute resource configurations.
-
-
Monitoring & Incident Management
-
Implement full-stack monitoring and alerting using cloud-native tools (AWS CloudWatch, Alibaba Cloud CloudMonitor) and open-source solutions (Prometheus+Grafana, ELK).
-
Lead critical incident response, perform root cause analysis, and implement preventive measures (e.g., resource contention, misconfigurations, network latency).
-
-
Cost Optimisation & Resource Management
-
Analyse cloud resource usage, reduce costs via reserved instances, auto-scaling, and storage lifecycle policies (e.g., AWS S3 Intelligent-Tiering, Alibaba Cloud OSS Archive).
-
Establish resource quota management strategies to prevent waste and overspending.
-
-
Security & Compliance
-
Implement cloud security baselines (security groups, IAM policies, Alibaba Cloud RAM permissions, AWS Security Hub), conduct regular security audits, and remediate vulnerabilities.
-
Design granular access controls using AWS IAM and Alibaba Cloud RAM, and enforce database auditing (e.g., AWS CloudTrail + Alibaba Cloud DAS).
-
-
Cross-Team Collaboration & Knowledge Sharing
-
Collaborate with development teams to optimize application architectures and provide cloud-native solutions (Server-less, Microservices).
-
Document operational procedures (SOP manuals) and lead internal technical training sessions.
-
What We Look For In You
-
Technical Skills
-
Mastery of core services (compute/storage/network/security) on AWS or Alibaba Cloud, with familiarity in the other platform.
-
Proficient in Linux/Windows system operations and automation tools (Shell/Python/Ansible).
-
Hands-on experience with containerized operations (Kubernetes, ECS/EKS, ACK) and cloud-native technologies (e.g., Service Mesh).
-
-
Experience Requirements
-
5+ years of operations experience, with at least 3 years focused on public cloud (AWS/Alibaba Cloud) environments managing 100+ instances.
-
Experience in building cloud platforms from scratch, hybrid cloud architecture design, or large-scale migration projects (e.g., IDC-to-cloud) is preferred.
-
-
Team Management
-
Excellent team management and communication skills to lead and collaborate with operations, development, testing, and security teams.
-
-
Certifications & Education
-
AWS Certified SysOps Administrator or Alibaba Cloud ACP/ACE certifications are preferred.
-
Bachelor’s degree or higher in Computer Science, Network Engineering, or related fields.
-
Nice-To-Haves
-
Familiarity with multi-cloud management platforms (AWS, Alibaba Cloud, Azure) or FinOps cost optimisation methodologies.
-
Experience in cloud security practices, including Web Application Firewall (WAF) and DDoS protection (Alibaba Cloud Anti-DDoS Premium, AWS Shield).
-
Exposure to big data/AI operations (e.g., Alibaba Cloud MaxCompute, AWS EMR).
-
Team leadership experience is preferred.
Perks & Benefits
-
Competitive total compensation package
- L&D programs and education subsidy for employees' growth and development
-
Various team building programs and company events
- Wellness and meal allowances
- Comprehensive healthcare schemes for employees and dependants
- More that we love to tell you along the process!
Benefits
Education Stipend
L&D programs and education subsidy for employees' growth and development
Health Insurance
Comprehensive healthcare schemes for employees and dependants
Wellness Stipend
Wellness and meal allowances
OKX operates as a prominent cryptocurrency exchange, enabling users to buy, sell, and trade a wide range of digital assets, including Bitcoin and Ethereum. In addition to facilitating crypto trading, they've developed OKX Wallet, a widely-used platform for accessing decentralized applications and exploring the Web3 landscape.
- Founded
- Founded 2017
- Employees
- 500+ employees
- Industry
- Diversified Financial Services