Operations Automation Expert (AWS/Alibaba Cloud)

OKX

OKX

Operations
hong kong
Posted on May 8, 2025

Who We Are

At OKX, we believe that the future will be reshaped by crypto, and ultimately contribute to every individual's freedom. OKX is a leading crypto exchange, and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also a trusted brand by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves. Across our multiple offices globally, we are united by our core principles: We Before Me, Do the Right Thing, and Get Things Done. These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er. OKX is part of OKG, a group that brings the value of Blockchain to users around the world, through our leading products OKX, OKX Wallet, OKLink and more.

About the Opportunity

Lead the design and implementation of an enterprise-level operations automation platform in a multi-cloud environment, integrating AWS and Alibaba Cloud resources to build a standardized, intelligent operational framework that enhances efficiency and reliability.

What You’ll Be Doing:

  1. Automation Platform Design & Development
    1. Lead the architecture design of a cross-cloud (AWS/Alibaba Cloud) operations automation platform, covering core modules such as resource orchestration, monitoring/alerting, self-healing, and cost optimization.
    2. Develop unified operational APIs and a visual console, integrating AWS SDK/Boto3 and Alibaba Cloud OpenAPI/SDK to standardize cross-cloud resource operations.
  2. Toolchain Integration & Optimization
    1. Build end-to-end resource lifecycle management using IaC tools (Terraform, AWS CloudFormation, Alibaba Cloud ROS), enabling one-click environment provisioning and teardown.
    2. Integrate CI/CD pipelines (GitLab, cloud-native toolchains) to automate application deployment, configuration changes, and database migrations.
  3. Intelligent Operations Capability Development
    1. Design an automated operations rule engine, leveraging AI/ML (e.g., anomaly detection, root cause analysis) for predictive fault resolution (e.g., AWS Lambda + CloudWatch event-triggered remediation).
    2. Build a knowledge base system to document SOPs and enable automated execution (e.g., Alibaba Cloud OOS).
  4. Multi-Cloud Coordination & Standardization
    1. Design a unified operations model across AWS and Alibaba Cloud, abstracting common interfaces to address multi-cloud differences (e.g., aligning ECS and EC2 instance management strategies).
    2. Establish operational standards and drive configuration standardization/automated validation across dev, test, and production environments.
  5. Security & Compliance Governance
    1. Embed security baseline checks to automatically scan cloud configurations (e.g., security group rules, IAM policies, Alibaba Cloud RAM permissions) and generate compliance reports.
    2. Automate approval workflows for sensitive operations (e.g., Alibaba Cloud ActionTrail and AWS CloudTrail log-triggered approval tickets).
  6. Cost Optimization Framework
    1. Develop resource utilization analysis tools, leveraging AWS Cost Explorer and Alibaba Cloud Cost Management APIs to generate automated optimization recommendations (e.g., idle resource cleanup, scaling policy tuning).
    2. Design FinOps automation solutions for budget alerts, cost allocation, and multi-dimensional cost visualization.

What We Look For In You:

  1. Technical Skills
    1. Proficient in at least one programming language (Python/Go/Java), with experience in large-scale operations platform development and familiarity with microservices architecture (Spring Cloud/Dubbo) and full-stack technologies.
    2. Deep understanding of AWS and Alibaba Cloud core service APIs, cloud-native technologies (Serverless, K8s Operator), and DevOps toolchains (Ansible, Prometheus).
    3. Skilled in automated testing frameworks to ensure platform stability.
  2. Experience Requirements
    1. 5+ years in DevOps/operations development, with proven experience in designing and deploying enterprise-level automation platforms (e.g., CMDB, operations middleware).
    2. Hands-on experience with AWS/Aliyun hybrid cloud automation tools, including cross-cloud resource synchronization and federated authentication (e.g., Alibaba Cloud RAM SSO, AWS IAM Identity Center).
  3. Soft Skills
    1. Product-oriented mindset, capable of designing user-friendly and efficient operational features.
    2. Strong cross-team collaboration skills to drive adoption of automation platforms across development, operations, and security teams.
  4. Certifications & Education
    1. AWS Certified DevOps Engineer or Alibaba Cloud ACP/ACE (DevOps track) certifications preferred.
    2. Bachelor’s degree or higher in Computer Science, Software Engineering, or related fields.

Perks & Benefits

  • Competitive total compensation
  • Comprehensive insurance coverage for employees and their dependants
  • More that we love to tell you along the process!