Job Responsibilities:
Lead the strategic planning and technical architecture of the company’s infrastructure to ensure stable and efficient operations in the cloud.
Drive cloud platform architecture design, build automated operations systems, and implement SRE best practices.
Lead the development of a unified developer platform to enhance engineering efficiency and system reliability.
Promote DevOps process optimization, improve CI/CD pipelines, and implement Infrastructure as Code (IaC) practices.
Oversee cloud service security governance, cost control, and ensure platform SLA commitments.
Collaborate with other technical leaders to foster cross-departmental collaboration and strengthen engineering culture.
Job Requirements:
Bachelor’s degree or above, preferably in Computer Science or related fields, with 10+ years of experience in infrastructure or DevOps/SRE-related roles.
Deep understanding of AWS, GCP, or Azure cloud platforms, with hands-on experience in large-scale production architecture and operations.
Proficient in containerization technologies (Docker/Kubernetes) and familiar with cloud-native services.
Experienced with mainstream DevOps toolchains (e.g., Jenkins, GitHub Actions, AWS CDK, Grafana).
Practical experience in implementing SRE methodologies, including SLI/SLO, incident response, and postmortem processes.
Experience in platform engineering or improving developer experience is a plus.
Minimum 5 years of technical team management experience, with proven ability in team building and organizational development.
Strong communication and cross-functional collaboration skills, with strategic thinking and solid execution capabilities.