Key Responsibilities
Provide comprehensive support and management for our Kubernetes (K8s) container platform.
Design, maintain, and optimize CI/CD pipelines for various applications, supporting deployment strategies such as canary releases, blue-green deployments, and rolling updates, including emergency rollbacks.
Manage continuous delivery and GitOps workflows, ensuring seamless, automated application deployments and updates across Kubernetes clusters.
Manage and monitor service log collection and alerting systems, leveraging tools like Prometheus and the ELK Stack to provide visibility and proactively identify issues.
Maintain platform security, including regular patching and container image vulnerability management.
Collaborate closely with development teams to ensure seamless software integrations and deployment processes.
Handle production incidents, perform thorough root cause analysis (RCA), and produce detailed incident reports.
Requirements
University Degree or Higher Diploma in Information Technology, Computer Science, or a related discipline.
At least 5 years of experience in a Site Reliability Engineer (SRE), DevOps role, Kubernetes, NGINX, Networking principles, Jenkins, GitLab, ArgoCD, and Terraform.
Strong, solid and practical experience in Redhat OpenShift platform is a must.
Strong troubleshooting skills in OpenShift / K8S platforms.
Solid focus on ensuring the availability, performance, and scalability of platforms and services using modern observability and orchestration tools.
Proficiency in scripting languages (e.g., Bash/Python).
Solid experience in system infrastructure, including database technologies (e.g., PostgreSQL, MySQL), Windows/Linux platforms, virtualization platforms (e.g., KVM, Hyper-V), and container technology (Docker, Kubernetes).
RHCE Certification is an advantage.
MCSE Certification is an advantage.
Knowledge of ISO 27001 standards is an advantage.
Excellent collaboration and communication skills.
Good command of both written and spoken English and Chinese.
Ability to work nights and handle on-call emergency situations as required.
Strong awareness of security best practices, compliance requirements, and credential management.
Interested parties, please submit your resume to fiona.tang@manpowergrc.hk
Type: Permanent
Category: I.T & T - Support & Operations/Systems Administration
Reference ID: 210-20260616-FT
Date Posted: 16/06/2026