Yêu cầu công việc
Technical Skills
Cloud Expertise:
Strong GCP experience with migration expertise
Security services: IAM, KMS, WAF, Shield, GuardDuty, Security Hub
Cloud networking: VPC, Transit Gateway, Direct Connect, Global Accelerator
Multi- cloud architecture and management
AWS services mastery: EC2, ECS/EKS, Lambda, RDS, S3, CloudFront, Route53
Expert- level AWS knowledge (Solutions Architect Professional preferred)
DevOps & Automation:
GitOps practices with ArgoCD or Flux
CI/CD platforms: Jenkins, GitLab CI, GitHub Actions, AWS CodePipeline
Configuration management: Ansible, Chef, or Puppet
Infrastructure as Code: Terraform, CloudFormation, AWS CDK
Container orchestration: Kubernetes (EKS), Docker, Helm
Scripting languages: Bash, Go, Python
Site Reliability:
Monitoring/Observability: Prometheus, Grafana, ELK, Datadog, New Relic
Chaos engineering tools: Gremlin, Chaos Monkey
Incident management: PagerDuty, Opsgenie
Performance tuning and capacity planning
APM and distributed tracing: OpenTelemetry, Jaeger
SRE practices: Error budgets, SLI/SLO definition, toil reduction
Leadership SkillsExcellent stakeholder management across technical and business teams
Experience managing remote and distributed teams
Budget management and cost optimization experience
Change management expertise for large- scale transformations
Strong project management and organizational skills
Proven ability to lead and inspire technical teams
Soft SkillsStrong problem- solving skills with calm demeanor during incidents
Ability to influence and drive consensus across organizations
Excellent written and verbal communication skills in both English and Vietnamese
Adaptable to rapidly changing requirements and technologies
Mentoring mindset with passion for developing talent
Strategic thinking with ability to balance long- term vision with immediate needs
ExperienceExperience with enterprise B2B SaaS platforms at scale (millions of requests/day)
7+ years of DevOps/SRE experience with 3+ years in a leadership role
Proven experience leading large- scale cloud migrations (GCP to AWS preferred)
Demonstrated success improving system reliability from
Track record of managing DevOps teams of 4+ engineers
Preferred Qualifications
Serverless architecture and event- driven systems
AWS Certified DevOps Engineer or Solutions Architect Professional
Experience with computer vision and image processing pipelines
Public speaking experience at DevOps/SRE conferences
Experience with regulated environments (SOC2, ISO 27001, GDPR)
Experience with AI/ML workload infrastructure and GPU clusters
FinOps certification or demonstrated cost optimization achievements
Knowledge of automotive industry compliance and regulations
Contributions to open- source DevOps/SRE tools
Experience scaling startups to enterprise level