Lead DevOps Engineer Go, Python

PAVE
Mức lương
Đang cập nhật
Địa điểm làm việc
Quận 1, Hồ Chí Minh
Kinh nghiệm yêu cầu
Cập nhật
Thông tin cơ bản

Mô tả công việc

Mô tả công việc

We&039;re seeking an experienced Lead DevOps Engineer to spearhead our critical infrastructure transformation as PAVE.ai scales to enterprise level. This role will lead the strategic migration from Google Cloud Platform to AWS while building and managing a high- performing DevOps team. As Lead DevOps Engineer at PAVE.ai, you&039;ll architect enterprise- grade infrastructure, establish site reliability engineering practices, and ensure 99.9%+ uptime for our vehicle inspection platform serving global automotive enterprises. This is a pivotal role that will define our infrastructure strategy and operational excellence as we process millions of vehicle inspections for dealerships, fleet operators, insurers, and vehicle marketplaces worldwide.
Cloud Migration Leadership

Optimize costs during and after migration while improving performance and reliability
Document migration processes and create runbooks for knowledge transfer
Design and implement AWS enterprise architecture following Well- Architected Framework principles
Lead and execute the complete migration strategy from GCP to AWS, ensuring zero downtime
Architect hybrid cloud solutions during transition phase to maintain business continuity
Create detailed migration roadmaps with clear milestones, risk assessments, and rollback plans

Team Leadership & Development

Create career development paths and training programs for team members
Lead incident response and post- mortem processes to drive continuous improvement
Define team structure, roles, and responsibilities for 24/7 operational coverage
Establish DevOps culture and best practices across the engineering organization
Foster collaboration between DevOps, development, and security teams
Build and lead a world- class DevOps team, including hiring, mentoring, and performance management

Site Reliability Engineering (SRE)

Establish and maintain SLIs, SLOs, and SLAs for all critical services
Implement chaos engineering practices to identify and fix potential failures
Build automated incident detection and response systems
Create capacity planning models to support 10x growth
Ensure 99.9%+ uptime for production systems through proactive reliability engineering
Design and implement comprehensive monitoring and observability strategies

Infrastructure & Automation

Implement auto- scaling and self- healing infrastructure
Design scalable, secure, and cost- effective AWS infrastructure for enterprise workloads
Design disaster recovery and business continuity strategies
Build CI/CD pipelines supporting multiple deployment strategies (blue- green, canary)
Implement Infrastructure as Code (IaC) using Terraform/CloudFormation
Automate security compliance and governance using AWS native tools
Develop and enhance logging systems and observability tools (ongoing improvement initiative)

Enterprise Platform Development

Architect multi- tenant infrastructure supporting enterprise isolation requirements
Design data residency and compliance solutions for global operations
Implement enterprise- grade security including VPN, SSO, and zero- trust networking
Create developer self- service platforms to accelerate delivery
Build platform services for logging, monitoring, secrets management, and service mesh
Establish FinOps practices for cloud cost optimization

Strategic Planning

Develop long- term infrastructure roadmap aligned with business objectives
Establish vendor relationships and manage AWS enterprise support
Evaluate and introduce new technologies to improve operational efficiency
Create business cases for infrastructure investments with ROI analysis
Drive infrastructure standardization and consolidation initiatives
Partner with leadership to define technology strategy and investments

Success Metrics

Complete GCP to AWS migration within 6 months with zero critical incidents
Build and retain a high- performing DevOps team with
Reduce infrastructure costs by 30% while improving performance
Reduce MTTR (Mean Time To Recovery) by 50%
Achieve and maintain 99.9% uptime across all production services
Decrease deployment frequency from weekly to multiple times daily

Yêu cầu công việc

Yêu cầu công việc

Technical Skills

Cloud Expertise:

Strong GCP experience with migration expertise
Security services: IAM, KMS, WAF, Shield, GuardDuty, Security Hub
Cloud networking: VPC, Transit Gateway, Direct Connect, Global Accelerator
Multi- cloud architecture and management
AWS services mastery: EC2, ECS/EKS, Lambda, RDS, S3, CloudFront, Route53
Expert- level AWS knowledge (Solutions Architect Professional preferred)

DevOps & Automation:

GitOps practices with ArgoCD or Flux
CI/CD platforms: Jenkins, GitLab CI, GitHub Actions, AWS CodePipeline
Configuration management: Ansible, Chef, or Puppet
Infrastructure as Code: Terraform, CloudFormation, AWS CDK
Container orchestration: Kubernetes (EKS), Docker, Helm
Scripting languages: Bash, Go, Python

Site Reliability:

Monitoring/Observability: Prometheus, Grafana, ELK, Datadog, New Relic
Chaos engineering tools: Gremlin, Chaos Monkey
Incident management: PagerDuty, Opsgenie
Performance tuning and capacity planning
APM and distributed tracing: OpenTelemetry, Jaeger
SRE practices: Error budgets, SLI/SLO definition, toil reduction

Leadership Skills

Excellent stakeholder management across technical and business teams
Experience managing remote and distributed teams
Budget management and cost optimization experience
Change management expertise for large- scale transformations
Strong project management and organizational skills
Proven ability to lead and inspire technical teams

Soft Skills

Strong problem- solving skills with calm demeanor during incidents
Ability to influence and drive consensus across organizations
Excellent written and verbal communication skills in both English and Vietnamese
Adaptable to rapidly changing requirements and technologies
Mentoring mindset with passion for developing talent
Strategic thinking with ability to balance long- term vision with immediate needs

Experience

Experience with enterprise B2B SaaS platforms at scale (millions of requests/day)
7+ years of DevOps/SRE experience with 3+ years in a leadership role
Proven experience leading large- scale cloud migrations (GCP to AWS preferred)
Demonstrated success improving system reliability from
Track record of managing DevOps teams of 4+ engineers

Preferred Qualifications

Serverless architecture and event- driven systems
AWS Certified DevOps Engineer or Solutions Architect Professional
Experience with computer vision and image processing pipelines
Public speaking experience at DevOps/SRE conferences
Experience with regulated environments (SOC2, ISO 27001, GDPR)
Experience with AI/ML workload infrastructure and GPU clusters
FinOps certification or demonstrated cost optimization achievements
Knowledge of automotive industry compliance and regulations
Contributions to open- source DevOps/SRE tools
Experience scaling startups to enterprise level

Quyền lợi

Tại sao bạn sẽ yêu thích làm việc tại đây

Competitive Compensation & Perks

Thoughtful appreciation gifts throughout the year.
13th- month bonus
15 days of annual leave.
Attractive salary package.
Premium healthcare coverage for you and your family.

Growth & Learning Opportunities

Work on cutting- edge, large- scale products in the car inspection field.
Clear career paths for both technical experts and aspiring leaders.
Continuous learning programs to sharpen your skills and grow your career.
Learn from everything, everywhere—but be a smart copy- paster, not a copycat!
Be ready to embrace and implement new ideas in a fast- paced environment.

An Inspiring Workplace

Respect and care for your teammates, your environment, and even yourself.
Flexible hybrid work model and a strong focus on work- life balance.
A modern, fully- equipped Office with a well- stocked pantry.
Treat yourself well, and while you’re at it, save the Earth too.
Be motivated, creative, and passionate—we can’t ask for more!

A Mindset for Growth

Always look back at your work and strive to make it better—nothing is perfect, and that’s where you come in.
It’s okay to be late sometimes, but make sure you’re fully accountable and aware of your actions.
Have the courage to move fast, stay flexible, and take full responsibility for every single line of code.

A Dynamic and Open Culture

We don’t stick rigidly to the gameplan, so feel free to add or remove your own “blah blah” from this list. 😉

Cập nhật gần nhất lúc: 2025-10-08 05:10:03

Xem thêm

Đặc điểm công việc

Hạn nộp hồ sơ
11/11/2025
Hình thức làm việc
Đang cập nhật
Cấp bậc
Nhân Viên
Số lượng cần tuyển
Đang Cập Nhật
Ngành nghề
IT phần mềm
Khu vực
Quận 1, Hồ Chí Minh
Xem thêm
Xem thêm
Người tìm việc lưu ý:
Bạn đang xem tin Lead DevOps Engineer Go, Python - Mã tin đăng: 5317757. Mọi thông tin liên quan tới tin tuyển dụng này là do người đăng tin đăng tải và chịu trách nhiệm. Chúng tôi luôn cố gắng để có chất lượng thông tin tốt nhất, nhưng chúng tôi không đảm bảo và không chịu trách nhiệm về bất kỳ nội dung nào liên quan tới tin việc làm này. Nếu người tìm việc phát hiện có sai sót hay vấn đề gì xin hãy báo cáo cho chúng tôi

PAVE

Quy mô: Cập nhật
Trụ sở: Cập nhật

Bí kíp tìm việc an toàn

Dưới đây là những dấu hiệu của các tổ chức, cá nhân tuyển dụng không minh bạch:
1. Dấu hiệu phổ biến:
Hình ảnh 1
Nội dung mô tả công việc sơ sài, không đồng nhất với công việc thực tế
Hình ảnh 2
Hứa hẹn "việc nhẹ lương cao", không cần bỏ nhiều công sức dễ dàng lấy tiền "khủng"
Hình ảnh 3
Yêu cầu tải app, nạp tiền, làm nhiệm vụ
Hình ảnh 4
Yêu cầu nộp phí phỏng vấn, phí giữ chỗ...
Hình ảnh 5
Yêu cầu ký kết giấy tờ không rõ ràng hoặc nộp giấy tờ gốc
Hình ảnh 6
Địa điểm phỏng vấn bất bình thường
2. Cần làm gì khi gặp việc làm, công ty không minh bạch:
- Kiểm tra thông tin về công ty, việc làm trước khi ứng tuyển
- Báo cáo tin tuyển dụng với 123job thông qua nút "Báo cáo tin tuyển dụng" để được hỗ trợ và giúp các ứng viên khác tránh được rủi ro
- Hoặc liên hệ với 123job thông qua kênh hỗ trợ ứng viên của 123job:
Hotline: 0961.469.398

Việc làm đề xuất liên quan

Việc làm đã xem gần đây

Từ khóa tìm việc làm tại 123Job
Lead devops engineer tại tỉnh/thành