Mô tả công việc
Working time:
- USA Work Hours: The resource will attend a 3- hour daily standup Monday- Friday, likely from 9 PM to 12 AM VNT.
- Remaining Work Hours: Standard Vietnam work hours (VNT)- 09:30 AM to 04:00 PM.
- Expectation to Travel to USA: The expectation is 1 to 4 trips per year, with each trip lasting one to two weeks. No plans this year for on- site to USA.
- Maintenance Work Hours: The resource will need to work USA hours for three days every three months to perform maintenance on key production systems.
- Manage Jenkins plugins, master/agent nodes, and pipeline libraries to ensure the stability and scalability of our CI/CD platform.
- Troubleshoot and debug automation code and interconnected systems to quickly identify and resolve issues, ensuring minimal disruption to services.
- Implement and manage infrastructure as code, monitoring, and logging solutions to ensure high availability and performance of our systems.
- Effectively communicate complex technical concepts to both technical and non- technical stakeholders through clear written and verbal communication.
- Collaborate with development teams to improve the entire software development lifecycle, from code to production.
- Skilled in implementing security compliance measures, including repaving infrastructure, key rotation, and periodic updates to meet industry standards.
- Develop and maintain workflows in Airflow to orchestrate complex data and application tasks.
- Containerize applications using Docker to ensure consistency across development, testing, and production environments.
- Troubleshoot and resolve production incidents, participate in on- call rotation, perform root cause analysis and perform key maintenance activities quarterly.
- Strong knowledge of monitoring and alerting systems, including Prometheus, Cloud Monitoring, and PagerDuty, to ensure system reliability and proactive incident response.
- Build, configure, and maintain CI/CD pipelines using Jenkins and Groovy scripts to automate software delivery from code commit to production deployment.
- Manage core GCP services including Compute Engine, Managed Instance Groups (MIG), Disk Snapshots, Storage, and Artifact Registry to support our application ecosystem.
- Strong expertise in managing and repaving Windows and Linux machines, ensuring security compliance through automated processes.
- Design and manage infrastructure on Google Cloud Platform (GCP) using Terraform for Infrastructure as Code (IaC).