Back to search:DevOps Engineer / Yogyakarta

Job Brief

Volantis Technology is seeking a highly skilled and motivated 
Mid/Senior DevOps Engineer
 to join our growing team in 
Yogyakarta
. As part of our mission to accelerate digital transformation through AI, data, and automation, you will play a key role in building and maintaining the infrastructure that powers our cutting-edge products and solutions. This position offers the opportunity to work with modern cloud-native technologies, drive process automation, and ensure the performance, scalability, and reliability of mission-critical systems. The ideal candidate is passionate about cloud infrastructure, continuous integration and delivery (CI/CD), and operational excellence, with a strong desire to contribute to a collaborative, fast-paced environment that values innovation and continuous improvement.

Job Description

This is a full-time, on-site position for a 
Mid/Senior DevOps Engineer
 based in 
Yogyakarta
. The role involves leading the design, implementation, tuning, and ongoing operation of data infrastructures and cloud-native application platforms while managing the availability, reliability, performance monitoring, and capacity planning for both application and data layers at scale. The DevOps Engineer will drive continuous improvement by enhancing deployment pipelines, automation, observability, and service operations for maximum efficiency and reliability. Responsibilities also include collaborating on system and architecture reviews, CI/CD design, infrastructure as code (IaC), capacity testing, and launch readiness to ensure robust pre-production support. In production, the role requires maintaining live and staging environments through active monitoring of availability, latency, and performance, as well as leading incident response, root cause analysis, and post-mortem actions. Additionally, the DevOps Engineer will develop and manage automation tooling for repetitive operational tasks, scaling, monitoring, and cost optimization, while producing comprehensive infrastructure documentation and contributing to internal knowledge sharing initiatives.

Job Responsibilities

  • Oversee Implementation and Maintenance:
  • Lead the design, implementation, tuning, and ongoing operation of data infrastructures and cloud-native application platforms.
  • Availability & Performance Management:
  • Manage the availability, reliability, performance monitoring, and capacity planning for both application and data layers at scale.
  • Continuous Improvement:
  • Drive enhancements in deployment pipelines, automation, observability, and service operation for maximal efficiency and reliability.
  • Pre-Production Support:
  • Collaborate on system and architecture review, CI/CD design, infrastructure as code (IaC), capacity testing, and launch readiness reviews to ensure robust service rollouts.
  • Production Support and Incident Response:
  • Maintain live and staging environments by measuring and monitoring availability, latency, and performance. Lead incident response, root cause analysis, and manage post-mortem actions.
  • Automation and Tooling:
  • Develop and manage automation tooling for repetitive operational tasks, scaling, monitoring, and cost optimization.
  • Documentation and Knowledge Sharing:
  • Produce clear, comprehensive infrastructure documentation and contribute to internal knowledge bases.

Job Requirements

  • Bachelor's degree or higher in Computer Science, Software Engineering, or related technical discipline.
  • Hands-on experience with cloud-native architecture (AWS, GCP, or Azure) and associated technologies (e.g., serverless, managed databases, container orchestration).
  • Deep experience with Kubernetes (K8s), Helm, and Docker Compose for deployment and scaling of distributed applications.
  • Experience deploying and managing API gateways (e.g., Kong, Traefik, or Istio), including authentication, observability, and traffic management.
  • Proficiency with Prometheus, Grafana, ELK/EFK stacks, application performance monitoring (APM), and alerting best practices.
  • Competence with Terraform, Ansible, Pulumi, or similar IaC frameworks for repeatable, versioned infrastructure deployment.
  • Strong understanding of SRE principles, SLAs/SLOs, and incident management.
  • Strong ability to debug and optimize code and automate routine operational tasks using Bash, Python, or Go.
  • Methodical, metrics-driven approach to troubleshooting and root cause analysis for large-scale, distributed systems.
  • Deep knowledge of both RDBMS (e.g., PostgreSQL) and NoSQL (e.g., PubSub, Redis) systems, including data replication, backup, and recovery strategies.
  • Excellent communicator, able to clearly articulate complex technical concepts to internal teams and stakeholders.
  • Preferred:
  • Familiarity with GitOps workflows and tools.
  • Experience with secure systems design and regulatory compliance (e.g., GDPR)
  • Experience supporting highly available, multi-region, zero-downtime platforms