Job description:
- Design and implement CI/CD pipelines using tools like Jenkins, GitLab CI, or GitHub Actions.
- Manage Infrastructure as Code (IaC) using Terraform, Ansible, or CloudFormation.
- Monitor application and infrastructure performance using observability tools such as Prometheus, Grafana Stack, and the ELK Stack.
- Ensure security best practices in deployment workflows and cloud environments, including secrets management and vulnerability scanning.
- Collaborate with development, QA, and operations teams to streamline software delivery and incident response.
- Automate system provisioning, configuration, and deployment processes.
- Maintain and improve container orchestration platforms like Kubernetes and Docker Swarm.
- Participate in incident management, post-mortems, and root cause analysis to improve system reliability.
Requirements:
- Bachelor's degree in Computer Science, Software Engineering, or a related field.
- Minimum 3 years of experience in DevOps, Site Reliability Engineering (SRE), or related roles.
- Proficiency in scripting languages such as Python, Bash, or Go.
- Hands-on experience with cloud platforms (AWS, Azure, GCP).
- Familiarity with DevSecOps practices and tools.
- Experience working in a similar business domain or industry is highly preferred.
- Certifications such as AWS or GCP DevOps Engineer, CKA/CKAD, or Docker Certified Associate are a plus.
Soft Skills:
- Strong collaboration and communication skills across cross-functional teams.
- Analytical mindset with a focus on problem-solving and continuous improvement.
- Ability to work in a fast-paced, agile, and dynamic environment.
- Proactive attitude toward automation, efficiency, and resilience.