Responsibilities
- Build and manage infrastructure using Infrastructure as Code (IaC) tools such as Terraform, Helm, or similar technologies.
- Manage and optimize containerized environments using Kubernetes and Docker in production environments.
- Design, develop, and maintain CI/CD pipelines to enable efficient and automated deployment processes.
- Manage and ensure the performance and reliability of large-scale data warehouses and databases, including:
- PostgreSQL
- ClickHouse
- Distributed / High Availability RDBMS
- PostgreSQL
- Implement and maintain monitoring and observability systems using tools such as Prometheus, Grafana, Loki, or similar platforms.
- Support infrastructure requirements for AI / Machine Learning workloads.
Qualifications
- Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field.
- Experience with automation and Infrastructure as Code (IaC) practices.
- Familiarity with Kubernetes, Docker, and container orchestration.
- Experience managing large-scale databases and distributed systems.
- Strong understanding of monitoring, observability, and system reliability.
- Automation mindset with a focus on improving efficiency and reducing manual processes.