Monitoring: SREs monitor software systems around the clock (24x7, in shifts) to ensure their reliability, performance, and availability
- Monitor incidents raised as Jira tickets by the L2 team and follow up with the related teams
- Trace problems using the related logs and documents
- Monitor the log files to manage infrastructure
- Support and respond to issues by email, chat, ticket comments, and phone
Incident Prevention: SREs work proactively to prevent incidents from recurring
- Collaborate with development teams and other stakeholders to identify potential risks
- Identify and analyze issues using logs and documents
- Once an issue is resolved, continue to monitor and review the effectiveness of the solution
Infrastructure Management: Manage infrastructure using tools such as DBeaver, Postman, PuTTY, Visual Studio, and Google Cloud Console
Reporting: Report on the performance of the system and infrastructure
Supporting:
- Help the testing team manage the user database in the application system.
- Assist developers with testing-related or deployment-related requests and problems.
Preventive Maintenance and Improvement:
- Manage observability tools and reliability processes.
- Develop automation for monitoring, deployment, and recovery.
Minimum Qualifications:
Educational Background:
- Bachelor's degree in Computer Science, Information Technology, or a related field, with at least 5 years of experience as an SRE/DevOps engineer
Technical Skills:
- Familiar with Linux/Unix systems, networking concepts, and cloud platforms (e.g., Google Cloud, AWS).
- Experience using tools such as DBeaver, Postman, PuTTY, Visual Studio, and Google Cloud Console.
- Knowledge of monitoring and observability tools (e.g., Grafana, Prometheus, ELK, or similar).
- Basic understanding of databases (SQL/NoSQL) and application log analysis.
- Experience in automation or scripting (e.g., Bash, Python, or Shell) is a plus.
Soft Skills:
- Strong analytical thinking and problem-solving ability.
- Good communication and teamwork skills, with the ability to collaborate effectively with developers and testers.
- Able to work under pressure, especially during system incidents or outages.
- Detail-oriented and proactive in preventing potential issues.
Additional Advantages:
- Familiar with CI/CD processes or DevOps concepts.
- Experience handling incident response and performance optimization.
- Understanding of system reliability principles (SLA, SLO, SLI).
Willing to be placed in the banking sector
Tiga Daya Digital Indonesia, a subsidiary of Triputra Group and DCI Group, aims to be an IT partner that enables rapid client growth. Eksad provides high-quality services based on strong experience in the industry and technology, building the right IT service solutions to help its partners speed up business development based on digital technology by providing professional, highly competent resources.
Vision: To be the preferred IT partner in the region.
Mission: Establish excellent end-to-end IT services that enable clients to grow their business rapidly through highly competent and professional resources.