Site Reliability Engineer (SRE)

Role Overview

We are seeking a skilled Site Reliability Engineer (SRE) with a strong software engineering background and a passion for automation. You will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and trading platforms.

This role applies a software engineering mindset to operational challenges: automating processes, improving observability, and enhancing the stability of production systems. You’ll work closely with development, infrastructure, and DevOps teams to deliver robust, resilient systems that power MultiBank’s global trading operations.

Key Responsibilities

Ensure system reliability, uptime, and performance across production environments.
Apply automation to reduce manual effort, eliminate operational toil, and improve efficiency.
Design and implement monitoring, alerting, and observability systems to detect and resolve issues proactively.
Collaborate with development and operations teams to build scalable and resilient systems.
Contribute to incident response, root-cause analysis, and post-mortem reviews, driving continuous improvement.
Implement and refine Infrastructure as Code (IaC) and CI/CD pipelines for reliable deployments.
Enhance disaster recovery and business continuity capabilities to ensure uptime SLAs are met.
Participate in capacity planning, performance tuning, and resource optimization.
Integrate security and compliance best practices into all infrastructure operations.
Support the release process, ensuring safe and efficient production deployments.
Stay current with emerging SRE tools, frameworks, and cloud technologies to continuously improve reliability practices.

Qualifications & Skills

Bachelor’s degree in Computer Science, Information Technology, or a related field.
3+ years of experience as a Site Reliability Engineer or in a DevOps/Infrastructure role.
Strong background in software engineering, automation, and infrastructure management.
Hands-on experience with containerization (Docker) and orchestration tools like Kubernetes.
Proficiency in Linux administration, networking, and system security.
Familiarity with Infrastructure as Code (IaC) tools and principles.
Proficiency with scripting languages such as Python, Bash, or PowerShell.
Experience managing CI/CD pipelines using tools such as Jenkins, GitLab CI/CD, or similar.
Expertise with monitoring and observability tools (Prometheus, Grafana, Zabbix, etc.).
Solid understanding of cloud platforms (AWS, Azure) and related services (e.g., AWS EKS, EC2, S3, RDS, Lambda).
Knowledge of incident management and on-call best practices.
Excellent analytical and problem-solving skills with a proactive mindset.
Relevant certifications (AWS, Kubernetes, or SRE-related) are a plus.

Why Join Us?

Work with an industry-leading global financial institution.
Competitive salary and comprehensive employee benefits.
Opportunities for professional growth and career advancement.
Collaborative, inclusive, and dynamic work environment.
Commitment to innovation and professional excellence.
Become part of our international community at MultiBank Group, dedicated to excellence, innovation, and shaping the future of finance.

To Apply

Please upload your CV to apply for this position.
If shortlisted, we will request additional details on your experience improving system reliability, automation, and observability in production environments.
