Back to jobs

DevOps & Site Reliability Engineer

Job description

About the Role


Our client is a growing payment platform company looking to hire a DevOps & Site Reliability Engineer to support, maintain, and scale their production systems. This role will work closely with engineering teams to ensure system reliability, performance, and security.

Key Responsibilities


* Manage and maintain cloud infrastructure (AWS / Azure / GCP)
* Build, maintain, and improve CI/CD pipelines
* Ensure high availability, performance, and reliability of systems
* Monitor system health, troubleshoot incidents, and perform root cause analysis
* Automate infrastructure provisioning and deployment using IaC tools
* Support application deployments and production releases
* Work closely with developers to improve system scalability and resilience
* Ensure security best practices across infrastructure and deployments

Requirements


* At least 5 years of experience in DevOps, SRE, or Infrastructure Engineering
* Strong experience with cloud platforms (AWS preferred)
* Hands-on experience with CI/CD tools (e.g. Jenkins, GitHub Actions, GitLab)
* Experience with containerization and orchestration (Docker, Kubernetes)
* Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK, etc.)
* Scripting experience (Bash, Python, or similar)
* Experience supporting production systems in a high-availability environment
* FinTech or payments industry experience is a plus