Site Reliability Engineer
We are forming a rapid-response team to resolve business-impacting technical incidents, shore up processes and build automation that reduces or mitigates downtime. In this role you will be the first point of technical escalation for issues within our infrastructure, both in the cloud and on-prem. You will also participate in stand-ups with development teams and keep your squad informed of updates and changes to our platform. The role focuses on automating everything, including workflows and tooling such as deployments of distributed applications and infrastructure, using various scripting languages so that our 24/7 Incident Engineers can mitigate incidents without escalation. The Site Reliability Engineer analyses, diagnoses and solves issues in the production environment with minimal escalation to 3rd-level support teams. The position also involves participating in the Change Management process by reviewing RFCs against the “Definition of Done”, as well as executing and supporting software and hardware deployments. Developing and documenting ways of working between the LiveOps (NOC) Team and the development teams, to make diagnostics and impact mitigation more efficient, is another key aspect of the role.
- Being the first point of technical escalation for issues within our infrastructure, both in the cloud and on-prem.
- Participating in stand-ups with the development teams and informing your squad of updates and changes to our platform.
- Automating everything: workflow and tool automation, such as deployments of distributed applications and infrastructure, using various scripting languages so that our 24/7 Incident Engineers can mitigate incidents without escalation.
- Analysing, diagnosing and solving issues in the production environment with a minimal number of escalations to 3rd-level support teams.
- Participating in the Change Management process by reviewing RFCs against the “Definition of Done”, as well as executing and supporting software and hardware deployments.
- Developing and documenting ways of working between the LiveOps (NOC) Team and the development teams to improve efficiency in diagnostics and impact mitigation.
- Supporting and troubleshooting.
- Using automation and configuration management tools (Octopus, TeamCity, Terraform) (required).
- AWS Cloud infrastructure, CDNs, and other various systems running in multiple data centres and environments (required).
- Cloud Application Load Balancer, preferably with experience on AWS ALB (required).
- Cloud DNS support such as AWS Route 53, GCP Cloud DNS, or Azure DNS (required).
- Serverless Computing such as AWS Lambda (required).
- Cloud Firewall such as AWS WAF (required).
- Server virtualisation such as VMware, and IaaS and PaaS cloud platforms such as AWS and Azure (required).
- Open-source monitoring and alerting tools (Prometheus, Loki, Grafana and Jaeger) (required).
- Scripting in Python, Bash, PowerShell or others (required).
- Microsoft SQL databases: using stored procedures, locking/unlocking tables and running SELECT statements to assess impact and diagnose problems (required).
- Bachelor's degree or equivalent experience; a technical degree is beneficial (preferred).
- AWS Cloud Practitioner certification or equivalent would be beneficial (preferred).
- You will be working on a 24/7 shift basis, with the opportunity for remote working on a limited basis.
Betsson is a diversified, multinational gaming group whose history dates back to 1963 and which is now listed on Nasdaq Stockholm. The group employs around 3,000 people of more than 75 nationalities across over 20 locations; Betsson AB is registered in Stockholm, while its operational headquarters in Ta' Xbiex, Malta, run the day-to-day business. Through brands such as Betsson, Betsafe and NordicBet, it offers casino, sportsbook and other gaming products in regulated markets across Europe, the Americas and Central Asia. Its proprietary technology supports a scalable model serving both B2C customers and B2B partners, with responsible growth and customer protection central to its strategy.

