Apply now »

Site Reliability Engineer Specialist - 999 Platform

Job Req ID:  47731
Posting Date:  17 Jun 2025
Function:  Software Engineering
Unit:  Networks
Location: 

Snowhill, Birmingham, United Kingdom

Salary:  Competitive with Benefits

Recruiter:

Hiring Manager:

Career Grade:

Internal Closing Date:

Why this job matters

This section should be limited to no more than 5 lines outlining how this job delivers for our customers and our business. 
This role is key in keeping the UK connected and you are responsible for ensuring the 999, DQ and OA services that customers are reliant upon, are available 24 x 7 x 365. 

These services are consumed by all CFUs and are provided to residential customers, small to large businesses, including the financial sector & public services, fixed and mobile Communication Providers and Emergency Authorities.Due to the importance of these services, the platforms that you will be working on are deemed to be Critical National Infrastructure.

What you’ll be doing

Responsible for providing professional and effective 2nd and 3rd line support services and work with the development teams to resolve issues
Responsible for providing technical leadership to drive service restoration in the event of outages, including any remedial actions and ensure all parties who are impacted or can help to address the problem are engaged in a timely fashion to reach a solution
Proactively identifies and manages risk through regular assessment and diligent execution of controls and mitigations, proactively raising any concerns 
Responsible for performing and managing change activities in live and model environments to agreed timescales
Operational responsibility for developing resource plans; prioritizes and allocates work to achieve projects, or components.
Exercise effective control through planning, monitoring, progressing and cost control of the tasks assigned to you and where appropriate, third-party suppliers.
Executes metric/monitoring analysis that creates stability, security, and performance improvements 
Designs, analyses, develops and troubleshoots highly-distributed large-scale production systems spanning on-prem and cloud-based hosting 
Executes approaches that scale systems sustainably through mechanisms like automation and evolves systems by pushing for changes that improve reliability and velocity 
Inspects queue and support processing to ensure early warning of support issues 
Implements robust monitoring and alerting systems and performs root cause analysis and post-mortems with an eye towards future prevention 
Executes retrospective and preventive actions after each high severity production incident 
Analyses complex systems from a reliability and resilience perspective and identifies sources of instability in distributed systems 
Champions, continuously develops and shares with team knowledge on emerging trends and changes in site reliability engineering best practices and industry standards 
Mentors other site reliability engineers, helping to improve the team's abilities by acting as a technical resource

Our leadership standards

Looking in:
Leading inclusively and Safely
I inspire and build trust through self-awareness, honesty and integrity.
Owning outcomes
I take the right decisions that benefit the broader organisation.

Looking out:
Delivering for the customer
I execute brilliantly on clear priorities that add value to our customers and the wider business.
Commercially savvy
I demonstrate strong commercial focus, bringing an external perspective to decision-making.

Looking to the future:
Growth mindset
I experiment and identify opportunities for growth for both myself and the organisation.
Building for the future
I build diverse future-ready teams where all individuals can be at their best.

Skills

Windows 10 and Server OS monitoring, support and software rollout
Active Directory, DNS and WSUS 
Knowledge of IP networking with SIP/RTP protocols
Linux OS support 
Database administration using SQL, MariaDB , REDIS and TimesTen
Can troubleshoot system and network problems and diagnose and solve hardware or software faults 
Proficient at analysing, testing, and evaluating systems   
Demonstrates great customer service skills, listens carefully and exercises patience 
Has experience of call centre support and understands call centre behaviour 
Installs and configures computer hardware, operating systems and applications and replace parts as required 

Benefits

Annual On target bonus 10% (personal and company multipliers)
BT Pension scheme; minimum 5% employee contribution, BT contribution 10%
Life Assurance
Direct share scheme
Exclusive colleague discounts on our latest and greatest BT broadband packages
50% off EE mobile pay monthly or SIM only plans and 50% discount for friends and family on EE SIM only plans
My Discounts gives colleagues access to unbeatable savings on everyday purchases at hundreds of retailers
Discounted EE TV including TNT Sport and the NOW Entertainment membership
Great support for working parents including pay whilst on maternity, adoptive, and paternity leave
Option to join the Healthcare Cash Plan or other benefits such as dental insurance, gym memberships etc.
25 days annual leave (not including bank holidays), increasing with service with buy holiday option
Volunteering days so you can give back to your local community
Brand new electric vehicle salary sacrifice arrangement, known as ‘My EV’

About us

BT Group was the world’s first telco and our heritage in the sector is unrivalled.  As home to several of the UK’s most recognised and cherished brands – BT, EE, Openreach and Plusnet, we have always played a critical role in creating the future, and we have reached an inflection point in the transformation of our business. 
 
Over the next two years, we will complete the UK’s largest and most successful digital infrastructure project – connecting more than 25 million premises to full fibre broadband.  Together with our heavy investment in 5G, we play a central role in revolutionising how people connect with each other. 
 
While we are through the most capital-intensive phase of our fibre investment, meaning we can reward our shareholders for their commitment and patience, we are absolutely focused on how we organise ourselves in the best way to serve our customers in the years to come.  This includes radical simplification of systems, structures, and processes on a huge scale. Together with our application of AI and technology, we are on a path to creating the UK’s best telco, reimagining the customer experience and relationship with one of this country’s biggest infrastructure companies.  
 
Change on the scale we will all experience in the coming years is unprecedented.  BT Group is committed to being the driving force behind improving connectivity for millions and there has never been a more exciting time to join a company and leadership team with the skills, experience, creativity, and passion to take this company into a new era.

A FEW POINTS TO NOTE:

Although these roles are listed as full-time, if you’re a job share partnership, work reduced hours, or any other way of working flexibly, please still get in touch.

We will also offer reasonable adjustments for the selection process if required, so please do not hesitate to inform us.

DON'T MEET EVERY SINGLE REQUIREMENT?

Studies have shown that women and people who are disabled, LGBTQ+, neurodiverse or from ethnic minority backgrounds are less likely to apply for jobs unless they meet every single qualification and criteria. We're committed to building a diverse, inclusive, and authentic workplace where everyone can be their best, so if you're excited about this role but your past experience doesn't align perfectly with every requirement on the Job Description, please apply anyway - you may just be the right candidate for this or other roles in our wider team.

Apply now »