Apply now »

Site Reliability Engineer

Job Req ID:  31786
Posting Date:  26 Apr 2024
Function:  Software Engineering
Unit:  Digital
Location: 

Sovereign Street, Leeds, United Kingdom

Salary:  Competitive

Location

This role is based at our Leeds office. We have a hybrid work model of 3 days in the office per week and 2 days remotely.

Why this job matters

You will play a pivotal role in ensuring the reliability, availability, and performance of complex software systems and digital services. We have more than 200 applications to support where it is important to have:
 
Smooth Operation of Online Services: the seamless functioning of online services is critical for us. Downtime, performance glitches, and unstable software releases can significantly impact user experience and business outcomes.
 
Different from DevOps: While DevOps focuses on delivering applications with a short and stable release lifecycle, SRE concentrates on production reliability. Our SRE teams ensure that software runs smoothly in real-world scenarios, with high availability and stability.

What you’ll be doing

  • SREs responsible for ensuring the reliability, availability, and performance of critical systems and applications.
  • Define and execute the SRE strategy and roadmap in alignment with organizational goals.
  • Resolve application related issues raised by the users through ticketing tools.
  • Follow Change Request (CR) process to implement PROD changes on regular basis.
  • Foster a culture of collaboration, innovation, and continuous improvement within the team.
  • Drive initiatives to reduce incidents and improve system resilience.
  • Identify opportunities for automation to streamline operational tasks and reduce manual intervention.
  • Collaborate with development teams to integrate reliability practices into the software development lifecycle.
  • Conduct post-incident reviews to identify root causes and prevent recurrence.
  • Define scaling strategies and capacity thresholds to maintain system performance.
  • Collaborate with security teams to ensure system security and compliance with industry regulations.
  • Implement security best practices and incident response plans to address security incidents.
  • Maintain comprehensive documentation of system architecture, configurations, and processes.
  • Track and report on key performance metrics to measure the effectiveness of SRE efforts.
  • Make data-driven decisions to improve system reliability and performance.
  • Collaborate with product teams to align SRE efforts with business objectives.

Skills and Experience

Required

  • Understanding of continuous integration and continuous deployment (CI/CD) pipelines and associated tools like Jenkins.
  • Should be good in Unix / Linux, JIRA, Git, Kubernetes, database concepts.
  • Familiarity with DevOps principles and practices, emphasizing collaboration between development and operations teams.

 


Desired

  • Proficient in key software and project management areas including production support, customer service, project management, delivery management, and transition & migration.
  • Solid knowledge of essential tools and platforms including Confluence, Docker, Git Lab, Sonar Qube, Dynatrace, Nexus IQ, and Nexus Repo.
  • Expertise in cost optimization across all levels, both on-premises and in the cloud, with a focus on efficiency and savings.

Benefits

  • 25 days annual leave (plus bank holidays)
  • 10% on target bonus
  • Life Assurance
  • Pension scheme
  • Direct share scheme
  • Option to join the Healthcare Cash Plan or other benefits such as dental insurance, gym memberships etc.
  • 50% off EE mobile pay monthly or SIM only plans
  • Exclusive colleague discounts on our latest and greatest BT broadband packages
  • BT TV with TNT Sports and NOW Entertainment
  • 50% discount for friends and family on EE SIM Only plans & airtime element off a Flex Pay plan

Our leadership standards

Looking in:
- Leading inclusively and Safely
I inspire and build trust through self-awareness, honesty and integrity.
- Owning outcomes
I take the right decisions that benefit the broader organisation.

 

Looking out:
- Delivering for the customer
I execute brilliantly on clear priorities that add value to our customers and the wider business.
- Commercially savvy
I demonstrate strong commercial focus, bringing an external perspective to decision-making.

 

Looking to the future:
- Growth mindset
I experiment and identify opportunities for growth for both myself and the organisation.
- Building for the future
I build diverse future-ready teams where all individuals can be at their best.

 

DON'T MEET EVERY SINGLE REQUIREMENT?

Studies have shown that women and people who are disabled, LGBTQ+, neurodiverse or from ethnic minority backgrounds are less likely to apply for jobs unless they meet every single qualification and criteria. We're committed to building a diverse, inclusive, and authentic workplace where everyone can be their best, so if you're excited about this role but your past experience doesn't align perfectly with every requirement on the Job Description, please apply anyway - you may just be the right candidate for this or other roles in our wider team.

Apply now »