Recruiter: Nishita Jena
Hiring Manager: Hari Annamalai
Career Grade: D
About the role
The Software Engineering Specialist independently executes advanced activities to deliver the engineering strategy and roadmap that supports BT's commercial strategy through cross functional business partnering and the participation of a team that pursues innovation as well as engineering excellence.
What you’ll be doing
Architecture Ownership & Vision
• Own the overall system architecture of the observability platform across ingestion, processing, storage, and query layers.
• Work closely with Enterprise Architect to align system architecture with long-term architectural vision and technical roadmap aligned with business and platform goals.
• Design and review high-level system designs, data flows, integration patterns, and core technology choices.
• Act as the technical authority for complex architectural decisions, trade-offs, and design reviews.
Platform & Domain Leadership
• Architect systems for infrastructure monitoring, metrics, logs, and distributed tracing at scale.
• Guide the evolution from infrastructure monitoring to a full observability platform including APM.
• Define architectural patterns for high-throughput telemetry ingestion, real-time processing, and query-at-scale.
• Ensure architectural consistency for multi-tenant, cloud-native distributed platforms.
Standards, Governance & Enablement
• Establish and evolve architecture standards, design principles, and best practices across teams.
• Identify architectural risks and proactively drive mitigation strategies.
• Enable teams through reference architectures, design frameworks, and technical guidance.
• Support modernization initiatives including scalability, performance optimization, resilience, and cost efficiency.
Hands-On Architectural Validation
• Perform code reviews of critical and performance-sensitive components.
• Build and guide proof-of-concepts (POCs) to validate architectural decisions and de-risk new technologies.
• Develop reference implementations to demonstrate architectural intent.
• Collaborate closely with senior engineers to troubleshoot complex system-level issues
Delivery Collaboration & Execution Enablement
• Work closely with Software Engineering Managers to align architectural decisions with delivery and release plans.
• Assist in breaking down large architectural initiatives into phased, incremental deliverables.
• Identify architectural and technical risks early and proactively surface them to influence release planning.
• Support release readiness by validating that architecture, scalability, and non‑functional requirements are addressed ahead of key milestones.
The skills you’ll need
• Strong foundation in system design, distributed systems, and application scalability
• Experience designing microservice based, event driven architectures
• Ability to make architectural trade offs involving scalability, reliability, performance, and cost
• Proven experience designing large scale, cloud based distributed platforms
• Strong backend experience with Java (Spring Boot–based microservices)
• Working knowledge of Python and/or Go for scripting, automation, and collectors
• Ability to review and reason about performance critical backend code
• Strong understanding of infrastructure monitoring and observability concepts with hands on experience on follow:
o Metrics, logs, and distributed tracing
o Agent based and agentless monitoring models
o Push / pull / subscription based data collection
• Hands on or deep working knowledge of:
o Metric data models and storage (e.g., VictoriaMetrics or equivalent)
o Log aggregation and search platforms (Elasticsearch / OpenSearch)
• Familiarity with OpenTelemetry, exporters, and custom collectors
• Understanding of common monitoring tools and protocols (e.g., SNMP, Prometheus style systems)
• Strong experience with:
o Time series databases for metrics
o Relational databases (PostgreSQL/MySQL) for metadata and control plane services
o NoSQL / in memory stores (e.g., Redis) for high volume or low latency workloads
• Working knowledge of search and analytics engines for logs and traces
• Understanding of data modeling, query patterns, and data lifecycle management
• Experience with Kafka class distributed messaging systems
• Designing and operating event driven telemetry ingestion pipelines
• Understanding of throughput, backpressure, and reliability concerns in streaming systems
• Strong hands on exposure to Docker and Kubernetes based platforms
• Infrastructure automation using:
o Terraform (IaC)
o Ansible (deployment and configuration automation)
• CI/CD pipeline understanding and usage (e.g., GitLab CI or equivalent)
• Familiarity with Grafana, Kibana, or similar visualization platforms
• Awareness of frontend technologies such as Angular (architecture level understanding)
• Experience with OpenStack, MAAS, Juju, or similar cloud/service orchestration platforms
• Exposure to Canonical Observability Stack (COS) or equivalent platforms
• Ability to perform code reviews for critical and performance sensitive components
• Experience building or guiding proof of concepts (POCs) to validate architecture
• Strong collaboration with Engineering Managers and senior engineers
• Clear technical communication and mentoring capability
BT Group’s Behaviours
Customer First - Prioritize customer needs in every decision and action.
Challengers - Challenge the status quo and bring innovative ideas to life.
Committed - Own outcomes and deliver with integrity.
Clear - Communicate openly and simply, ensuring alignment.
Connected - Collaborate across teams to achieve shared goals.
BT Group is the UK’s leading communications group and the holding company behind some of the country’s most recognised brands – including BT, EE, Openreach and Plusnet. Our purpose is as simple as it is ambitious: we connect for good. Our customers include consumers, small, medium and large businesses, public sector organisations and other communications providers.
BT Group’s role is about setting direction, unlocking value and creating the conditions for our brands and businesses to thrive.
Having come through the most capital-intensive phase of our fibre investment, our focus now is on what comes next – simplifying how we operate, using technology and AI to work smarter, and organising ourselves to serve customers better and grow sustainably. Group teams shape strategy, policy, brand, capital allocation and transformation, helping the whole organisation perform at its best.
We have a singular culture that unites all our people: we are customer-first challengers, who are committed, clear and connected. These behaviours unite us as one team to deliver for our colleagues, our customers, our stakeholders and the country. Joining BT Group means working at the heart of a business that matters to the UK, with the opportunity to shape decisions, influence outcomes and help set the future course of one of the country’s most important companies.