Platform
Site Reliability Engineer (SRE)
Lagos · Nigeria · Hybrid / remote · Full-time
Experience requirements for this role are listed under Required qualifications below; some roles ask for more than three years (for example, sales leadership). Compensation and benefits are discussed with shortlisted candidates only, and we do not publish salary ranges on public postings.
Own reliability targets, error budgets, and the incident lifecycle for critical services, balancing delivery velocity with sustainable operations.
What you will do
- Define SLIs/SLOs with product and engineering; track error budgets and governance
- Lead incident response, communication, and blameless postmortems with action tracking
- Improve detection and reduce toil through automation and self-healing where safe
- Drive capacity planning, load testing, and failover drills
- Collaborate with DevOps on runbooks, dashboards, and paging policies
- Participate in architecture reviews from a reliability and operability lens
- Mentor developers on production readiness checklists and operational excellence
Required qualifications
- Minimum 3 years of professional experience in production engineering, SRE, or equivalent operations-heavy software roles
- Strong coding ability (Python, Go, or similar) for building automation, not only working tickets
- Deep experience with monitoring stacks (Prometheus, Grafana, Datadog, New Relic, or similar)
- Proven incident leadership and structured problem solving under pressure
- Understanding of distributed systems failure modes and mitigation patterns
Preferred qualifications
- SRE book methodology (error budgets, toil budgets) applied in practice
- Multi-region or multi-cloud resilience patterns
- Customer-facing SaaS at scale
Apply
Submit your details and resume (PDF or Word, up to 5 MB). We use your information only for recruiting and related HR processes.