Staff Site Reliability Engineer



Software Engineering
Kenya · South Africa
Posted on Thursday, September 7, 2023

About Zepz

Zepz is the group powering two leading global remittance brands: WorldRemit and Sendwave. Since 2010, we have been disrupting an industry previously dominated by offline legacy players with our relentless focus to reduce the cost of remittances and increase safety and convenience for our users. Every day, our people work to unlock the prosperity of cross-border communities through finance and technology - driven by our vision of a world that celebrates migrants’ impact on prosperity, at home and abroad.

Our brands helped cross-border communities send over $15bn from 50 countries to recipients in 130 countries in 2022. We operate over 5,000 money transfer corridors worldwide and employ over 1,400 people globally. Zepz is a remote-first employer, with team members located across six continents.
Our vision is to create a world that celebrates migrants’ impact on prosperity, at home and abroad. Our purpose is to unlock the prosperity of cross-border communities through finance and technology.


Our Commitments:

  1. We act like owners - We are relentlessly delivering for our users and spending money thoughtfully.
  2. We embrace embarrassing honesty - We function best when we're open and honest with one another — especially about our challenges and doubts.
  3. We have a bias to action - We get to first outcomes quickly, iterate and learn.
  4. We strive to be better - We may make mistakes, but always learn from them.
  5. We are inclusive - to better reflect and serve our users.

About the role:

We are seeking a highly skilled and experienced Staff Engineer to join our Site Reliability Engineering (SRE) team. As a Staff Engineer, you will play a critical role in designing, implementing, and maintaining scalable and reliable infrastructure and services. You will collaborate with cross-functional teams, including software engineers, DevOps engineers, and system administrators, to ensure the availability, performance, and stability of our production systems.

The ideal candidate will have a strong background in software engineering, and infrastructure automation, and a passion for driving operational excellence through the implementation of SRE best practices.

What you will own:

Reporting to the Site Reliability Engineering Manager, you will:

  • Demonstrate strong analytical skills to interpret data from diverse sources and draw informed conclusions. Make confident decisions even in the face of ambiguity and unpopular choices.
  • Collaborate with product managers to architect long-term roadmaps, ensuring that engineering choices do not constrain product development.
  • Uphold the technical quality standards across various product areas and serve as a trusted resource for reviewing complex designs.
  • Lead by example and exemplify a strong work ethic, consistently delivering results and focusing on getting things done.
  • Take accountability for your team's strategic direction, creating an environment conducive to the timely delivery of business outcomes.
  • Quickly grasp complex and unfamiliar code or tooling, demonstrating adaptability and a continuous learning mindset.
  • Foster functional and collaborative relationships within and outside your team, actively contributing to the company's overarching goals.
  • Lead or influence teams, proactively solving ambiguous problems that span different areas of the business.
  • Produce effective design documentation for intricate and high-impact systems involving multiple stakeholders across the organisation.
  • Encourage innovative thinking and welcome challenges from colleagues, fostering a culture of continuous improvement.
  • Identify underlying issues and recognize complex patterns in problematic situations, displaying strong conceptual thinking abilities.
  • Align technology strategy with short and long-term company goals, serving as a trusted resource in shaping the organisation's direction.

What you bring to the table:

  • An accomplished Engineer with significant industry experience, including substantial years in roles such as SRE, DevOps, or Infrastructure.
  • Extensive understanding of SRE principles and best practices, with a proven track record of successfully managing large-scale, highly available systems.
  • In-depth understanding of system design principles, including distributed systems, microservice architecture, fault tolerance, scalability, and performance optimization.
  • Proficiency in programming or scripting languages: Java, NodeJS or Python is a plus.
  • Familiarity with cloud platforms like AWS and how to deploy to AWS using infrastructure as code tools such as Terraform.
  • Containerization and Orchestration: Knowledge of containerization technologies such as Docker and container orchestration platforms like Kubernetes to deploy and manage applications at scale.
  • Proficiency in using monitoring tools like Prometheus and Grafana Cloud and log management systems like Splunk or Loki to ensure system health and root cause analysis.
  • Understanding of incident response processes, including incident detection, escalation, mitigation, and post-incident analysis.
  • Proficiency in using version control systems like GitHub for source code management and collaboration.
  • Continuous Integration and Deployment: Experience with CI/CD pipelines and related tools such as Jenkins, GitLab CI/CD, or CircleCI to automate software builds, testing, and deployment

What we offer you:

Please note that the benefits below will apply to Full-time roles.

We have five core benefits for our talent in the US, UK, Philippines, Poland, and South Africa. If you're not in one of those regions, don’t worry - the Talent team can let you know what is available for you specifically:

  • Unlimited Annual Leave: Most Zepz team members are eligible for unlimited annual leave. Colleagues in customer-facing roles, receive a competitive holiday allowance and four recharge days a year. Feel free to make the most of your time off and maintain a healthy work-life balance!

  • Private Medical Cover: ​​You can opt-in to a Private Medical Insurance scheme. This provides you with access to thorough medical coverage, so you can feel confident in your health and well-being.

  • Retirement: We offer pension schemes to help you plan for and secure your future.

  • Life Assurance: Life assurance is available to give you peace of mind and protect your loved ones in case of the unexpected..

  • Parental Leave: We offer competitive parental leave schemes to ensure you are spending as much quality time with your new bundle of joy as possible.

We are also remote-first as an organisation, offering flexibility for you to work where you need to be most productive. In many locations, we have workspaces, which you can use as you desire.

Most roles in the Philippines are predominately office-based, with this we offer free meals for those 100% on-site.

In addition to the above, you will discover that we have a range of secondary perks (such as the cycle-to-work scheme and employee discounts) depending on your location, to help you thrive at Zepz!

Why choose Zepz?

  • Our team of over 1400 employees is fully distributed across the world. We are working from coffee shops, homes, and co-working spaces — making us one of the larger fully distributed growth-stage startups in the world but we also offer workspace in our talent cluster locations - spaces we can meet, collaborate and connect.

  • We are proud parents, community organizers, farmers, band members, yoga teachers, YouTube influencers, former Olympians, and serial entrepreneurs.

  • We collectively speak over twenty languages, including Akuapem, Amharic, Bengali, Ewe, Fante, Ga, Igbo, Kalenjin, Luganda, Oromo, Somali, Swahili, Wolof, Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Irish, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish and Swedish.

  • At Zepz, embodying our commitments binds us together. We are collectively passionate about striving to achieve our vision and purpose - to continue to provide the best service to our users.

Ready to apply?

Applications will be reviewed on a rolling basis. If interested, please submit your resume along with a cover letter (optional), highlighting why your experience demonstrates you meet the requirements of the role. Please also indicate the countries in which you have work authorization. While Zepz supports visa sponsorship, sponsorship opportunities may be limited to certain roles and skills.

At Zepz we record interviews using Metaview (https://metaview.ai). It helps us become better interviewers by recording and transcribing our interviews, and ensures we interview candidates in a fair & consistent manner. It is not required. Please let us know if you’d like to opt out of the use of Metaview - this will not affect the outcome of your interview.

Confidence can sometimes hold us back from applying for a job. But we'll let you in on a secret: there's no such thing as a 'perfect' candidate. Zepz is a place where everyone can thrive.

So however you identify and whatever background you bring with you, and if at all you might need any form of support to make the process as comfortable as possible, please let us know and give us a shot by applying. We want you to be excited to wake up to make an impact every day.