Staff Software Engineer, Compute
The Compute team is looking to hire a Staff Software Engineer that thrives at the intersection of infrastructure and software development. This team’s challenges break into 2 domains, which we consider platform engineering and cluster engineering.
Platform Engineering: Higher-level orchestration of both compute capacity and workload primitives to support our multi-cloud, multi-region, deployments. A subset of current focuses include:
- Software automation that creates, manages, and destroys clusters in our fleet.
- APIs and controllers that support multi-cluster deployment and scheduling mechanics.
- Core SDKs that enable controller development in the larger organization.
- Software that codifies out-of-cluster ancillary concerns such as network configurations and managed services.
Cluster Engineering: Intra-cluster engineering problems involving balancing performance, efficiency, and stability. A subset of current focuses include
- Detection of node-level performance characteristics and making availability decisions based on the data.
- Schedulers that support more efficient packing of resources along with reactive rescheduling on the basis of changing compute availability.
- Kubernetes controllers that offer APIs in the cluster and perform reconciliation to reach a desired state.
- Cluster upgrades, both mechanical process concerns and automation.
As a member of the Compute team, your work will span these 2 domains, which are rich with challenging infrastructure and software engineering problems. Your work will directly impact hundreds of millions of users around the world. Join us and help build the future of Reddit!
In your day-to-day, you can expect to:
- Work collaboratively with a team of software engineers to create and maintain the foundational platform for running Reddit’s infrastructure.
- Design, write (Golang), and deliver software to improve the availability, scalability, latency, and efficiency of Reddit’s.
- Contribute feedback to the technical and strategic direction of the compute platform.
- Automate critical aspects of the development process such as service creation and management, as well as critical infrastructure operations.
- Share on-call responsibilities with the Compute team.
- 7+ years of experience developing internet-scale software, preferably in the context of infrastructure.
- Language proficiency in either Go.
- Experience developing on top of Kubernetes or similar distributed systems.
- Kubernetes controller or operator development experience is a huge plus.
- Proficiency operating Linux with a solid understanding around cgroups, namespaces, other multi-tenancy primitives.
- Strong troubleshooting competency ranging from higher-level orchestration concerns to lower-level runtime ones.
- Experience designing large systems, scoping work, and building consensus with other engineers.
- Excellent communication skills to collaborate with a service-oriented team and company.
- Comprehensive Health benefits
- 401k Matching
- Workspace benefits for your home office
- Personal & Professional development funds
- Family Planning Support
- Flexible Vacation (please use them!) & Reddit Global Days Off
- 4+ months paid Parental Leave
- Paid Volunteer time off
This job posting may span more than one career level.
In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit https://www.redditinc.com/careers/.
To provide greater transparency to candidates, we share base pay ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar stage growth companies. Final offer amounts are determined by multiple factors including, skills, depth of work experience and relevant licenses/credentials, and may vary from the amounts listed below.