Twitter Site Reliability Engineers (SREs) are Software Engineers who focus on Availability, Reliability, Disaster Recovery, and other challenges of scale. They possess a breadth and depth of knowledge about Twitter’s production environment that allows them to craft tools, processes, and frameworks to guide colleagues through safely releasing production code, provide guidance and support for monitoring distributed systems, reduce operational overhead, and enable teams to achieve their desired reliability outcomes.
The Coordination team develops and operates highly available foundational services that are used by almost every engineer at Twitter. We manage one of the world’s largest ZooKeeper deployments and are actively involved with the open-source community! The Blobstore team stores and serves petabytes of blob data, including the media uploaded by our users. We’re looking to re-architect the current Blobstore system to be able to provide better utilization of the modern hardware based on SSDs for on-prem storage. These services are critical for Twitter's success, and an opportunity to directly make a positive impact on the experience of every Twitter user. You’ll be focused on creating an environment where the SREs, who are embedded with these two teams, can improve Reliability and meet the challenges of operating at our continuously increasing scale.
We believe passion and personality matter; as such, we need leaders that can manage teams of diverse, smart, and driven engineers - while balancing day-to-day people management with moving the business forward both technically and culturally.
Your responsibilities include, but are not limited to:
- Help bring great SREs to Twitter. Source and hire hardworking SREs, grow a team with different perspectives and help your peers do the same.
- Mentor, grow, and empower your team by giving them the skills, confidence, space, and motivation to make decisions independently that lead to their personal and professional success, and enable them to become technical leaders. In other words, align the best outcomes for the growth of people around and business impact.
- Take an active role in contributing to the roadmap for the Coordination and Blobstore teams.
- Participate in deep technical design discussions within your team, and across partner teams, and ensure that we're building the right systems and keeping the quality high. Understand the Observability stack such that you can contribute meaningfully to architectural decisions.