Site Reliability Engineer - Email
Who We Are:
Twitter’s globally distributed, real-time communications network generates trillions of events and petabytes of data per day. As an SRE on our email team, you will manage all of email flow inside Twitter and ensure good email deliverability and reputation of Twitter email brands.
What You’ll Do:
As a member of the organization, you will be dedicated to improving the reliability of our end-to-end platform. Your work integrates directly with Twitter's products. Our core infrastructure receives hundreds of millions of tweets per day and serves tens of billions of API requests. Our other systems serve over 2+ billion search queries per day, render hundreds of millions of ad impressions, and process hundreds of terabytes of log and interaction data daily. We send equivalent number of emails and the postmaster is responsible for the deliverability. Postmaster will also does consultation work, when on boarding new and existing email clients.
You will work with open-source technologies and the SRE community.
You will actively participate in the vision to move away from high operational cost tasks like break fix, cluster migrations, new services buildout, abuse etc. You will want to be at stage in future where the service can shrink and expand based on demand, self heal, auto rollouts, with the least amount of human effort.
Your responsibilities include but not limited to:
You would perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes
You will build relationships with ISPs and other industry partners.
Troubleshooting tools (i.e tcpdump, netstat, iostat, traceroute)
Experience with iptables or other firewall solutions
Ability to work with engineering teams and minimal hand-holding
Troubleshoot issues across the entire stack: hardware, software, application and network
You will Identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services
Participate in code reviews
You will participate in an oncall rotation with other engineers to support your services.
Who You Are:
You have a solid understanding of MTAs.
You have solid understanding of email authentication and authorization protocols like dkim/dmarc
Practical knowledge of various anti-spam technologies
Practical, solid knowledge of shell scripting and at least one higher-level language (Python or Ruby preferred)
You have deep knowledge of feedback loops and bounce handling
Comfortable configuring DNS, DHCP, and LAN/WAN technologies
You are an experience with Linux systems engineering and troubleshooting
Minimum 1+ years of handling services in an internet scale *nix environment
- Coding in one or more of the following languages: Python or BASH
- You should work well with and be able to influence myriad personalities at all levels
You can prioritize tasks and work independently
You should be adaptable and able to focus on the simplest, most efficient & reliable solutions
- B.S. in computer science or similar field or equivalent experience.
You have a track record of successful practical problem solving, excellent written and social communication, and documentation skills
Practical experience in Postfix, Exim or Msys.
Strong contacts at ISPs and track record of excellent deliverability
We are committed to an inclusive and diverse Twitter. Twitter is an equal opportunity employer. We do not discriminate based on race, ethnicity, color, ancestry, national origin, religion, sex, sexual orientation, gender identity, age, disability, veteran status, genetic information, marital status or any other legally protected status.
San Francisco applicants: Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
After you apply, a recruiter may reach out to you for an introductory call.
If your background is a match for the role, you may phone interview with 1-2 people.
If you continue through the process, you will come onsite 1-2 times to interview with a total of 5-10 people.
We're the People Team at Twitter. We Tweet about who we're hiring, what we're doing, and why you should work at Twitter! #LoveWhereYouWork
We're your one stop shop for anything University related. That means campus outreach, student advice/tips, & of course, our University Recruiting efforts!