Manager, Site Reliability Engineering at SigFig
San Francisco, CA, US
SigFig is tearing down the barriers to investing. We've raised $60 million to support our growth as the #1 Online Investment Advisor for large Financial Institutions. Help us define the next generation of investing!

We're looking for a Manager of Site Reliability Engineering to join our team and increase coverage of our 24x7 operations. SigFig's enterprise platform powers automated investment services for some of the largest financial institutions in the world. We need someone with a proven track record of leading Site Reliability teams who is passionate about uptime, reliability, and the overall performance of our enterprise platform.

WHAT YOU’LL BE DOING:

Manage a team of Site Reliability Engineers who provide first-level response to production issues and drive them to resolution
Establish best practices and create runbooks for the SRE team to follow
Create and implement a strategy to ensure we meet uptime and performance SLAs
Act as the final gatekeeper for all changes going out to customer environments
Work with DevOps and Engineering teams to ensure the stability and reliability of our platform
Work with the latest monitoring tools and triage network, server, and database issues

WE'RE LOOKING FOR SOMEBODY WITH:

3+ years of experience in a technical management position
3+ years of experience with Linux Systems Administration (Red Hat or CentOS preferred)
Extensive experience with monitoring tools like Nagios, StatsD, InfluxDB, New Relic and Splunk
Experience with MySQL, PostgreSQL, NGINX, Varnish, RabbitMQ, and Redis
Experience managing multiple release branches and schedules
Experience managing and supporting 24x7 on-call operations

PERKS & BENEFITS:

Competitive compensation packages
Medical, dental and vision benefits for employees and their dependents
Flexible vacation policy with uncapped time off
Catered lunches daily and a kitchen stocked full of drinks and snacks
Fitness subsidy
Commuter subsidy