Job Detail

Staff Site Reliability Engineer at
Palo Alto, CA, US is the leading business payments network, with 3 million members paying and getting paid over $52 billion per year. saves companies more than 50% of the time typically spent on financial back-office operations and helps businesses get paid 3 - 4 times faster by automating end-to-end payment processes.  The company is the choice of 4 of the top 10 U.S. banks; leading accounting software providers QuickBooks Online and Xero; and over 50 percent of the top 100 U.S. accounting firms. It is the only business payments solution endorsed by the American Institute of CPAs (AICPA). The recipient of more than 70 awards, proudly received multiple PC Magazine's Editor's Choice Awards and CEO René Lacerte was recently recognized as an E&Y Entrepreneur of the Year.



Professional Experience/Background to be successful in this role:

  • 10+ years experience as a production operations engineer with experience in debugging complex problems across the whole stack, networking (ex: Cisco, Nexus, F5 LTM), storage (ex: Pure, NetApp), and systems (ex: RHEL, Centos, Dell, Nutanix)
  • Expert with AWS services (certified SysOps Administrator or Solutions Architect preferred)
  • Experience with automating systems and infrastructure via Ansible, Puppet, or Chef, and CloudFormation or Terraform
  • 3+ years of automation experience (Python preferred)
  • 5+ years supporting production in a SaaS multi-tenant environment with a modern application framework (Resin/Tomcat/Java) with a highly-transactional database (Oracle/MySQL)
  • Experience with a variety of monitoring and application performance management tools (NewRelic, PagerDuty, Grafana, etc.)
  • Experience with regulatory compliance and bank-level security (PCI, SOC 1/2/3, SOX, bank audits, internal audits)
  • BS or MS degree in Management Information Systems or related discipline


Competencies (Attributes needed to be successful in this role):

  • Have the ability to effectively communicate decisions, ideas, designs, and operation of systems and services in a clear and concise manner
  • Both a generalist, capable of picking up and working with multiple, disparate systems, and an expert, having an ability to dive deep into specific topics and quickly master them
  • Have curiosity about how things work and love to share that knowledge with others
  • Have a passion for helping others and making their lives better, you do this by simplifying complex systems to make them understandable and operable
  • Team player - humble, hungry, and smart
  • Conceptual problem solving - drive projects
  • Industry knowledge - people come to you with questions/help
  • Influence - you are seen as a leader within the team and in the organization
  • Business thinking - seek to understand business needs and apply solutions using technology
  • Project and issue management - able to break down complex projects into bite-size chunks


Expected Outcomes:

  • Drive the migration from on-premise systems to the cloud
  • Help design and implement a highly available infrastructure to meet the needs of our growing and evolving product
  • Help measure and improve reliability and performance
  • Drive continuous improvement by reducing the amount of manual operational work
  • Coordinate with application engineering to drive new technology to support our growth and applications
  • Support a highly available environment as part of an on-call rotation Culture:

  • Humble – No ego
  • Fun –  Celebrate the moments
  • Authentic – We are who we are
  • Passionate – Love what you do   
  • Dedicated – To each other and the customer