Engineer (SRE) - Reliability Tooling and Engineering Health (Intermediate and Senior) at Xero

Reliability, Permanent, Melbourne, AU melbourne engineering full-time
Description
Posted 13 days ago

Xero is a beautiful, easy-to-use platform that helps small businesses and their accounting and bookkeeping advisors grow and thrive. 

At Xero, our purpose is to make life better for people in small business, their advisors, and communities around the world. This purpose sits at the centre of everything we do. We support our people to do the best work of their lives so that they can help small businesses succeed through better tools, information and connections. Because when they succeed they make a difference, and when millions of small businesses are making a difference, the world is a more beautiful place.


About the team

In Site Reliability Engineering (SRE), we drive and influence Xero to provide the most reliable experience for our customers. We are a global team based across New Zealand, Australia and the USA. In SRE at Xero, we combine software and systems engineering to enable engineers across Xero to build and support products that are observable, stable, performant, tolerant to failure, and operate as intended in the face of varying conditions.

We strive to maximise the impact of post incident learning across the organisation to improve the reliability and robustness of the Xero platform, while providing enablement and training across observability, reliability engineering, incident management and service ownership.

We also enable engineers across Xero through developing, supporting and integrating a collection of proprietary and off the shelf tooling to enable incident management and response, incident analysis and learning, monitoring and observability and resource ownership. We surface data and metrics, and provide detailed insights across operational health, production operations and developer productivity.

About the roles

We are currently seeking Software Engineers within our Reliability Tooling and Engineering Health teams in Site Reliability Engineering (SRE). Our teams develop and integrate a collection of tools that enable teams at Xero to easily visualise and manage operations and incidents, to support reliability, operational excellence, continuous delivery and engineering productivity at Xero. In these roles, you will have the opportunity to leverage your technical experience to drive & contribute to team deliverables and also broader SRE and Xero initiatives.

As a member of our Reliability Tooling and Engineering Health teams, you will help enable and empower Xero engineering teams to improve their engineering practices by a combination of the following

  • Contribute to the delivery of projects aligned with team goals, solving ambiguous problems with innovative solutions.
  • Design and maintain robust software components, understanding when to refactor or maintain existing systems.
  • Make data-driven decisions, balancing various perspectives to achieve well-rounded solutions.
  • Advocate for continuous improvement of systems and processes within the team, and across the organisation
  • Establish reliable processes for feature rollouts, monitor success metrics, and ensure system health and quality
  • Exposure to on-call duties, including incident management and response, troubleshooting efforts, as well as conducting post-incident reviews and learning from incidents.
  • In order to be successful in this role, you will have

  • Experience using software engineering to solve operational, reliability challenges and deliver technical initiatives
  • Proficiency in one or more object-oriented programming languages (e.g. C#, JavaScript, Java, Python) or experience with infrastructure-as-code (e.g. Terraform, Cloudformation)
  • Experience working with cloud providers such as AWS, Azure or GCP, alongside experience with logging and monitoring tooling such as sumo logic and new relic
  • Experience with designing, developing and operating internal developer tooling, in a complex distributed systems environment. 
  • Strong experience working in a DevOps environment, preferably with exposure to more mature CICD practices and capabilities
  • The ability to work in a cross-functional, collaborative environment and identify technical dependencies to ensure project success.
  • Why Xero? 
    Offering very generous paid leave to use however you’d like (plus statutory holidays!), dedicated paid leave to care for your physical and mental wellbeing as well as an Employee Assistance Program to access mental health care for you and your family, health insurance, life insurance, and income protection, wellbeing and sports programmes, employee resource groups, 26 weeks of paid parental leave for primary caregivers, an Employee Share Plan, beautiful offices, flexible working, career development, and many other benefits that reflect our human value, you’ll do the best work of your life at Xero.