Site Reliability Engineer - Equalisers Team at Atlassian

Engineering, Full Time, Sydney, Australia
Description
Posted 1 month ago
Atlassian is continuing to hire with all interviewing and on-boarding done virtually due to COVID-19. Everyone new to the team, along with our current staff, will temporarily work from home until it is safe to return to our offices.

Enjoy tackling problems at massive scale? Love building robust services? Love experimenting with new software and environments? Like being challenged? Join us in Atlassian Site Reliability Engineering for your next big challenge!

We are looking for an experienced Backend Engineer who is passionate about building scalable cloud services to join our growing SRE team. You will be joining a high-impact team based in the Sydney office tasked with building platform services that manage and scale our RDS and EC2 infrastructure for our ever-growing Jira and Confluence products.

As a Software Engineer in SRE you will design, implement, and operate services which support millions of users all over the world. You'll bring your experience with agile software development, web architecture, unit testing, code review and pair programming to help the team maintain the highest standard of engineering.

You'll be joining a team responsible for managing the capacity of the RDS infrastructure for Vertigo Platform (Jira & Confluence). As part of this wide responsibility we take on the challenging and fascinating task of balancing the load of thousands of new and existing customers across our fleet, and controlling major components in the customer on-boarding flow. We are an SRE team that is committed to solving complex problems using data-driven backend services, building in multiple languages such as Java, Kotlin and Python. We build on top of Atlassian’s observability platform to gather metrics and data to make intelligent decisions about how to optimally place our customers, minimising cost and maximising experience for every customer on our platform.

An ideal candidate is someone who is passionate about building reliable services, enjoys systems engineering, keeps on top of industry trends (particularly those related to reliability & scale) and loves bringing their knowledge into the team. We'd love it if you brought a deep understanding of modern cloud infrastructure, programming expertise, operational experience and a desire to change the status quo. We'll support you with robust backend systems, mature processes, and a motivated team with a strong desire to not f*** the customer. We're looking for an engineer who can analyse and help improve our services and processes to get us to an even higher level of availability, scalability, and reliability.

This is an amazing opportunity to join a vibrant team and make a huge impact. It offers an open runway for the right person, with room to grow.

More about you
On your first day, we'll expect you to have:
Strong experience with one or more languages: Java, Python, Kotlin.
Experience working with an RDBMS, e.g. Postgres.
Experience and/or interest in building 12-factor applications/services.
Experience building web services and clients using REST/GraphQL.
Hands-on experience with public cloud offerings (AWS components like EC2, CloudFormation, RDS, S3, DynamoDB, SQS, Kinesis, or equivalents, e.g. in GCP/Azure).
Strong organisational and interpersonal skills, with experience developing and instilling a culture of operational maturity.
It would be great, but not mandatory, if you had:
Experience building monitoring into your code, tweaking dashboards, defining alerts, writing runbooks, etc.
Experience with managing RDS infrastructure will be highly valued.
Experience with Spring Boot 2.
Working knowledge of monitoring tools and time series metrics.