
Site Reliability Engineer

Project Description

We are looking for a Site Reliability Engineer to join our team and develop software systems and automated solutions for our organization's operations.

Site Reliability Engineer responsibilities include monitoring computer systems and building alerts for the operational issues those systems can experience.

Ultimately, you will work with our IT team to ensure our organization can continue to deliver products and services across our computing environment.

Responsibilities

  • Administer production jobs
  • Understand debugging info
  • “Drain” traffic away from a cluster
  • Roll back a bad software push
  • Block or rate-limit unwanted traffic
  • Bring up additional serving capacity
  • Use the monitoring systems (for alerting and dashboards)
  • Bridge development and IT operations by taking on the tasks typically done by operations

Requirements

  • Proven work experience as a Site Reliability Engineer or similar role
  • Collaborate and communicate asynchronously
  • Document all the things so you don’t need to learn the same thing twice
  • Have an enthusiastic, go-for-it attitude
  • Relevant training and/or certifications as a Site Reliability Engineer

 

Have some questions about this position?

We are happy to support you and answer any questions you have.

Talk to the recruiter