Senior Site Reliability Engineer

We are looking for engineer who will be engaged in maintaining, securing and scaling cutting-edge geo-distributed cloud infrastructure software to meet users' needs while keeping an ever-watchful eye on capacity and performance.

Job location: Remote / Serbia

Job Description:

  • You will be responsible for building, maintaining, and scaling across multiple clouds our complex and data-intensive kubernetes based digital edge fabric.
  • You will be writing / extending Kubernetes operators, Serverless like knative etc.
  • You will automate the continuous deployment process using approaches like GitOps such that once engineering releases, the product walks itself through various stages to join the fleet on the clouds.
  • You will also act as a consultant to development on infrastructure, networking, scalability, monitoring, operational process, infrastructure efficiency and release process.
  • You will scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.

Requirements:

  • 5+ experience within the DevOps, CI-CD and similar.
  • You have extensive background in provisioning, scaling and operating cloud-based geo-distributed applications by monitoring availability and taking a holistic view of system health.
  • Deep understanding of Docker, Kubernetes, Helm and ways to orchestrate container systems. You have a good understanding & experience operating on either AWS/GCP/Azure using container native technologies.
  • Fluent in Linux systems. You like to automate your job, using Go, Python, Bash or the likes.
  • You enjoy troubleshooting in a distributed Kubernetes environment and are comfortable in tracing problems through applications, systems and networks.
  • You enjoy talking about stability, scalability and performance limits of web-service
  • Advanced knowledge of Python, Go or Shell scripting.
  • Experience in designing, automating securing and supporting big, fast data stacks on AWS/GCE in containers/kubernetes.
  • Experience operating critical production systems at scale.
  • Excellent knowledge of English, written and spoken.

What we offer 👈

Job Application

Fill out the enquiry form and we’ll get back to you as soon as possible.




    Yes, I agree that BLUE GRID DOO, registration number 21259128, may store and process my personal... data in order to contact me regarding available work position at BLUE GRID DOO. More Info