Site Reliability Engineer (Kafka)

IT Teams is a Romanian company specializing in software outsourcing and remote staffing. We aim to assemble passionate experts to help companies and startups achieve success. We deliver a combination of technical skills, methodology, and high-speed delivery techniques to help startups and established companies extend their development teams.

 

Apply now »


We are looking for a Site Reliability Engineer (Kafka) who will carry out SRE duties for a Kafka Streaming Platform and who has a thorough understanding of the Kafka architecture, including the concepts of producers, consumers, topics, and partitions.

Hybrid, Warsaw

Long term, 8h per day

April 2024

Responsibilities:

  • Carry out SRE duties for Kafka Streaming Platform. 
  • Have a thorough understanding of the Kafka architecture, including the concepts of producers, consumers, topics, and partitions.
  • Keep an eye on the platforms and adhere to runbooks/SOPs to manage platform and application problems 
  • Familiarize yourself with the cluster maintenance processes and implement changes as per the documented installation and validation plans 
  • Showcase robust troubleshooting and debugging skills, aiming to pinpoint and rectify the issue, while also offering advice on how to prevent such problems in the future 
  • Conduct thorough root cause analysis of major production incidents, document the findings for future reference, and put in place proactive measures to enhance system reliability 
  • Automate routine tasks using scripts or automation tools to reduce manual work, decrease the chance of human error, and boost system reliability.

 

Technical Skills:

  • At least 2-3 years of experience for a junior-level role, and 5+ years for a mid-level or senior role, working as a Site Reliability Engineer on a Kafka platform. 
  • In-depth knowledge of core Kafka components such as producers, consumers, topics, and partitions. 
  • Troubleshooting both Kafka platform service problems and application problems, and identifying their root causes. 
  • Writing Ansible playbooks and automating manual tasks using Ansible, shell scripting, and Python. 
  • Familiarity with Unix/Linux system internals, networking, and distributed systems.

 

 


Why IT Teams:

We are driven by curiosity, so we conduct a Discovery Process to ensure that the technology we deliver stays in tune with our customers' business goals. We encourage our team members to speak up and share their advice and any concerns, so that we can properly handle the risks of software development projects. Honesty is a key ingredient of our collaboration with customers as well as with our colleagues. Our commitment to customer goals is reflected in our regular progress reporting, constant revision of project goals and deadlines, and solid quality control.
