Who We Are

At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward – always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities.


The Role

Join us as a Site Reliability Engineer (SRE) and embark on an exciting journey of ensuring reliability, resiliency, and innovation in our information systems and ecosystems. As an SRE at Kyndryl, you'll be at the forefront of driving continuous improvement and delivering exceptional service to our customers. 

Your role goes beyond traditional engineering, as you'll have the opportunity to analyze business needs, tackle complex problems, and provide strategic advice and designs. You'll be involved in every stage of the software lifecycle, from building and testing to deploying changes and maintaining robust systems.

We're looking for a true visionary who can think strategically and help shape the future of our services. Your expertise in building trusted relationships with customers and partnering with them for success will be instrumental in driving our growth.

As an SRE, you'll have the unique opportunity to work on end-to-end services, spanning customer sites and platforms. Collaboration and proactivity are key as you work alongside a talented team of professionals, eager to make a difference. You'll embrace an entrepreneurial mindset, taking ownership of your responsibilities and constantly seeking innovative solutions.

With an unwavering focus on quality, robustness, and security, you'll be a driving force in implementing cutting-edge tools that enhance our operations, improve reliability, and gather valuable feedback on our platforms. Your ability to identify and mitigate common operational issues will play a crucial role in delivering seamless experiences to our customers.

If you're passionate about pushing the boundaries of technology, thrive in a collaborative environment, and are motivated by the opportunity to shape the future of reliability engineering, then we want to hear from you. Join our team and be part of a dynamic and forward-thinking organization that values innovation and excellence in everything we do.

You will be responsible for the following:

  • Develop and maintain key reliability metrics such as Service Level Indicators (SLIs), Service Level Objectives (SLOs) and Service Level Agreements (SLAs) in line with business goals.

  • Continuously assess and improve the reliability of our systems, ensuring they meet defined metrics.

  • Deliver reliable and scalable solutions by following SRE principles such as automation, infrastructure as code (IaC), proactive cost monitoring and failure detection.

  • Collaborate with engineering and operations teams to ensure that architecture and design align with reliability goals.

  • Incorporate security best practices into all aspects of SRE work, ensuring systems are resilient against vulnerabilities and threats.

  • Establish and document best practices for system monitoring, backup, restore, disaster recovery (DR) and resiliency processes.

  • Ensure comprehensive monitoring coverage, including vulnerability checks, application performance, infrastructure health and user experience.

  • Own the incident management process and take responsibility for continuous improvement.

Your Future at Kyndryl
Kyndryl has a global footprint, which means that as a Site Reliability Engineer at Kyndryl you will have opportunities to work on projects and collaborate with colleagues from around the world. This role is dynamic and influential – offering a wide range of professional and personal growth opportunities that you won’t find anywhere else.


Who You Are

You’re good at what you do and possess the required experience to prove it. Equally important, you have a growth mindset and are keen to drive your own personal and professional development. You are customer-focused – someone who prioritizes customer success in their work. And finally, you’re open and borderless – naturally inclusive in how you work with others.

Required Skills and Experience

  • Total experience of 6 to 9 years as a Site Reliability Engineer.

  • At least 6 years of professional experience in Systems engineering.

  • Minimum 4 years of experience programming in Shell script, JavaScript and Python, and working with Amazon Web Services cloud components, including Compute, Network and Storage (IaaS and PaaS) services.

  • Minimum 4 years of experience in delivering infrastructure as code with CloudFormation and other frameworks such as SAM and Terraform.

  • Minimum 4 years of experience in supporting production applications, managing releases and providing escalated on-call support.

  • Experience in SRE best practices and Application Performance Monitoring, with a deep understanding of SRE concepts such as Service Level Indicators and Service Level Objectives, change management best practices, version control systems (Git), trunk-based development and AWS Systems.

  • Familiarity with containerization, especially Kubernetes.

Preferred Skills and Experience

  • Bachelor’s Degree or equivalent practical experience.

  • AWS Certified DevOps Engineer - Professional.

  • AWS Certified Solutions Architect – Associate.

  • Hands-on experience with Observability Platforms.

  • Experience with Automation Testing Frameworks such as Selenium or Playwright, as well as DevOps, SecOps, AWS Cloud Development Kit (CDK), AWS Management, Security, Scalability, Reliability and Cost Optimization.


Being You

Diversity is a whole lot more than what we look like or where we come from; it’s how we think and who we are. We welcome people of all cultures, backgrounds, and experiences. But we’re not doing it single-handedly: Our Kyndryl Inclusion Networks are only one of many ways we create a workplace where all Kyndryls can find and provide support and advice. This dedication to welcoming everyone into our company means that Kyndryl gives you – and everyone next to you – the ability to bring your whole self to work, individually and collectively, and support the activation of our equitable culture. That’s the Kyndryl Way.


What You Can Expect

With state-of-the-art resources and Fortune 100 clients, every day is an opportunity to innovate and to build new capabilities, new relationships, new processes, and new value. Kyndryl cares about your well-being and prides itself on offering benefits that give you choice, reflect the diversity of our employees and support you and your family through the moments that matter – wherever you are in your life journey. Our employee learning programs give you access to the best learning in the industry to receive certifications, including Microsoft, Google, Amazon, Skillsoft, and many more. Through our company-wide volunteering and giving platform, you can donate, start fundraisers, volunteer, and search over 2 million non-profit organizations. At Kyndryl, we invest heavily in you. We want you to succeed so that together, we will all succeed.

Get Referred!

If you know someone who works at Kyndryl, when asked ‘How Did You Hear About Us’ during the application process, select ‘Employee Referral’ and enter your contact's Kyndryl email address.

Is This a Remote Job?
No

Kyndryl is the world's largest provider of IT infrastructure services serving thousands of enterprise customers in more than 60 countries.  
