Your opportunity

At New Relic, we provide our customers real-time insights, so they can innovate faster. Our software delivers insightful observability tools across different technologies and distributed systems, enabling software engineering teams to identify, understand, and tackle issues, analyze performance, and maximize their software and infrastructure.

The Kubernetes infrastructure organization is looking for a Senior Software Engineering Manager to join our group. This organization develops and maintains the backend infrastructure for deploying and maintaining hundreds of Kubernetes clusters that host the applications of New Relic on the multi-cloud platform of Amazon AWS,  Microsoft Azure, and Google GCP. The infrastructure components used are open source and team members learn and contribute to them (example: Cluster API).

We encourage async communication and independence, but we also build strong human relationships between team members during our ceremonies or in-person gatherings when possible.

Our Engineering Managers are a select group of technology leaders and mentors who shape the solutions we bring to the market and foster an inclusive environment that brings out the best in people.

If you have a strong technical background but you are also a person with great mentoring skills and that knows how to collaborate with multiple teams to ship great software, we would like to hear from you


What you'll do
  • Work with a team of engineers to design, build, implement and manage secure, scalable and reliable cloud based infrastructure to support the New Relic Product Platform portfolio.
  • Driving and contributing to the software architecture, design of our existing\current large-scale distributed systems and new systems we are investing in.
  • Own, and drive vision and strategy for the Kubernetes Infrastructure platform and provide architectural guidance for security, reliability, scalability, resilience, Disaster Recovery,  performance optimization, and cost efficiency.
  • Driving and contributing to our culture of operational and business efficiency, secure code, CI\CD pipeline, unit test, regression testing, uptime, using smart telemetry data to make decisions, investment in technical and infrastructure debt, blameless root cause with a learner and growth mindset.
  • Collaborating with team members and across organizations to come up with new ways to solve old and new problems.
  • Attending Livesite meetings and bringing postmortem learning back to the team and organization.
  • Continue to learn from some of the brightest minds in the business, they are also looking to learn from you. Coaching and mentoring people. 
  • Drive the recruitment, training, and development of top-tier talent in the region.
  • Regularly collaborates with PMs, technical leaders, and other stakeholders to clarify project requirements and execute cross team initiatives.
  • Work in an agile environment with a DevOps approach—teams build and maintain their own software, including participating in second-layer support rotations.
  • Work in a multi-cloud environment ingesting over 3 billion data points per minute, 150 petabytes of data per month, and accepting over 70 billion HTTP requests daily from our customers.



This role requires
  • BS/MS in Computer Science, or education/experience in a relevant field.
  • 10+ years of professional work experience in software development and/or operations.
  • 6+ years of people management experience.
  • Hands-on experience coding in programming languages such as Golang, Java, Python, or Bash.
  • Hands-on experience driving the software architecture, design, large-scale multi-tiered, distributed software applications, tools, systems, and services using service-oriented architecture, object-oriented design and with RESTful API endpoints.
  • Strong experience in architectural patterns and anti-patterns for resiliency, geo-availability, and disaster recovery.
  • Strong DevOps experience in developing and maintaining cloud-agnostic Kubernetes infrastructure on major public cloud platforms (AWS, Azure, GCP).
  • In-depth experience with containerization tools- Kubernetes, Istio, Cluster API, Crossplane, etc.
  • Solid background in application and system programming, and Linux system administration.
  • Involvement in building Infrastructure as Code (IAC) 
  • Deep understanding of security, reliability, performance and cost-effectiveness of distributed systems on public cloud providers like Amazon AWS, Microsoft Azure and Google GCP.
  • Comfortable working across time zones (US/EU/India time zones) in a fast-paced setting.
  • Demonstrated experience with building and scaling tier-1, tier-2 systems.
  • Proven track record of setting and achieving ambitious goals, guided by metrics.
  • Familiarity with modern cloud deployments, compute, storage, networking, containers, databases, and security.
  • Demonstrated ability to coach and mentor engineers or managers in all aspects of their engineering duties and growth.




Bonus points if you have
  • Experience working with Continuous Integration/Continuous Delivery (CI/CD) tools such as Jenkins, Ansible, Travis CI, Circle CI,
  • API design and development exposure with Go programming language
  • Developed Kubernetes Controllers
  • Solid understanding of the Linux operating system and concepts such as virtual machines and containers
  • Active participation in open source development.
Is a Remote Job?
Remote

New Relic helps engineers and developers do their best work every day — using data, not opinions — at every stage of the software lifecycle. The world’s best engineering teams rely on New Relic to...

Apply Now