Are you intrigued by planetary scale, distributed, intelligent systems?
Do you like to build services that can run themselves? Then this is the role for you!
Join our Performance & Reliability Engineering Organization!
Akamai is the world's largest, most trusted cloud delivery platform. We ensure that infrastructure services have reliability and uptime that meet service level objectives and agreements. SREs monitor our service capacity and performance. We focus on optimizing services, building infrastructure, eliminating manual operations work with automation.
Partner with the best
The UMP Team is a part of the Akamai's Systems Communications group . A cross-functional engineering team that develops the distributed systems and services that underpin Akamai's global network. Our system allows fast and reliable configuration of Akamai's global network. You'll develop automation that prevents service from recurring and handles non-exceptional service conditions to reduce manual operations.
As a Senior Site Reliability Engineer, you will be responsible for:
- Being a SME and tuning systems to optimize performance and to operate more reliably
- Providing ongoing technical assistance in areas including model database management, configuration management, and simulation runs
- Guiding software releases, automating activations for new features, maintenances of services prioritizing safety
- Developing monitoring tools and automate processes to help scale our systems better
- Implementing and improving monitoring, alerting and emergency response procedures and maneuvers
- Troubleshooting complex application issues, service incidents, performance and availability issues
- Providing expertise developing code that provides predictive results from analytical trending and modeling
- Managing on premises resources through infrastructure, code frameworks and declarative configuration management
Do what you love
To be successful in this role you will:
- Have experience working as an SRE or operations-adjacent role including troubleshooting
- Have experience in network troubleshooting and working with bare metal as well as cloud servers
- Have experience in Linux administration, programming language (python, go) scripting language (bash)
- Have a significant background in performance analytics and performance optimization
- Have experience with writing queries and big data technologies (PostgreSQL)
- Have experience with deployment/configuration management tools (Ansible, Terraform, Puppet, Chef, SaltStack)
- Be experienced in monitoring large scale systems (using Prometheus, Grafana, etc.)
- Have experience with CI/CD tools (i.e. Jenkins, Travis, Gitlab CI)
Work in a way that works for you
FlexBase, Akamai's Global Flexible Working Program, is based on the principles that are helping us create the best workplace in the world. When our colleagues said that flexible working was important to them, we listened. We also know flexible working is important to many of the incredible people considering joining Akamai. FlexBase, gives 95% of employees the choice to work from their home, their office, or both (in the country advertised). This permanent workplace flexibility program is consistent and fair globally, to help us find incredible talent, virtually anywhere. We are happy to discuss working options for this role and encourage you to speak with your recruiter in more detail when you apply.
Learn what makes Akamai a great place to work
Connect with us on social and see what life at Akamai is like! |
We power and protect life online, by solving the toughest challenges, together.
At Akamai, we're curious, innovative, collaborative and tenacious. We celebrate diversity of thought and we hold an unwavering belief that we can make a meaningful difference. Our teams use their global perspectives to put customers at the forefront of everything they do, so if you are people-centric, you'll thrive here.
Working for you
At Akamai, we will provide you with opportunities to grow, flourish, and achieve great things. Our benefit options are designed to meet your individual needs for today and in the future. We provide benefits surrounding all aspects of your life:
- Your health
- Your finances
- Your family
- Your time at work
- Your time pursuing other endeavors
Our benefit plan options are designed to meet your individual needs and budget, both today and in the future.
About us
Akamai powers and protects life online. Leading companies worldwide choose Akamai to build, deliver, and secure their digital experiences helping billions of people live, work, and play every day. With the world's most distributed compute platform from cloud to edge we make it easy for customers to develop and run applications, while we keep experiences closer to users and threats farther away.
Join us
Are you seeking an opportunity to make a real difference in a company with a global reach and exciting services and clients? Come join us and grow with a team of people who will energize and inspire you!
#LI-Remote
We power and protect life online. Global companies trust us to build, deliver, and secure digital experiences — helping billions to live, work, and play online. Akamai’s intelligent edge platform...
Apply Now