Are you passionate.
We do this while maintaining Akamai's mission at the forefront of what we do.
Make life better for billions of people, billions of times a day.
Partner with the best The Senior Engineer creates solutions to improve automation and efficiency for systems and teams.
Expertise in Linux administration, configuration management, and performance tuning is essential.
Focus on reliability, scalability, and efficiency through automation and resource optimization.
Promote continuous improvement and operational excellence across all systems.
As a Senior Site Reliability Engineer, you will be: Providing support and mentorship for other engineers within the department Developing and maintaining automated tools and scripts to enhance system reliability, deployment processes, and incident response efficiency.
Improving our system monitoring to speed error detection and remediation, enhancing performance and reliability of virtualization platform Participating in on-call rotations, guiding restoration and repair of service-impacting issues Writing automation and tooling to reduce operational toil, improve deployment safety, and accelerate incident response Contributing to capacity planning, autoscaling configuration, and workload scheduling for AI compute infrastructure Do what you love To be successful in this role you will: Possess expert level experience in a SysAdmin (Linux/Unix Administration), DevOps or SRE role, working with large scale distributed systems Demonstrate expertise in Kubernetes and large-scale containerization.