How You Contribute to Our Vision: Key Responsibilities
The SRE Lead will lead engineering and support teams to ensure the health and performance of multiple B2B eCommerce capabilities that enable our customers to manage their business. You will drive teams to automate, perform deep-level analysis, and improve the observability of our systems, minimizing customer disruptions. In addition, you will use the Incident Management process to find opportunities for trending issues that the team can help find permanent solutions to address. From Kubernetes to the kernel and everything in-between, you'll be working with the latest technology in a fast-paced engineering environment. As the SRE Manager, you will be responsible for the operations engineers in your assigned time zone. In addition, you will lead customer service management, managed services operations, and provide consistent product improvement engineering. Your collaboration with internal customers, product engineering, and development groups is critical to success.
You will make a high business impact by:
Leading your team in daily agile SRE practices
Optimizing the quality and velocity of both support and operational teams
Mentoring engineers and support specialists to improve their skills
Identifying and measuring team health indicators
Implementing structured engineering and operations processes
Ensuring proper team focus on priorities, milestones, and deliverables
Working to meet service level agreements with customer deployments around the globe
Delivering quality managed services in a consistent, timely manner
Representing the Site Operations team to stakeholders, customers, and internal teams
Experience, Skills, Education & Licenses/Certifications:
5+ years of Experience delivering high SLA production outcomes (ideally in a Public Cloud environment) - leveraging cloud-native architectures to build and manage resilient, highly available infrastructures that deliver customer outcomes with high SLAs.
5+ years of management experience leading teams of engineers
At least 2 years of Experience leading an SRE team
Proven experience managing engineering and support teams in eCommerce
Experience with Linux/Windows Server troubleshooting
Experience with Microservice architecture and container technology troubleshooting
Experience with agile software development methodologies
Experience working in and managing distributed teams
Experience with cloud topologies and technologies (Microsoft Azure preferred)
Experience driving automation and recoverability process through a continuous improvement mindset
Excellent problem solving, critical thinking, and interpersonal skills - Lead by example to empower and challenge the team to deliver their best.
Knowledge of observability stacks (Dynatrace, Application Insights, Graylog)
Technical aptitude for understanding complex distributed systems
Experience working with teams in different countries
Track record of building and managing high-performance SRE and Operational teams.
Extensive experience leading teams responsible for customer-facing systems in a high uptime 24-7 environment.