SRE Project Manager - eTeam Inc.
Austin, TX 27518
About the Job
Program Management role in Client for Austin, TX location with the DevOps background having some relative experience on tools and processes like.
" GITHUB,
" CI\CD implementation end to end,
" Containerization
" Data Infrastructure knowledge - Kubernetes / EKA on AWS / Kube on GCP / Cluster build / OS installation / Spark Jobs / Flink / Snowflake / Schedulers / Multi tired storage (S3 / Glacier Equal)
Site Reliability Engineer (SRE) Program Manager role:
1. Role Description:
" The SRE Program Manager is responsible for overseeing and managing the reliability, performance, and availability of software systems.
" They collaborate with cross-functional teams, including development operations, infrastructure, and customer support, to ensure seamless system operation.
2. Key Responsibilities:
" Assessment and Analysis: Evaluate the reliability, performance, and availability of software systems.
" Automation: Utilize programming and scripting languages (such as Python, Go, or Java) to automate operational tasks.
" Troubleshooting: Resolve complex issues related to system architecture and performance.
" Infrastructure Management: Collaborate with development operations staff to create, monitor, and troubleshoot system infrastructure.
" Resilience Enhancement: Increase system resilience to serve larger customer volumes.
" Responsible for overall program management including kick off of the program, planning, scheduling and tracking projects.
" Coordinate and facilitate the execution of SRE projects, ensuring the tasks are assigned and timelines are met.
" Collaborate with cross functional teams including Engineering Dev team, Operations and Product management.
" Identify and mitigate the risks.
" Documentation and Reporting of projects, prepare status reports and communicate project updates to stakeholders and leadership.
3. Skills and Qualifications:
" Strong experience in a Linux / AWS environment
" Expert-level coding skills.
" Knowledge of system architecture and performance optimization
" Good experience or understanding of the design principles.
" Good amount of experience in automation using python scripts.