SRE Linux / Windows Primarily Linux position - Expert In Recruitment Solutions
Glendale, CA 91203
About the Job
The Systems Reliability Engineering (SRE) team helps Imagineers create and deliver the
software solutions that power experiences in our theme parks and resorts.
Systems Reliability Engineers use a software engineering approach to architect, design, automate,
monitor, and build applications at scale.
The Senior Systems Engineer is expected to have expert level systems administration skills on both the Linux and Windows platforms, and must
have experience with CI/CD platforms (GitHub Actions, GitLab CI)), systems automation (Chef/Ansible/Terraform), systems development (Go,
Python, Ruby) and cloud automation tools (Boto, CloudFormation, Terraform), source control, cloud hosting, container computing, web
technologies and the DevOps team culture. This position will also bring expertise on systems, operational excellence and application stability,
security, performance, and capacity management, as well as documentation.
This position works closely with Imagineering Technology Studio teams to brainstorm, architect, gather requirements, troubleshoot, and provide
stellar customer support. The role requires someone who is creative, proactive, constructive, and highly motivated. The Senior Systems Engineer
must be prepared to work in an extremely collaborative and high-energy environment.
Job Responsibilities and Duties:
Summarize job responsibilities and major duties. What duties are required for the position to exist?
-Focus on major areas of work, typically 20% or more of role
-An ideal list would have 3-5 major responsibilities/duties
-Estimate and include percentage of time spent in each, and whether performed (D) Daily, (W) Weekly, (M)
Monthly or (A) Annually
Design: Leading project/planning efforts, architectural design, engineering, attending meetings w/ various teams.
Build: Implementing, integrating and configuring solutions, tools, infrastructure and systems.
Basic Qualifications:-
Understand how to install and configure operating systems, specifically with
expertise in Linux and Windows Server.
Software Development Continuous Integration (CI) Pipeline knowledge (GitLab
CI, Github Actions).
Experience in public cloud hosting services (AWS, Google Cloud, Azure) as
well as familiarity with container computing (eg. Docker, ECS, Kubernetes).
Proficiency in Infrastructure as Code (Terraform, CloudFormation, Bicep,
Pilumi).
Experience with Source Control Management systems (Git).
Proficient in web or web server technologies: Java, Node.js, Tomcat, IIS,
Apache/nginx, MySQL, PostgreSQL, etc., including being able to perform basic
setup, configuration, and troubleshooting.
Understand internet technologies and network protocols, including HTTP,
basic load balancing configurations, security zones, VIPs, SNMP, REST and
DNS.
with mentoring for all of the following:
o Site monitoring and instrumentation
Experience supporting and/or developing backend tools or services
Able to perform and provide in depth analysis on load test runs against a
moderately complex system.
Demonstrates exceptional troubleshooting methodology, including the ability to
author and instruct new methodologies to the SRE team.
Independently resolve moderately to highly complex system and application
incidents.
Able to identify and propose system and application fixes for performance
bottlenecks.
Able to evaluate new application requirements for capacity and run-time best
practices.
Masters of Science degree in computer science or related field or equivalent experience in technical
operations and software engineering
Required Education
BS in Computer Science or related field with 7+ years of relevant experience
Source : Expert In Recruitment Solutions