Senior DevOps/Senior Site Reliability Engineer (SRE) - TechDigital
Jacksonville, FL
About the Job
Job Summary:
We are seeking a highly skilled and experienced DevOps/Senior SRE professional to join our dynamic team. The ideal candidate will play a crucial role in designing, implementing, and maintaining robust infrastructure and deployment pipelines. The candidate should have in-depth technical expertise in various tools and technologies, including Packer, Github, Github Actions, Powershell, Terraform, Ansible, and extensive experience in Azure Cloud.
Key Responsibilities:
Must have at least 10+ years of experience Citris Cloud/ Terraforms, Ansible (who has experience in automation specially in production environments )
Packer:
Utilize Packer to create machine images for multiple platforms, ensuring efficient and consistent deployment processes.
Implement Packer templates to automate the provisioning of infrastructure.
Understanding of the importance of automation in the software delivery lifecycle and the ability to contribute to the automation of repetitive tasks using Packer.
Experience with IaC principles and practices, with a focus on creating reproducible and scalable infrastructure using Packer.
Proficiency in writing Packer templates to define machine images.
Familiarity with version control systems, such as Git, to manage and version Packer templates.
Scripting skills, particularly in languages commonly used with Packer (e.g., JSON, HCL).
Github:
Manage and optimize source code repositories on Github.
Collaborate with development teams to establish branching strategies, code review processes, and versioning practices.
User management along with the authorization.
Github Operations management
Github Actions for CI/CD:
Design and implement CI/CD pipelines using Github Actions to automate the software delivery lifecycle.
Configure workflows for build, test, and deployment processes to ensure efficiency and reliability.
Proficiency in configuring and managing GitHub Actions workflows for CI/CD.
Knowledge of YAML syntax for defining workflows and jobs.
Experience in setting up and optimizing Continuous Integration (CI) and Continuous Deployment (CD) pipelines.
Understanding of best practices for building, testing, and deploying applications using CI/CD.
Strong understanding of version control concepts and practical experience with Git.
Proficiency in scripting languages like Bash, PowerShell, or Python for automation tasks.
Familiarity with programming languages relevant to the tech stack used in the organisation.
Experience with containerization technologies like Docker.
Powershell:
Develop and maintain Powershell scripts for automation tasks and system management.
Implement infrastructure as code (IaC) using Powershell to achieve consistent and repeatable deployments.
Demonstrated experience and proficiency in PowerShell scripting for automation tasks, system administration, and infrastructure management.
Ability to write efficient, reusable, and maintainable PowerShell scripts for various tasks, such as system configuration, deployment, and automation of repetitive processes.
Familiarity with using PowerShell for Active Directory administration, including user management, group policy, and security settings.
Strong knowledge of Windows operating systems and experience in managing and administering Windows-based environments using PowerShell.
Capacity to analyze complex systems and processes, identifying areas for improvement and automation.
Strong documentation skills to provide clear and comprehensive documentation for PowerShell scripts and automation processes.
Certifications (Optional): PowerShell Certifications: Relevant certifications, such as Microsoft Certified: Azure Administrator Associate, that demonstrate proficiency in PowerShell and related technologies.
Terraform:
Design and implement infrastructure using Terraform, emphasizing infrastructure as code principles.
Manage and version infrastructure components to facilitate collaboration and ensure traceability.
Develop, maintain, and optimize infrastructure as code using Terraform.
Collaborate with cross-functional teams to ensure infrastructure requirements are met through code.
Utilize Terraform to provision and manage resources on Azure Cloud.
Optimize cloud infrastructure for scalability, cost-effectiveness, and performance.
Implement best practices to ensure modular and reusable code.
Working Knowledge on terraform modules
Ansible:
Utilize Ansible for configuration management and application deployment.
Automate repetitive tasks and ensure consistency across environments through Ansible playbooks.
Design, develop, and maintain Ansible playbooks for automating infrastructure provisioning, configuration, and application deployment.
Implement and optimize Ansible roles for efficient code reuse and maintainability.
Utilize Ansible for configuration management to ensure consistency across environments.
Implement best practices for managing configuration files and system settings.
Develop and maintain effective monitoring and logging for Ansible workflows.
Proficiency in writing Ansible playbooks and roles.
Experience with version control systems, especially Git.
Scripting skills in languages such as Python or Bash.
Familiarity with Linux/Unix systems administration.
Strong problem-solving and troubleshooting skills.
Excellent communication and collaboration skills.
Azure Infrastructure Design and Deployment:
Design, implement, and manage scalable and reliable Azure infrastructure solutions.
Collaborate with architects and other teams to translate business requirements into Azure-based solutions.
Azure Resource Management:
Manage and optimize Azure resources to ensure cost-effectiveness and performance.
Implement and maintain resource policies, tags, and governance best practices.
Azure Networking:
Configure and manage Azure networking services, including Virtual Networks, Subnets, and ExpressRoute.
Implement and manage Azure Firewall, VPNs, and other network security features.
Identity and Access Management (IAM):
Implement and manage Azure Active Directory for identity and access management.
Configure role-based access control (RBAC) and ensure secure access to Azure resources.
Security and Compliance:
Implement and monitor security best practices and compliance standards in Azure.
Conduct regular security assessments and implement corrective measures as needed.
Automation and Scripting:
Develop automation scripts using tools like PowerShell or Azure CLI to streamline repetitive tasks.
Implement Infrastructure as Code (IaC) using tools such as ARM templates or Terraform.
Azure DevOps Integration:
Integrate Azure services into the CI/CD pipeline using Azure DevOps.
Implement continuous integration and continuous deployment (CI/CD) for Azure-based applications.
Monitoring and Troubleshooting:
Implement monitoring solutions using Azure Monitor and other relevant tools.
Troubleshoot and resolve issues related to Azure infrastructure and applications.