Sr. System Engineer - Fulcrum Digital
New York, NY 10001
About the Job
Must Have Skills :
- Expertise in end-end Cloud migration including strategy development, assessment, solution design, architecture, and execution on Azure, AWS
- Operational knowledge of key technologies, such as Windows OS, Linux OS, AD, DFS, SFTP, Cluster etc and In-depth knowledge on Active Directory Replication troubleshooting, hands on replication tool Repadmin, Dcdiag, AD Rep etc. Experience in group policies and troubleshooting & implementation
- Day-to-day management of the VMs including: installation, upgrades, troubleshooting, patching, backup and recovery.
- In-depth knowledge of data center environments, servers, and network equipment.
- Install, upgrade, and maintain Linux based systems
- help troubleshoot problems with Linux servers, running various versions of Linux, including Red Hat, Ubuntu, CentOS etc
- Azure infrastructure enterprise level projects that design and deploy cloud environments for hosting business application services;
- Working closely with application, network, and security teams to ensure requirements are reflected appropriately in the Azure design;
- Designing, testing, and implementing application services migrations in both a manual and automated manner;
- Good Experience with Windows Based Application Hosted IIS, NET
- Lead proof of concept and/or pilot implementations and define the plan to scale implementations across multiple technology domains
- Use your problem-solving and critical thinking skills to effectively manage diverse projects, delivering against tight deadlines in a fast paced and demanding environment
- Drive in-depth knowledge and understanding of key value drivers of a business and how they impact the scope and approach of the engagement
- Develop and own the technical roadmap for relevant service lines and backlog to ensure consistent architecture.
Understanding of event-driven architectures Distributed systems - How clusters are formed, Quorum management, Failure handling. 3 to 5 years of hands-on Experience in MQ or NATS broker or similar messaging solutions. Understanding of Kafka clustering would be good to have. Knows Client-Server communication aspects - sockets, TLS protocol etc Understands the concept of region and AZs. Provide L2 support production systems like application, database, middleware components, infrastructure and network components. Manage production incidents end-to-end within defined SLAs with focus on resolution rather than who caused it. Interact with various stakeholders such as Release managers, program leads, service managers, development and test leads Review operational readiness requirements such as monitoring and alerting, log rotation and resilience of the components and report the gaps Provide pre-implementation support with activities such as release notes review and implementation dry runs. Protect production components by running health checks monitoring latency and memory utilization. Automate day-to-day activities and propose changes that improve reliability Participate in CAB and provide feedback on change requests Support the DevOps team in testing the promoted pipelines and suggest automation of configuration items. Practice incident management best practices and perform RCA. Participate in disaster recovery tests and operational acceptance tests Analyze the technology stack that makes up the product and optimize recovery time objective. Work with team members spread across and time zones Share knowledge, document improvements and mentor junior resources It is good to have skills using Jenkins to orchestrate builds and link to Sonar, Maven, etc. to build out the CI/CD pipeline. Support deployments of code into multiple lower environments. Supporting current processes needed with an emphasis on automating everything as soon as possible. It is good to have skill to design, Implement, and enhance our deployment automation based on Chef. We need proven experience designing and implementing an overall release and deployment process. It is good to have skill to design and implement a Git based code management strategy that will support multiple environment deployments in parallel. Experience with automation for Branch management, code promotions, and version management. Engage in and improve the whole lifecycle of services—from inception and design through deployment, operation, and refinement. Requirements MQ/EB Understanding of event-driven architectures Distributed systems - How clusters are formed, Quorum management, Failure handling. 3 to 5 years of hands-on Experience in MQ or NATS broker or similar messaging solutions. An understanding of Kafka clustering would be good to have. Knows Client-Server communication aspects - sockets, TLS protocol etc Understand the concept of region and AZs. Deployments MTF/Prod, Maintenance items (including stop/start, Disaster Recovery-related activities, etc.), CR for changes in MTF/Prod Good knowledge on Nginx Tools - Log Monitoring Tool - Splunk Application Monitoring tool - Dynatrace Ticketing incident/problem management tool - Remedy Dev-ops Basics - CI-CD Basics, Overview of Git, Bit-bucket, SonarQube, Ansible/Chef Skills - Linux & Shell Scripting ITIL / ITSM PL/SQL Troubleshooting Jenkins - CI/CD Groovy Scripting/Yaml Ansible/Chef Nginx Java / JEE Event-Driven Architectures MQ or NATS broker or similar messaging solutions. Kafka Client-server communication aspects - sockets, TLS protocol Understand the concept of region and AZs.
Source : Fulcrum Digital