Lead Data Engineer at Global IT Con
About the Job
Role – Lead Data Engineer
100% Remote
Long term
A Brief Overview
The Data Engineer will join a team that builds artificial intelligence (AI) solutions for health care in the areas of patient care, medical research, and administrative services. The group exists to bring AI, predictive algorithms, and other emerging machine learning (ML) innovations in data science into health care. It partners closely with individuals across clinical specialties and operations areas to deploy algorithms that can lead to better patient outcomes. This role is responsible for maintaining the compute frameworks, analysis tooling, and model implementations used or created by the team. The individual will design, implement, and support data processing software and infrastructure.
What you will do
- Build end-to-end data pipelines and infrastructure for ML models used by the Data Science team and others.
- Understand the requirements of data processing and analysis pipelines and make appropriate technical design and interface decisions. Clarifying these requirements involves training, developing, and validating researcher-built or vendor-provided machine learning algorithms on hospital data, as well as working with other members of the Data Science team.
- Understand data flows among the SHC applications and use this knowledge to make recommendations and design decisions for languages, tools, and platforms used in software and data projects.
- Troubleshoot and debug environment and infrastructure problems in production and non-production environments for Data Science team projects.
- Work with other groups at SHC and the Technology and Digital Solutions (TDS) group to ensure that servers and systems are maintained in line with updates, system requirements, data usage, and security requirements.
Education Qualifications
- Bachelor’s or master’s degree in computer science, engineering, or a related field, or equivalent working experience
Experience Qualifications
- 2+ years of experience building data infrastructure for analytics teams, including the ability to write code in SQL, R, or Python for processing large datasets in distributed cloud environments
- Experience with cloud deployment strategies and CI/CD
- Experience building and working with data infrastructure in a SaaS environment
Preferred Knowledge, Skills and Abilities
- Knowledge of multiple programming languages, commitment to choosing languages based on project-specific requirements, and willingness to learn new programming languages as necessary.
- Knowledge of resource management and automation approaches such as workflow runners.
- Collaborative mindset and enthusiasm for iterative design in close partnership with the Data Science team.