Principal Data Engineer - Expert In Recruitment Solutions
Denver, CO
About the Job
Need the Candidate local to CO
Location: Denver, CO (Hybrid)
Duration: 09+ Months (Possibility of Extension)
Must have Skillset:
- AWS EMR
- Scala
- Spark
- Data Engineering
Job Overview:
The principal data engineer is responsible for collaborating on the design and implementation of the data infrastructure and data software solutions used in internal experimentation platform. The principal data engineer will build, deploy, and manage data solutions in accordance with program needs as part of the experimentation engineering team. The experimentation engineering team is responsible for enabling and aggregating new metrics as well maintaining and updating existing metrics for our data science and product stakeholders. The experimentation engineering team facilitates experimentation through AWS S3 buckets storing large amounts of data, AWS EMR jobs pre-processing and aggregating data, and GraphQL APIs exposing the data.
Responsibilities:
Data Solutions
- Architect robust, scalable data solutions, including interfaces, data stores, and data pipelines to meet the needs of our data scientists and other stakeholders.
- Develop data solutions and infrastructure that will enable efficient storage and retrieval of the data for use in experimentation platform analysis, stakeholder-led analysis, and data science ad hoc analysis.
- Create data solutions that enable data engineers to maintain scalable, reliable, consistent, and repeatable systems that support data operations for data science and analytics tools.
- Deploy, test, and troubleshoot data architecture applications.
- Builds, maintains, and enhances the datasets and systems for experimentation results.
- Hands-on implementation and prototyping of data solutions
Technical Leadership
- Guide a team of data engineers and software engineers in the best practices around data engineering.
- Review code to ensure business outcomes and cost/benefit balanced performance is achieved.
- Technical support for our data environments (occasionally on-call)
Establishing Requirements and Stakeholder Relationships
- Work closely with data scientists and software engineers to understand how data will be used and define a solution to satisfy those requirements.
- Convert business requirements into technical requirements with a focus on the data strategy and architecture.
- Technical product owner for data solutions
- Acts as a liaison between data engineering team and stakeholders.
- Support the achievement of the experimentation groups stated goals and outcomes.
Documentation and Standards
- Develops data architecture, data flow, and entity relationship diagrams representing existing and future state data solutions.
- Document experimentation data pipeline procedures and queries
- Defines and enforces experimentation program's data standards in accordance with the organization's data governance policies.
Minimum Qualifications/Skills:
- 5+ years in a data engineering role
- Professional experience designing and building efficient, cost-effective datasets, using petabytes of data from disparate sources, for consumption by data scientists and data analysts.
- Expert Scala programming skills with a data focus
- Professional experience with AWS S3, EMR, Document DB
- Professional experience using Spark.
- Experience using TypeScript and GraphQL
- Experience with both SQL and NoSQL data stores
- Bachelor's degree in related field
Preferred Experience:
- AWS Certified Cloud Data Engineer
- Professional experience with AWS S3 and Amazon EMR (Elastic MapReduce)
- Professional experience using Spark.
- Experience designing and building ETL and ELT systems.
- Master's degree or higher in related field
Source : Expert In Recruitment Solutions