Principal Data Engineer - Remote, USA - Ambry Genetics
Aliso Viejo, CA 92656
About the Job
Compensation: $180,000 - $200,000 per year.
You are eligible for a Short-Term Incentive Plan with a target of 10% of your annual earnings; terms and conditions apply.
The Principal Data Engineer is responsible for driving the design, development, and delivery of Ambry's data infrastructure and solutions.
This role will play a pivotal part in building and maintaining scalable, reliable, and efficient data pipelines, operational data stores, data warehouses, and data lakes.
The Principal Data Engineer will collaborate with architects, scientists, and analysts to ensure that data is governed, accessible, secure, and aligned with business objectives.

As a Principal Data Engineer at Ambry
You will tactically lead Ambry data engineers into a lived experience of “what good looks like” as you commit code to Git and drive continuous delivery of our data products with GitOps into our AWS cloud infrastructure. You will receive support from our cloud infrastructure delivery team with recommended patterns for SDLC, CI/CD, and GitOps-driven deployment into our cloud dev, test, stage, and prod environments.
We have a mix of batch and stream data processing techniques that we want you to standardize and professionalize for uptake and reuse by our data engineers.
Expect to lead junior engineers as you review their commits to our code base.
Expect to exercise authority and accountability on what makes it into the code base as the team adjusts to meet standards.

Essential Functions

Leadership
- You will serve as the Technical Team Lead for the Data Engineering and Operations Team at Ambry.
- You will set an example for (and lead) the team technically through the code you commit and the supporting documentation you write to explain concepts to existing and future developers.
- What increment will we deliver in a sprint? You will assist your product owner and scrum lead in scoping use cases and acceptance criteria while empowering team members to grow and deliver.
- You will work within a “data as a product” operating and delivery model. Customers will request (via a governed specification within our data catalog) and receive delivery from our enterprise operational data stores, whose data has gone through a bronze, silver, gold lifecycle treatment in our data pipelines.

Technical
- You will help define and lead our team's crawl, walk, run transition into modern cloud-native data mesh architectures as we scale our operational data stores, data warehouse, and data lakehouse.
- We currently maintain disparate technical implementations for performing batch processing, stream processing, and data pipeline orchestration.
- You will lead us as we journey to standardize and support one governed set of capabilities.

Here is a list of some of the technology you will work with. Proviso: we expect (and are eager to receive) your recommendations and improvements; the list below is not set in stone.
- Data Pipeline Orchestration and Execution: AWS Glue, AWS Step Functions, and Databricks
- Change Data Capture: Amazon Database Migration Service, Amazon Managed Streaming for Apache Kafka with the Debezium plugin
- Batch:
  - AWS Step Functions (and Glue jobs)
  - Asynchronous queueing of batch job commands with RabbitMQ to various “ETL Jobs”
  - Cron and supervisord processing on dedicated job server(s): Python and PHP
- Streaming:
  - Real-time processing via AWS MSK (Kafka), Apache Hudi, and Apache Flink
  - Near-real-time processing via workers (listeners) spread over AWS Lambda and custom server daemons written in Python and PHP Symfony
- Languages: Python and PySpark, Unix shell, PHP Symfony (with Doctrine ORM)
- Monitoring & Reliability: Datadog and CloudWatch

Things you will do
- Build dashboards using Datadog and CloudWatch to ensure system health and user support.
- Build schema registries that enable data governance.
- Partner with end users to resolve service disruptions and evangelize our data product offerings.
- Vigilantly oversee data quality and alert upstream data producers of issues.
- Support and contribute to the data platform architecture strategy, roadmap, and implementation plans to support the company's data-driven initiatives and business objectives.
- Work with Business Intelligence (BI) consumers to deliver enterprise-wide fact and dimension data product tables to enable data-driven decision-making across the organization.
- Other duties as assigned.

Qualifications
- Basic understanding of, and/or a willingness to learn, basic genomic concepts and terminology
- Strong familiarity with any combination of our tech stacks as mentioned above
- Experience building data APIs and offering Data as a Service
- Experience integrating with SaaS platforms such as SAP and Salesforce
- Experience with, or willingness to learn, PHP MVC frameworks such as Symfony
- Experience with Atlassian products, i.e. Jira, Confluence, Bamboo
- Experience with system diagramming tools such as Miro, Lucidchart, or Visio
- 6+ years’ experience working with professional scrum teams and/or equivalent schooling
- 4+ years’ experience using Git version control
- 3+ years’ experience designing and indexing relational databases
- 2+ years’ experience building and operationalizing real-time data streams
- Bachelor’s or master’s degree in computer, data, math, or life sciences or equivalent work experience

Preferred
- AWS Associate Solution Architect certification

About Us
Ambry Genetics Corporation is a CAP-accredited and CLIA-licensed molecular genetics laboratory based in Aliso Viejo, California.
We are a genetics-based healthcare company that is dedicated to open scientific exchange so we can work together to understand and treat all human disease faster.
At Ambry, everyone is welcome.
A career at Ambry Genetics is a chance to be part of a dynamic company that aims to improve health by understanding the relationships between genetics and human disease.
We earned our reputation as industry leaders by responsibly introducing cutting-edge genetic testing solutions and continually sharing what we learn with the global scientific community.
At Ambry you will be learning, challenging yourself, and having fun while collaborating with teammates through the open exchange of ideas.
Our outstanding benefits program includes medical, dental, vision, 401k with a 4% employer match, FSA, paid sick leave, and a generous paid time off (PTO) program.
The Company reserves the right to make changes to the 401k plan from time to time.
You can learn more about the benefits here.
Ambry Genetics is an Equal Opportunity Employer (EOE) and we maintain a drug-free work environment. The Company believes in second chance employment. Qualified applicants with arrest or conviction history will be considered regardless of their arrest or conviction history, consistent with local laws such as Los Angeles County Fair Chance Ordinance and the California Fair Chance Act. You do not need to disclose your criminal history or participate in a background check until a conditional job offer is made to you.
After making a conditional offer and running a background check, if the Company is concerned about a conviction that is directly related to the job, you will be given the chance to explain the circumstances surrounding the conviction, provide mitigating evidence, or challenge the accuracy of the background report.
For the purpose of the above job description, “Essential Functions” are “Material Job Duties”.
Our salary ranges are determined by role, level, and location.
The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations.
Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.
All qualified applicants will receive consideration for employment without regard to race (and traits historically associated with race, including, but not limited to, hair texture and protective hairstyles such as braids, locks, and twists), color, creed, religion, sex, sexual orientation, gender identity, gender expression (including transgender status), national origin, ancestry, age, marital status or protected veteran status and will not be discriminated against on the basis of disability, protected medical condition as defined by applicable state or local law, genetic information, or any other characteristic protected by applicable federal, state, or local laws and ordinances.
If you have a disability or special need that requires accommodation, please contact us at careers@ambrygen.com.
Ambry does not accept unsolicited resumes from individual recruiters, third-party recruiting agencies, outside recruiters, or firms without an executed contract in place.
We are not responsible for any fees related to resumes that are unsolicited or are received by Ambry.
Such resumes will be deemed the sole property of Ambry and will be processed accordingly.

PRIVACY NOTICES
To review Ambry’s Privacy Notice, click here:
To review the California privacy notice, click here: California Privacy Notice | Ambry Genetics
To review the UKG privacy notice, click here: California Privacy Notice | UKG

#LI-REMOTE #LI-NK1

Job Summary
Requisition Number: PRINC003818
Job Category: Engineering
Schedule: Full-Time
Source: Ambry Genetics