Analytics and Reporting - Senior Analytics Engineer - TechDigital

Summit, NJ

About the Job

Position location can be anywhere near a Client location within NJ. Any site can be utilized and 50% on-site requirement.

Top Skills:

-Advanced SQL skills (5+ years)
-2+ years experience working with dbt
-5+ years working with relational databases
-MS in Computer Science, Chemical Engineering, Biostatistics or similar with 6 years industry experience or PhD in Computer Science, Chemical Engineering, Biostatistics or similar with 3 years industry experience
-Intermediate python skills
- Intermediate visualization (tableau, dashboarding) experience

Responsibilities

Performs data engineering, preprocessing, exploratory data analysis, and model development by interacting with a variety of databases

Responsible for ingestion, integration and delivery of data across multiple platforms

Works to maintain and uphold data integrity and clean data principles

Responsible for leading team code review and improving team programming practices

Responsible for independently coordinating and managing analytics projects across several departments and with cross functional stakeholders

Ability to work on a global team and communicate across several time zones

Communicates with team members regularly to provide updates and collaborate on deliverables.

Accountable for leading, documenting and managing analytics URS and UAT through execution for GPO

Lead and engage colleagues who complete data related activities

Design and deliver digital solutions that streamline access to analytics & data

Work with domain SMEs to derive insight and value to improve manufacturing related data transformations and improvement initiatives.

Displays a high level of teamwork and collaboration both within and across functions

Utilizes supervised or unsupervised methods, learning from vast amounts of unlabeled data to drive insight

Experience working with unstructured text

Ensures life cycle management of code is maintained through version control and associated repositories.

Develops high quality analytical and statistical models, insights, patterns, visualizations, that can be used to improve decision making in manufacturing operations.

Responsible for documentation of all technical work both within and outside of formal document management systems

Independently develops code and analytical models to automate data transformation and analysis

Requirements:

MS in Computer Science, Chemical Engineering, Biostatistics or similar with 6 years industry experience or PhD in Computer Science, Chemical Engineering, Biostatistics or similar with 3 years industry experience

Dashboard development experience (Tableau, Spotfire, DASH)

Proficient in writing and developing analytical and machine learning models using python modules including pandas, numpy, scikitlearn, and tensorflow. Experiencing developing and implementing MLOps pipelines.

Experience building analytical and statistical models to answer key business questions

Experience using git via the command line

Strong understanding of core statistical concepts to solve real world problems

Intermediate to advanced proficiency (3+ years post academia experience as an independent contributor designing and delivering data solutions) in SQL.

Experience interacting with various data warehouses and large-scale, complex datasets using ETL and BI tools and platforms.

Self-motivated to identify and propose Client methodologies that will drive increased efficiency

demonstrate expert knowledge in machine learning and rule-based systems as applied to computational linguistics and natural language processing, as well as development and execution of annotation tasks with teams of experts

Proficiency in mathematics with the skill to translate complex mathematical algorithms into usable computational methods

Experience with data mining and analysis techniques across disparate data sources

Experience working in LINUX/UNIX environments

Experience interacting with PostgresSQL, Oracle, Impala Cloudera, Okera or similar databases

Experience with JupyterLabs, Anaconda, and RStudio

Intermediate proficiency with python

Experience developing visualizations using a variety of methods (plotly, matplotlib, seaborn)

Experience working within Domino Data Lab projects

Technical knowledge of performance tuning and query optimization across large data sets.

Experience with data cataloguing and enablement through APIs

Experience with a variety of computer science languages (C++, Java, html/css)

Exposure to bioprocess engineering/cell therapy data

Knowledge of GxP requirements (preferably related to data and code management)

Experience with Program/Project Management. SCRUM experience highly desired

Preferred:

Familiar with NET/SAP

Knowledge of deep learning methods for NLP (quantitative area of study, Computer Science, preferred)

Strong background and demonstratable experience in Natural Language Processing and Computational Linguistics is required

Experience working with the pharmaceutical industry

Experience working with ERP systems

Source : TechDigital