Data Defense Developer - eTeam Inc.
Remote, TX 27518
About the Job
As a Data Defense Operations Developer, you will be responsible for supporting the Data Defense Lead along with a team of cross-functional cyber, privacy, engineering, and data protection analysts to define, implement, manage, and measure controls to protect data in accordance with relevant geographical regulations, contractual commitments, and confidentiality
Qualifications:
Bachelor's degree or equivalent industry work experience
3-5 years of experience programming and debugging non-trivial projects in Python, Java, Golang, or C, involving topics such as concurrency and async
Strong understanding of relational databases (MySQL, PostgreSQL, SQL Server, etc.), including advanced topics (optimization, replication, isolation, etc.)
Strong understanding of one of the following:
Spark (architecture, debugging, optimization)
Flink (architecture, debugging, optimization)
Another distributed computing framework
Strong understanding of one of the following:
Distributed SQL databases (Hive, Hudi, Iceberg, ClickHouse, etc.)
NoSQL databases (Redis / Redis Cluster, HBase, Cassandra, etc.) - consistency, consensus, etc.
Message queues (Kafka, RocketMQ, RabbitMQ, etc.) - delivery semantics, etc.
Unstructured distributed storage (Object Storage, File Storage, Block Storage)
Strong understanding of containers and their orchestration (Kubernetes), and significant experience with public/hybrid/private cloud
Familiarity working with Linux
Ability to stay curious about new developments in the industry and learn & adapt quickly
Ability to collaborate with cross-regional teams of diverse backgrounds to meet strategic and tactical objectives, as well as to serve as an individual contributor
Responsibilities:
Develop low-level storage interfaces to enable advanced data discovery capabilities to identify sensitive data
Design, implement, and operate large-scale distributed systems to perform data discovery using scalable, reusable, and configurable frameworks / methodologies
Define metrics and create / maintain dashboards for measuring and reporting key performance indicators (e.g., coverage, findings, and remediations)
Build and manage data inventories and data flow mappings by collecting and aggregating datasets from multiple data source systems
Collaborate with both technical and business teams to gather functional requirements for in-house and external technologies that identify sensitive, high-value data (e.g., consumer data, sensitive data, IP & source code) and ensure appropriate data protection controls are in place