[Remote] Lead Data Engineer
Note: The job is a remote job and is open to candidates in USA. IT Labs is an international software tech company specializing in high-performance and innovation. They are seeking an experienced Lead Data Engineer to support a client in the digital health space by designing and evolving their data platform, building scalable data pipelines, and ensuring the delivery of high-quality data for business decisions.
Responsibilities
- Lead the design, development, and optimization of scalable data platforms and pipelines
- Design, build, and maintain production-grade ETL/ELT workflows for batch and near real-time data processing
- Drive the migration and modernization of data assets from BigQuery and other analytical platforms into Snowflake
- Develop robust integrations with enterprise applications, APIs, and external data providers
- Build and optimize Snowflake data models, schemas, Snowpipe ingestion processes, and query performance
- Collaborate with business stakeholders, analysts, and engineering teams to translate business requirements into scalable data solutions
- Implement data quality, validation, monitoring, and observability frameworks
- Establish and promote best practices for data governance, lineage tracking, metadata management, and security
- Support both batch and streaming data architectures using technologies such as Kafka, Event Hub, or equivalent
- Mentor other engineers and contribute to architectural decisions, technical standards, and engineering best practices
- Ensure data platforms meet performance, reliability, scalability, and regulatory requirements
Skills
- 7+ years of experience in data engineering within cloud or hybrid environments
- Strong experience designing, building, and maintaining production-scale ETL/ELT pipelines
- Advanced SQL skills and proficiency with data transformation techniques
- Strong Python programming skills for data processing and automation
- Hands-on experience with Snowflake, including: Data modeling, Snowpipe, Query optimization, Schema design
- Experience migrating from BigQuery or similar columnar data warehouse technologies
- Experience with batch and streaming data processing platforms such as Kafka, Event Hub, or equivalent
- Experience building enterprise-grade API and source-system integrations
- Experience implementing data quality and validation frameworks
- Good understanding of: Data governance, Data lineage, Access control and security best practices
- Strong problem-solving and communication skills
- Ability to work effectively in a collaborative, cross-functional environment
- Experience with Databricks, including: Delta Lake, Unity Catalog, Spark-based pipelines
- Experience with Spark, Scala, or PySpark
- Familiarity with Tableau
- Understanding of HIPAA and HITRUST data handling practices
- Experience with dbt (data build tool) for transformation layer management
- Experience working within healthcare, insurance, or regulated industries
Benefits
- Full-time, 40h/week
- US-based — US citizenship is required
- Remote
- Contract or B2B arrangement
Company Overview