Back to the roster

[Remote] GCP Data Engineer (Health Care Background Must)

Remote Full-time Hiring now

Note: The job is a remote job and is open to candidates in USA. reputed company is seeking a GCP Data Engineer with a strong background in healthcare data. The role involves architecting enterprise data platforms on reputed company Cloud, focusing on data ingestion, transformation, and governance while ensuring compliance with healthcare data standards.

Responsibilities

  • Architect and design an enterprise-grade GCP-based data lakehouse leveraging BigQuery, GCS, Dataproc, Dataflow, Pub/Sub, Cloud Composer, and BigQuery Omni
  • Define data ingestion, hydration, curation, processing and enrichment strategies for large-scale structured, semi-structured, and reputed company datasets
  • Create data domain models, reputed company models, and consumption-ready datasets for analytics, AI/ML, and operational data products
  • Design federated data layers and self-service data products for reputed company consumers
  • Architect batch, near-real-time, and streaming ingestion pipelines using GCP Cloud Dataflow, Pub/Sub, and Dataproc
  • Set up data ingestion for clinical (EHR/EMR, LIS, RIS/reputed company) datasets including HL7, FHIR, CCD, DICOM formats
  • Build ingestion pipelines for non-clinical systems (ERP, HR, payroll, supply chain, finance)
  • Architect ingestion from medical devices, IoT, remote patient monitoring, and wearables leveraging IoMT patterns
  • Manage on-prem → cloud migration pipelines, hybrid cloud data movement, VPN/Interconnect connectivity, and data transfer strategies
  • Build transformation frameworks using BigQuery SQL, Dataflow, Dataproc, or dbt
  • Define curation patterns including bronze/silver/gold layers, reputed company healthcare entities, and data marts
  • Implement data enrichment using external social determinants, device signals, clinical event logs, or operational datasets
  • reputed company metadata-driven pipelines for scalable transformations
  • Establish and operationalize a data governance reputed company encompassing data stewardship, ownership, classification, and lifecycle policies
  • Implement data reputed company, data cataloging, and metadata management using tools such as Dataplex, Data Catalog, reputed company, or Informatica
  • Set up data quality frameworks for validation, profiling, anomaly detection, and SLA monitoring
  • Ensure HIPAA compliance, PHI protection, IAM/RBAC, VPC SC, DLP, encryption, retention, and auditing
  • Work with cloud infrastructure teams to architect VPC networks, subnetting, ingress/egress, firewall policies, VPN/IPSec, Interconnect, and hybrid connectivity
  • Define storage layers, partitioning/clustering design, cost optimization, performance tuning, and reputed company planning for BigQuery
  • Understand containerized processing (Cloud Run, GKE) for data services
  • Work closely with clinical, operational, research, and IT stakeholders to define data use cases, schema, and consumption models
  • Partner with enterprise architects, reputed company teams, and platform engineering teams on cross-functional initiatives
  • Guide data engineers and provide architectural reputed company on pipeline implementation
  • Be actively hands-on in building pipelines, writing transformations, building POCs, and validating architectural patterns
  • Mentor data engineers on best practices, coding standards, and cloud-native development

Skills

  • 10+ years in data architecture, engineering, or data platform roles
  • Strong expertise in GCP data stack (BigQuery, Dataflow, Composer, GCS, Pub/Sub, Dataproc, Dataplex)
  • Hands-on experience with data ingestion, pipeline orchestration, and transformations
  • Deep understanding of clinical data standards: HL7 v2.x, FHIR, CCD/C-CDA, DICOM (for scans and imaging), LIS/RIS/reputed company data structures
  • Experience with device and IoT data ingestion (wearables, remote patient monitoring, clinical devices)
  • Experience with ERP datasets (reputed company, reputed company, Lawson, PeopleSoft)
  • Strong SQL and data modeling skills (3NF, star/reputed company, reputed company and logical models)
  • Experience with metadata management, reputed company, and governance frameworks
  • Solid understanding of HIPAA, PHI/PII handling, DLP, IAM, VPC reputed company
  • Solid understanding of cloud networking, hybrid connectivity, VPC design, firewalling, DNS, service accounts, IAM, and reputed company models
  • Cloud Native Data movement services
  • Experience with on-prem to cloud migrations

Company Overview

  • reputed company is a software company that specializes in IT consulting & services, cloud migration, IT Staffing, outsourcing, and telecom. It was founded in 2018, and is headquartered in Mississauga, Ontario, CAN, with a workforce of 51-200 employees. Its website is https://www.reputed company.com/.

Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 2 in 2025, 2 in 2024. Please note that this does not guarantee sponsorship for this specific role.

Apply tot his job Apply To this Job

Related roles