← Back to jobs
Toronto, ON, Canada
No related jobs found
Key Responsibilities
Cloudera Databricks ModernizationLead modernization of legacy Cloudera platforms (CDH CDP| Hive| HBase| Impala| Spark) into Databricks Lakehouse.
Redesign ingestion| transformation| and consumption patterns from HDFS centric architectures to cloud object storage and Delta Lake.
Refactor legacy HiveImpala logic into PySpark Spark SQL based ELT pipelines.
Ensure data parity| reconciliation| and audit integrity during platform migration.
Enterprise Data Warehouse Data Lake Architecture Design and govern enterprise Data Warehouse and Data Lake Lakehouse architectures.
Implement layered architectures spanning
o Raw Landing zones
o Curated conformed layers
o Semantic consumption layers Modernize traditional EDW patterns into domain aligned| scalable lakehouse designs.
Finance Risk Data Modeling Support implementation of finance and risk data models| including
o General Ledger Sub ledger data
o Accounting events and financial hierarchies
o Risk exposure| liquidity| credit| and market risk models Enable aggregation| drill down| and drill back from reports to transaction level data.
Support regulatory reporting| management reporting| and analytics use cases.
Semantic Consumption Layers Build and manage semantic consumption layers to ensure consistent business logic across
o BI and reporting tools
o Finance Risk analytics
o Self service analytics platforms Define metrics| dimensions| hierarchies| and KPIs aligned to finance and risk definitions.
Implement semantic models using
o Databricks SQL
o Delta tables
o dbt or equivalent transformation frameworks Databricks Engineering Optimization Engineer large scale pipelines using PySpark| Spark SQL| and Delta Lake
Any Gradute
No related jobs found
← Back to jobs