← Back to jobs
Gurgaon, Haryana, India
No related jobs found
We are seeking a Lead Data Engineer with strong expertise in Python and PySpark to design, build, and migrate scalable data pipelines on Databricks using Apache Spark. The role focuses on implementing and managing Medallion Architecture (Bronze, Silver, Gold) with Delta Lake to deliver reliable, high-performance data solutions. The successful candidate will optimize Databricks jobs and Spark workloads, enhance ETL/ELT pipelines, support batch and streaming data processing, and provide technical mentorship while leading Databricks-focused proof-of-concept initiatives
Roles & Responsibilities
Migration and Implementation : Lead the migration of legacy Java-based data pipelines to Databricks; design and maintain scalable data pipelines using Databricks and Spark.
Data Engineering and Management : Create, maintain, and update data models; build infrastructure for optimal data extraction, transformation, and loading (ETL).
Performance Analysis and Optimization : Analyse and optimize Databricks jobs and queries; monitor and tune Databricks environments for scalability and reliability. Address data-related technical issues; assist teams with data transformation workloads; perform root cause analysis to identify improvement opportunities.
Technical Improvement : Implement best practices for efficient data processing and storage; develop processes that support data transformation, workload management, data structures, and metadata.
Technical Mentorship : Mentor a team of data engineers; serve as the go-to person for Databricks-related queries; conduct Proof of Concepts (POCs) to demonstrate Databricks capabilities
Required Experience
Non-Technical / Behavioral Competencies Required
Any Graduate
No related jobs found
← Back to jobs