Description
You will build and manage data pipelines to support core business workflows.
Responsibilities
- Develop scalable ETL/ELT data pipelines.
- Implement data solutions for Know Your Customer (KYC) and Anti-Money Laundering (AML) workflows and for client lifecycle management.
- Process and manage large-scale data sets across distributed systems.
- Orchestrate data workflows using orchestration tools such as Airflow or Prefect.
Required Skills
- 6+ years of professional experience in data engineering.
- Strong programming skills in Python or Scala.
- Experience with big data and streaming frameworks, including Spark (PySpark), Flink, Hadoop, and Kafka.
- Proficiency with cloud data platforms (AWS, Azure, or GCP).
- Expertise in SQL and relational databases (Oracle, PostgreSQL).
- Experience with NoSQL databases such as MongoDB or Couchbase.
- Familiarity with workflow orchestration tools like Airflow or Prefect.
- Understanding of distributed, multi-tier application architectures.
Preferred Skills
- Experience with containerization technologies (Docker, Kubernetes).
- Exposure to large-scale document processing or LLM-based pipelines.