Key Skills: Hadoop, PySpark, Hive, Impala, Tiger Graph, Neo4j, Graph Analytics, CI/CD, Data Engineering, Unix
Good to Have Skills: Experience building E2E analytics platform using Graph DB and Big Data platform. Knowledge of metadata management, data lineage, and principles of data governance. Experience with graph-based data workflows and working with graph analytics. Hands on experience on implementing CI/CD and automation using the Atlassian ecosystem.
Roles & Responsibilities:
- Work on the CSWT Graph Data Platform and SDP supporting development of Tiger Graph based RAG solution for AI use cases.
- Support development for Consumer applications, fraud detection, and AML tracking using graph databases.
- Work on the Hadoop ecosystem ensuring system stability and performance for large scale data processing.
- Supervise daily batch processes and monitor system performance to ensure operational excellence.
- Handle large datasets using big data technologies such as Hive, PySpark, and Impala for data processing.
- Complete development tasks assigned by CIO leads and generate business reports as required.
- Provide required data and support ad hoc requests from various business stakeholders.
- Design, develop, and maintain software frameworks using Tiger Graph and Neo4J databases.
- Drive process improvements through innovative ideas and deliver high-quality software solutions.
Experience Required: 4-5 years of relevant experience, particularly in Hadoop and Unix environments with hands-on experience in big data technologies.
Education: BE/BTech/MCA