Build, optimize, and maintain conceptual and logical database models.
Develop data set processes for data modeling, mining, and production.
Use programming languages and tools to integrate systems and manage data.
Implement internal process improvements, such as automating manual processes, optimizing data delivery, and redesigning infrastructure for greater scalability.
Collaborate with data scientists, business stakeholders, and IT team members on project goals.
What's Needed?
Proficiency in programming languages such as Python, Java, Scala, or Ruby.
Expertise in big data technologies such as Hadoop, Spark, and Kafka.
Experience with data integration and processing, and with both relational (SQL) and NoSQL databases such as PostgreSQL or MongoDB.
Knowledge of data pipeline and workflow management tools.
Familiarity with data visualization tools like Tableau or Power BI.