Description
You will support the development of data pipelines and ETL processes to ensure data reliability and accessibility.
Responsibilities
- Build and maintain ETL processes to move and transform data.
- Integrate structured and semi-structured data from internal and external sources.
- Monitor pipeline performance and troubleshoot data issues.
- Collaborate with senior engineers and analysts to meet data requirements.
- Maintain documentation for data processes and flows.
Required Skills
- Bachelor's degree in Computer Science, Information Systems, Engineering, Mathematics, or a related field.
- Solid understanding of SQL and relational databases such as MySQL, PostgreSQL, or SQL Server.
- Exposure to Python or other scripting languages.
- Internship or academic project experience involving data processing or software development.
- Understanding of cloud platforms such as AWS, Azure, or GCP.
- Knowledge of ETL concepts and data integration.
Preferred Skills
- Knowledge of version control with Git and GitHub.
- Experience with data visualization tools like Tableau or Power BI.
- Understanding of APIs and web-based data ingestion.