Description
You will design, build, and maintain scalable data infrastructure and pipelines.
Responsibilities
- Design and develop scalable data pipelines to ingest, transform, and load data from APIs, flat files, and third-party services into the data warehouse.
- Implement data quality checks and validations to ensure accuracy and integrity across all data flows.
- Monitor pipeline performance, troubleshoot issues, and improve system reliability.
- Document data models, ETL processes, and technical specifications.
Required Skills
- 1-3 years of experience in a data engineering role or related field.
- Proficiency in SQL and relational databases such as MySQL or PostgreSQL.
- Proficiency in one or more programming languages including Python, Java, or Scala.
- Understanding of data modeling and schema design.
- Bachelor’s degree in Computer Science, Engineering, Mathematics, or a related field, or equivalent relevant experience.
- Strong analytical and problem-solving skills for troubleshooting data issues.
Preferred Skills
- Experience with data processing frameworks like Apache Spark or Apache Airflow.
- Experience with data visualization tools such as Tableau or Power BI.
- Understanding of data warehousing concepts.