Description
Build and manage data architecture and pipelines for large-scale processing and analytics.
Responsibilities
- Design and maintain data pipelines for ingestion, ETL, and storage.
- Develop data models to support reporting and analytics.
- Monitor, troubleshoot, and optimize data system performance and reliability.
- Manage BigQuery usage and implement cost-effective resource scaling.
- Collaborate with cross-functional teams to translate data requirements into technical solutions.
Required Skills
- 5+ years of experience in data engineering roles.
- Strong proficiency in SQL and database technologies.
- Hands-on experience with Google Cloud Platform (GCP) tools: BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Bigtable.
- Practical experience with big data technologies: Hadoop, Spark, Hive, and Kafka.
- Working knowledge of data warehousing and ETL tools like Apache Airflow and Amazon Redshift.
- Experience implementing CI/CD pipelines.
- Familiarity with data modeling and data warehousing concepts.
- Degree in any graduate field.
Preferred Skills
- Exposure to data visualization tools such as Looker, PowerBI, or Tableau.