Description
You will build and maintain large-scale data pipelines and structures to standardize information for insights.
Responsibilities
- Develop and maintain data pipelines and structures to organize and standardize data.
- Integrate REST services with Optical Character Recognition (OCR) systems to process and push data.
- Collaborate with business stakeholders to define requirements and clarify needs.
- Apply software engineering principles to all data-centric projects.
Required Skills
- 5+ years of experience in data, software, or back-end engineering.
- Proficiency in Python and Java.
- Strong experience with Google Cloud Platform (GCP).
- Hands-on experience with SQL and relational databases.
- Experience with BigTable and MongoDB.
- Proven ability to build and manage ETL processes.
Preferred Skills
- Experience working with OCR technologies.
- Experience productionalizing Machine Learning models.