Description
You will build and manage data cataloging, access, and provisioning solutions within an AWS environment.
Responsibilities
- Develop data catalogs, metadata management, and data findability solutions.
- Manage large volumes of data from multiple sources, including internal and external transfers.
- Build and maintain ETL pipelines and data modeling structures.
- Contribute to data management policies, procedures, and quality assurance methodologies.
- Ensure data systems meet internal and external regulatory requirements.
Required Skills
- 3+ years of industry experience managing data.
- Proficiency in Python programming.
- Experience with AWS services including Lambda, S3, SQS, Elasticsearch, IAM, EC2, and RDS.
- Hands-on experience with RDBMS and NoSQL databases.
- Experience with Linux/Unix scripting and terminal-based automation.
- Knowledge of CI/CD, big data, and data storage principles.
- Experience with electronic data capture systems.
- Strong understanding of data modeling and ETL pipeline construction.
Preferred Skills
- Knowledge of iRODS or similar data cataloging systems.