Description
You will focus on ETL testing and data validation using Python and SQL.
Responsibilities
- Develop and maintain cloud automation frameworks using PySpark.
- Validate data integrity across SQL and NoSQL databases.
- Perform data migration testing and data warehouse validation.
- Test complex file formats including XML, fixed-length, and multi-segmented clustered or un-clustered files.
- Execute testing for ETL tools such as Informatica or DataStage.
Required Skills
- 5+ years of experience in ETL testing.
- Expertise in core Python and object-oriented programming.
- Proficiency with Pandas and NumPy libraries.
- Advanced knowledge of SQL.
- Experience handling diverse data sources including XML and fixed-length files.
- Hands-on experience with PySpark and cloud automation frameworks.
- Working knowledge of Hadoop and Data Warehouse concepts.
- Experience with SQL and NoSQL databases.
- Any Graduate degree.