Job Description: We are seeking a skilled Data Engineer to migrate our existing data warehouse and data model from MariaDB to an on-premises Cloudera Hadoop big data platform.
The ideal candidate is proficient in SQL, Hive SQL, Spark, and data modeling.
They should also have a strong understanding of the end-to-end production deployment process, from design, development, and testing through UAT and production release.
Experience scheduling jobs with Autosys is essential for this role.
Key Responsibilities
• Migrate the existing data warehouse and data model from MariaDB to the on-premises Cloudera Hadoop big data platform.
• Develop and optimize SQL, Hive SQL, and Spark scripts to ensure efficient data processing.
• Design and implement data models to support business requirements and optimize performance.
• Collaborate with cross-functional teams to understand data requirements and ensure data integrity throughout the migration process.
Qualifications
• Bachelor's degree or higher in Computer Science, Engineering, or a related field.
• 5+ years of proven experience in data engineering and migration projects on big data platforms (Hortonworks / Cloudera).
• Strong hands-on experience with Cloudera and related ecosystem components.
• Strong experience in implementing ETL (Extract, Transform, Load) processes.