Mid. AI Engineer (Data Warehouse)
Job Description
- Design and develop data warehouse systems including data structure planning and integration across multiple sources to support AI and analytics.
- Build and manage efficient, scalable data pipelines for extraction, transformation, validation, and integration using tools like Apache Airflow, dbt, or NiFi.
- Perform data cleaning, transformation, and validation to ensure high-quality datasets for AI training and analytics.
- Prepare and manage AI training datasets, validating data consistency and accuracy.
- Develop dashboards and reports using visualization tools such as Power BI, Metabase, or Superset.
- Maintain technical documentation and version control for data pipelines and datasets to ensure traceability and consistency.
- Collaborate with AI Engineers, Data Scientist and stakeholders to fulfill data needs and optimize system performance.
- Take ownership of data quality standards and prioritize tasks for pipeline and visualization development.
- Propose improvements and upgrades for tools and technologies in data management.
- Identify and manage technical and operational risks related to data and pipelines.
Job Requirements
- Graduate with a Bachelor, Diploma 3, or Diploma 4 degree in IT, Computer Science, or related fields.
- Have a minimum 2 years of experience as an AI Engineer with a strong focus on data warehouse system and pipeline development.
- Good critical thinking and problem-solving skills with ability to identify and mitigate risks in data pipelines.
- Strong communication skills to collaborate effectively with AI Engineers, Data Scientists, and business stakeholders.
- Skilled in designing data warehouse systems, including data structures, storage schema, and integrating data from multiple sources for AI and analytics.
- Experienced with designing and maintaining ETL pipelines using tools like Apache Airflow, dbt, or NiFi.
- Proficient in data cleaning, transformation, validation, and preparing data for AI model training and analytics.
- Able to visualize data using Power BI, Metabase, or Superset.
- Capable of writing and maintaining technical documentation and managing dataset versioning.
- Understands data governance, corporate, management, and basic security principles relevant to AI development.