Search thousands of fresh jobs

×
This job is expired
Recru-it

Data Engineer

Recru-it

  • R Undisclosed
  • Permanent Senior position
  • South Africa
  • Posted 23 Feb 2026 by Recru-it
  • Expires in 29 days
  • Job 2633789 - Ref PE011501

About the position

Key Responsibilities:
Data Engineering & Pipeline Management

  • Design, build, and optimize T-SQL stored procedures, functions, and scripts for high-volume data processing and ECM scoring.
  • Develop, deploy, and monitor end-to-end ETL/ELT workflows (e.g., SQL Server Agent, SSIS, Azure Data Factory, or Airflow) with checkpoint/rollback, job tracking, and recovery capabilities.
  • Perform data cleansing, preparation, and transformation to support business intelligence and machine learning workflows.
  • Engineer and maintain reusable feature store tables (per entity/tax type) for ML models and operational scoring.
  • Model and maintain data warehouse structures (3NF, dimensional/star/snowflake), ensuring proper documentation of data lineage.
  • Prepare and deliver curated, scored datasets for downstream consumption in Power BI dashboards and analytics environments.
  • Develop and maintain audit, telemetry, and job tracking tables to ensure data reliability, restartability, and monitoring visibility.
  • Support and troubleshoot production pipelines, optimizing query performance via indexing, tuning, and profiling tools.

 
Data Quality, Governance, and Compliance

  • Implement and monitor data validation, reconciliation, and QA frameworks across the data lifecycle.
  • Enforce data security, privacy, and compliance controls in line with corporate and regulatory standards.
  • Support the implementation of data governance and lineage documentation, ensuring traceability and adherence to EDM policies.

 
Collaboration and Cross-functional Support

  • Collaborate with data analysts, data scientists, software engineers, and business stakeholders to translate business problems into scalable data solutions.
  • Provide accessible, well-documented datasets to support analytics and reporting.
  • Contribute to all phases of the SDLC, including requirements, design, development, testing, deployment, and maintenance.


Qualifications and Experience:

  • A tertiary qualification in Computer Science, Information Systems, Data Engineering, Analytics, Mathematics, or Statistics or Matric with 6-8 years of experience in data engineering, database development, or data management in production environments.
  • Proven hands-on experience with SQL Server, including advanced T-SQL development, ETL/ELT workflow design, and performance tuning.
  • Demonstrated delivery of production data solutions—both batch and near real-time—within enterprise environments.
  • Experience in building and maintaining data warehouses, feature stores, and reusable data products.
  • Track record of implementing data governance and quality frameworks, ensuring compliance and traceability.
  • Experience in orchestrating complex data pipelines using SQL Server Agent, SSIS, Airflow, or Azure Data Factory.
  • Familiarity with cloud-based data architectures (Azure preferred) and version control systems (Git).
  • Exposure to Power BI or equivalent visualization tools for reporting and analytics enablement.
  • Strong understanding of data security, privacy, and regulatory compliance requirements.



Key Skills and Competencies:

  • Advanced SQL Server Development: Strong proficiency in T-SQL, stored procedure design, query optimization, indexing, and error handling.
  • ETL and Data Warehousing: Expertise in ETL/ELT pipeline design and orchestration for batch and near real-time processing using SQL Server Agent, SSIS, or Azure Data Factory.
  • Data Modeling: Solid understanding of normalized and dimensional modeling (3NF, star, snowflake) and scalable architecture design.
  • Feature Store Development: Ability to design and maintain reusable feature tables supporting machine learning and operational scoring.
  • Data Validation and Quality Assurance: Skilled in implementing validation rules, reconciliation checks, and QA frameworks to ensure data integrity.
  • Data Governance and Security: Strong knowledge of data governance, privacy, and compliance standards; experience maintaining data lineage documentation.
  • Workflow Orchestration: Experience building restartable, traceable workflows with checkpoint and rollback mechanisms.
  • Programming and Scripting: Proficiency in SQL and beneficial experience in Python or R for automation and data manipulation.
  • Cloud Platforms: Familiarity with Azure (preferred) or other cloud platforms such as AWS or GCP for data engineering workloads.
  • Version Control and CI/CD: Exposure to Git and CI/CD pipelines for managing data workflow deployment.
  • Visualization and Reporting (Beneficial): Ability to prepare scored or curated data for BI tools such as Power BI.
  • Performance Optimization: Expertise in performance tuning, query profiling, and indexing strategies to optimize large-scale data operations.
  • Collaboration and Communication: Ability to work effectively across technical and business teams, translating complex requirements into practical data solutions.


Key Responsibilities:
Data Engineering & Pipeline Management

  • Design, build, and optimize T-SQL stored procedures, functions, and scripts for high-volume data processing and ECM scoring.
  • Develop, deploy, and monitor end-to-end ETL/ELT workflows (e.g., SQL Server Agent, SSIS, Azure Data Factory, or Airflow) with checkpoint/rollback, job tracking, and recovery capabilities.
  • Perform data cleansing, preparation, and transformation to support business intelligence and machine learning workflows.
  • Engineer and maintain reusable feature store tables (per entity/tax type) for ML models and operational scoring.
  • Model and maintain data warehouse structures (3NF, dimensional/star/snowflake), ensuring proper documentation of data lineage.
  • Prepare and deliver curated, scored datasets for downstream consumption in Power BI dashboards and analytics environments.
  • Develop and maintain audit, telemetry, and job tracking tables to ensure data reliability, restartability, and monitoring visibility.
  • Support and troubleshoot production pipelines, optimizing query performance via indexing, tuning, and profiling tools.

 
Data Quality, Governance, and Compliance

  • Implement and monitor data validation, reconciliation, and QA frameworks across the data lifecycle.
  • Enforce data security, privacy, and compliance controls in line with corporate and regulatory standards.
  • Support the implementation of data governance and lineage documentation, ensuring traceability and adherence to EDM policies.

 
Collaboration and Cross-functional Support

  • Collaborate with data analysts, data scientists, software engineers, and business stakeholders to translate business problems into scalable data solutions.
  • Provide accessible, well-documented datasets to support analytics and reporting.
  • Contribute to all phases of the SDLC, including requirements, design, development, testing, deployment, and maintenance.


Qualifications and Experience:

  • A tertiary qualification in Computer Science, Information Systems, Data Engineering, Analytics, Mathematics, or Statistics or Matric with 6-8 years of experience in data engineering, database development, or data management in production environments.
  • Proven hands-on experience with SQL Server, including advanced T-SQL development, ETL/ELT workflow design, and performance tuning.
  • Demonstrated delivery of production data solutions—both batch and near real-time—within enterprise environments.
  • Experience in building and maintaining data warehouses, feature stores, and reusable data products.
  • Track record of implementing data governance and quality frameworks, ensuring compliance and traceability.
  • Experience in orchestrating complex data pipelines using SQL Server Agent, SSIS, Airflow, or Azure Data Factory.
  • Familiarity with cloud-based data architectures (Azure preferred) and version control systems (Git).
  • Exposure to Power BI or equivalent visualization tools for reporting and analytics enablement.
  • Strong understanding of data security, privacy, and regulatory compliance requirements.



Key Skills and Competencies:

  • Advanced SQL Server Development: Strong proficiency in T-SQL, stored procedure design, query optimization, indexing, and error handling.
  • ETL and Data Warehousing: Expertise in ETL/ELT pipeline design and orchestration for batch and near real-time processing using SQL Server Agent, SSIS, or Azure Data Factory.
  • Data Modeling: Solid understanding of normalized and dimensional modeling (3NF, star, snowflake) and scalable architecture design.
  • Feature Store Development: Ability to design and maintain reusable feature tables supporting machine learning and operational scoring.
  • Data Validation and Quality Assurance: Skilled in implementing validation rules, reconciliation checks, and QA frameworks to ensure data integrity.
  • Data Governance and Security: Strong knowledge of data governance, privacy, and compliance standards; experience maintaining data lineage documentation.
  • Workflow Orchestration: Experience building restartable, traceable workflows with checkpoint and rollback mechanisms.
  • Programming and Scripting: Proficiency in SQL and beneficial experience in Python or R for automation and data manipulation.
  • Cloud Platforms: Familiarity with Azure (preferred) or other cloud platforms such as AWS or GCP for data engineering workloads.
  • Version Control and CI/CD: Exposure to Git and CI/CD pipelines for managing data workflow deployment.
  • Visualization and Reporting (Beneficial): Ability to prepare scored or curated data for BI tools such as Power BI.
  • Performance Optimization: Expertise in performance tuning, query profiling, and indexing strategies to optimize large-scale data operations.
  • Collaboration and Communication: Ability to work effectively across technical and business teams, translating complex requirements into practical data solutions.

Desired Skills:

  • A tertiary qualification
  • SQL Server
  • adva T-SQL dev
  • ETL/ELT workflow design perfor tuning
  • maintaining data warehouses
  • using SQL Server Agent
  • SSIS
  • Airflow
  • or Azure Data Factory

Recru-it

About the agency

Recruit IT Recruitment IT Recruitment and Talent Sourcing Specialists Offices in Cape Town and Port Elizabeth as well as Consultants working remotely across the country Telephone number 087 805 8536 www.recru-it.co.za >recru-it* COMPANY PROFILE Certified at a BEE Procurement Recognition Level of 110% >Introduction* >recru-it*was established in August 2005 & specializes in and focuses on the full spectrum of positions within the IT and other sectors. We focus our approach on delivering a superior service to both our client and candidate, in all portfolios and phases throughout the Recruitment process, supporting real transformation within the IT Industry and other sectors through ethical and transparent business practices >Value added services* • Advertising Client Roles • Screening Applications • CV searches • Head Hunting Candidates • CV Selection • Labour Broking • Pay structure advice for client & candidate >Additional services on request* • Personal Reference checks • Credit checks • Criminal checks • ID checks • Academic checks • Qualification checks >Placements portfolio* • Software Engineering & Development • I.T. Solution Sales and Strategic Sales • Sales & marketing • Finance and Insurance • HR • Engineering • Administration / Office Management • Healthcare • FMCG • Warehousing / Logistics • Telecommunications • Training and Development • Executive and senior level placements • ERP & CRM Consultants • Project Management & Administration • I.T Executive Management • Business Analysis • Business Intelligence • Consulting • Network Engineering • Support • Testing • Product Support Specialists   >Operational structure * >recru-it*uses a flat open structure in our approach  Each consultant takes personal ownership for each client request. The consultants are account managers with their respective clients ensuring professional and personal interaction at all times.  Our team supports each other in an interactive, transparent manner to deliver highest quality candidates on each specification, thus ensuring a fast and effective turnaround time to fulfill your every labour requirement. >recru-it*was established in August 2005. Carbon foot print  We practice a 90% paperless environment as most of our duties are internet and electronic. >BEE Profile*  >recru-it*is owned by 2 individuals with 8 additional staff members • 50 % of the business is owned by a black person. • 50% of the business is women owned.  >recru-it*has been officially & precisely rated according to our company structure. • We have been certified at a BEE Procurement Recognition Level of 110%. • Enterprise development – on site as well as external training courses for staff ensuring continuous skill improvement. • Corporate Social Investment – we do not have a formal CSI policy, but we do annual donations.

Receive a daily digest of all new jobs matching this job. Your information is safe with us and you can cancel any time.

Expires in 28 days

Email me jobs similar to: Data Engineer

Receive a daily digest of all new jobs matching this job: Senior IT Auditor. Your information is safe with us and you can cancel at any time.