Search thousands of fresh jobs

×
This job is expired
Datafin

Senior Computer Systems Engineer (CPT)

Datafin

  • R100 per month
  • Permanent Senior position
  • Cape Town
  • Posted 14 Apr 2026 by Datafin
  • Job 2637028

About the position

ENVIRONMENT:

Our client is a prominent organisation focused on supporting research advancement and human capital development through funding programmes, research infrastructure, and science outreach initiatives across a broad range of disciplines.

 

DUTIES

  • Contribute to the global design and implementation of scalable, fault-tolerant infrastructure systems that support engineering and operational needs.
  • Contribute to the deployment, configuration, and maintenance of distributed storage and database systems.
  • Analyse system failures, performance issues, and misconfigurations across hardware, software, and network layers.
  • Lead and mentor computer systems engineers and contribute to strategic technical planning.
 

REQUIREMENTS

Qualification:

  • BTech in Computer Science, Software Engineering, Information Systems, Electronic Engineering or equivalent qualifications, coupled with 13 years of experience; OR
  • BENG/MTech in Computer Science, Software Engineering, Information Systems, Electronic Engineering or equivalent qualifications, coupled with 9 years of experience; OR
  • MENG in Computer Science, Software Engineering, Information Systems, Electronic Engineering or equivalent qualifications, coupled with 7 years of experience; OR
  • PhD in Computer Science, Software Engineering, Information Systems, Electronic Engineering or equivalent qualifications, coupled with 5 years of experience.
Experience:

  • 3+ years in a technical leadership or software/system architectural role with direct responsibility for large-scale or platform-scale distributed systems.
  • Demonstrated hands-on experience in infrastructure design and automation, distributed systems, observability, CI/CD, container orchestration (e.g., Kubernetes), DevOps/SRE practices, and cloud-native technologies.
  • Experience leading teams or initiatives that intersect with data platforms, storage, networking, and systems engineering domains.
Knowledge:

  • In-depth understanding of systems engineering principles, including performance optimisation, fault tolerance, and resource scheduling in Linux-based environments.
  • Strong knowledge of containerised environments (Docker, Podman), orchestration platforms (Kubernetes, Helm), and runtime architectures (containerd, CRI).
  • Expertise in infrastructure-as-code, continuous integration/deployment (CI/CD), and configuration management tools (e.g., GitLab CI, Ansible, Terraform, ArgoCD).
  • Advanced understanding of distributed computing and storage architectures, including Ceph, S3, NFS, and local/clustered file systems.
  • Operational and architectural fluency in relational and NoSQL database systems (e.g., PostgreSQL, MySQL, MongoDB), including replication, backups, and performance tuning.
  • Working knowledge of networking fundamentals, security protocols, and systems-level observability (e.g., Prometheus, Grafana, ELK/EFK stack).
  • Familiarity with the HPC ecosystem (e.g., SLURM, job schedulers) is beneficial for environments supporting scientific or research computing.
 

ATTRIBUTES

Core Competencies (Essential):

  • Demonstrated technical leadership (3+ years), leading cross-functional efforts across systems, storage, and database infrastructure, driving technical decisions from architecture through implementation.
  • Systems engineering expertise, with a focus on Linux administration, infrastructure automation, service orchestration, and performance optimisation across diverse environments.
  • Expertise in distributed systems architecture, including the design and deployment of scalable, resilient services using microservices, event-driven, and cloud-native design patterns.
  • Containerisation and orchestration fluency, including production-grade usage of Kubernetes, Docker, and Helm for system and application-level deployments.
  • Infrastructure automation and CI/CD, using tools such as GitLab CI, ArgoCD, FluxCD, Jenkins, or GitHub Actions to streamline and secure platform operations.
  • Complementary DevOps and SRE practices, blending infrastructure-as-code, configuration management, and release automation with incident response, monitoring, SLIs/SLOs, and system reliability engineering.
  • Linux expertise, including advanced troubleshooting, kernel tuning, Systemd orchestration, and optimisation at scale.
  • Technical delivery and planning capabilities, including backlog scoping, cross-team collaboration, and Agile sprint execution.
  • Database administration skills, with operational experience in administering relational and NoSQL databases (e.g., PostgreSQL, MySQL, MongoDB), including high availability, backups, replication, and performance tuning.
  • Diagnostic skills, with a root-cause-first approach, and a strong bias for ownership, accountability, and long-term operational stability.
Skills:

  • Technical leadership: Ability to lead architectural discussions, influence design decisions, and mentor junior engineers across infrastructure streams.
  • Resource management/leadership: Provides leadership that fosters an environment encouraging new ideas and supports the development of emerging skills. Creates trust through consistency, understanding, integrity, and patience. Plans, seeks, allocates, and monitors resources to achieve outcomes.
  • Problem solving and analysis: Skilled in root cause analysis, systems troubleshooting, and performance bottleneck resolution.
  • Communication and collaboration: Clear articulation of technical recommendations, cross-functional stakeholder engagement, and feedback integration.
  • Planning and delivery: Proficient in backlog grooming, sprint planning, and technical delivery in Agile/DevOps environments.
  • Continuous learning: Commitment to staying current with evolving technologies in containerisation, cloud-native systems, observability, and systems automation.
  • Documentation and knowledge sharing: Ability to produce high-quality technical documentation and share knowledge across engineering teams.
  • Teamwork: Collaborates within their team and with cross-functional teams alongside partners.
  • Service Level Agreements (SLAs): Ability to interpret, monitor, and manage SLAs, warranties, and related contractual obligations, and an understanding of operational frameworks such as Site Reliability Engineering (SRE), ITIL, and COBIT.
 

Tooling Proficiency (this is not an exhaustive list; additional relevant experience or skills will be viewed favourably):

  • Containerisation & Orchestration: Kubernetes, Docker, Podman, Helm, containerd
  • Resource Management: SLURM (or other schedulers)
  • Hardware & Infrastructure Acceleration: GPU & FPGA drivers
  • Automation & Configuration Management: Ansible, Terraform, Bash, Python, Systemd, Packer
  • CI/CD and Release Management: GitLab CI, GitHub Actions, Jenkins, Ansible Tower, ArgoCD/FluxCD, cron/at/Systemd timers
  • Cloud, Virtualisation, and Bare-Metal Platforms: OpenStack, VMware vSphere/ESXi, Proxmox, KVM, AWS EC2/Storage, Terraform
  • Storage & Filesystem Tools: Ceph, NFS, iSCSI, ZFS, Lustre, or related
  • Database Operations (Operational DBA Tools): PostgreSQL CLI tools, MySQL, MongoDB, Timescale DB, cron-based backups, or related
  • Monitoring & Observability: Prometheus, Grafana, Zabbix, ELK stack, or related
 

Organisational Values:
The Senior Compute Systems Engineer will be expected to demonstrate the following values and to work actively to instil those behaviours in all their colleagues in South Africa:

  • Diversity and Inclusion
  • Excellence
  • Collaboration
  • Creativity and Innovation
  • Sustainability
  • Passion for Excellence
  • World-class service
  • People-centred approach
  • Respect
  • Integrity and Ethics
  • Accountability

Desired Skills:

  • Communication
  • Leadership
  • Solving Problems

About The Employer:

Our client is a prominent organisation focused on supporting research advancement and human capital development through funding programmes, research infrastructure, and science outreach initiatives across a broad range of disciplines.

Datafin

About the agency

Datafin Recruitment was established in 1999 and is one of South Africa’s leading Recruitment companies. Owned and managed by two sisters Lindy and Bev Sollinger, we focus on connecting with both our clients and candidates in an authentic conscious meaningful manner. We focus on the Tech, Digital/Online, Data, Finance and HR industries.

Receive a daily digest of all new jobs matching this job. Your information is safe with us and you can cancel any time.

Expires in 33 days

Email me jobs similar to: Senior Computer Systems Engineer (CPT)

Receive a daily digest of all new jobs matching this job: Senior IT Auditor. Your information is safe with us and you can cancel at any time.