Search thousands of fresh jobs

×
This job is expired
Mediro Application Consulting

Reliability Engineer (Expert) 3336

Mediro Application Consulting

  • R Undisclosed
  • Contract Senior position
  • Pretoria
  • Posted 09 Apr 2026 by Mediro Application Consulting
  • Expires in 29 days
  • Job 2636826 - Ref LM_504907649853
Apply Now

About the position

1. Infrastructure & Cloud (35%):



Design, build, and maintain scalable, secure, and cost-efficient cloud infrastructure on AWS.



Manage and evolve Kubernetes clusters including upgrades, capacity planning, and cluster health.



Build and maintain infrastructure-as-code modules for repeatable, auditable deployments.



Drive cloud cost optimization - identify waste, right-size resources, implement savings plans.



Ensure infrastructure meets non-functional requirements: performance, scalability, availability, Disaster Recovery.



 



2. CI/CD & Automation (25%):



Build, operate, and continuously improve CI/CD pipelines for fast, safe, and reliable delivery.



Automate repetitive operational tasks and reduce toil through tooling and runbooks.



Maintain and improve deployment automation - zero-touch deployments are the goal.



Drive adoption of best practices across development teams.



Own deployment runbooks and ensure they are up to date and tested.



 



3. Security Implementation (20%):



Implement and maintain security scanning in CI/CD pipelines (SAST, DAST, container image scanning).



Harden container and cloud infrastructure security (network policies, IAM, secrets, encryption).



Translate security audit findings into concrete technical actions and execute them.



Drive vulnerability remediation - track, prioritize, and fix security issues with urgency.



Ensure compliance with security standards and policies.



 



4. Monitoring, Reliability & Incident Response (15%):



Implement and own monitoring, logging, and alerting for proactive issue detection.



Build dashboards that give real-time visibility into system health and performance.



Lead incident response for infrastructure-related issues - diagnose fast, fix fast.



Conduct post-incident reviews and drive corrective actions to prevent recurrence.



Continuously improve system reliability, uptime, and mean time to recovery (MTTR).



 



5. Technical Optimization & Lifecycle Management (5%):



Drive Technical Lifecycle Management (TLM) - plan and execute upgrades and migrations



Identify and implement technical optimizations across the stack



Contribute to technical strategy and roadmap for platform engineering



Actively use and promote AI4DevOps tools and practices where they add real value



 



WHAT DOES SUCCESS LOOK LIKE?



Infrastructure – Reliable, scalable, cost-optimized - no surprises



CI/CD – Fast, safe pipelines - developers ship with confidence.



Security – Vulnerabilities found early, fixed fast - no excuses.



Incidents – Quick response, thorough root cause, things get better over time.



Automation – If you did it twice manually, the third time it's automated.



Delivery – You ship improvements continuously - not just plans, but results.



We don't need someone who writes documents about how things should be done. We need someone who rolls.



up their sleeves and makes things better - every single day.


Minimum Requirements:

Qualifications/Experience:



Degree in Computer Science, Information Technology or equivalent practical experience.



10 to 15 years+ hands-on experience in DevOps / Infrastructure engineering / SRE, incl. cloud operations.



Relevant certifications advantageous (AWS Solutions Architect, AWS DevOps Engineer, CKA/CKAD, Terraform Associate).



 



Essential Skills Requirements:



1. Cloud Infrastructure & Operations:



Deep hands-on experience with AWS (EC2, ECS/EKS, RDS, S3, VPC, IAM, CloudFront, Lambda).



Proven ability to design, build, and operate scalable, highly available cloud infrastructure.



Strong experience with infrastructure-as-code (Terraform preferred, CloudFormation).



Solid experience with containerization and orchestration (Docker, Kubernetes).



Hands-on experience with cloud cost optimization - you know how to cut waste and right-size resources.



Experience with capacity planning, scaling strategies, and performance tuning.



 



2. CI/CD & Automation:



Deep experience building and maintaining CI/CD pipelines (GitHub Actions preferred, Jenkins).



Strong automation mindset - you automate everything that can be automated.



Experience with build tools and artifact management (Maven, Gradle, GitHub Packages, ECR).



Proficiency in scripting and tooling (Bash, Python) to solve real operational problems.



Experience with GitOps workflows and deployment automation.



 



3. Security & Compliance:



Hands-on experience implementing security controls in CI/CD pipelines (SAST, DAST, dependency scanning).



Knowledge of container security best practices (image scanning, runtime security, least-privilege).



Experience with IAM policies, network security, secrets management (AWS Secrets Manager, Vault).



Familiarity with compliance frameworks and ability to translate security requirements into implementations.



Pragmatic approach to security - you find the right balance between security and velocity.



 



4. Monitoring, Observability & Incident Response:



Experience with monitoring and alerting solutions (Prometheus, Grafana, CloudWatch, ELK/OpenSearch).



Ability to build meaningful dashboards that provide real operational insight.



Strong troubleshooting and incident response skills - you stay calm under pressure and fix things fast.



Experience with post-incident root cause analysis and driving corrective actions.



 



5. Mindset & Way of Working:



Pragmatic doer - you bias towards action and delivering results over endless discussions.



Comfortable working in agile teams (Scrum/SAFe) alongside developers, architects, and product owners.



Strong sense of ownership - if it's broken, it's your problem until it's fixed.



Ability to prioritize ruthlessly - you know what matters and focus on high-impact work.



Clear communicator who can explain technical decisions to non-technical stakeholders.



 



Advantageous Skills Requirements:



Experience with ITSM processes (Incident, Problem, Change) and tools like ServiceNow.



Experience with database operations and performance tuning (PostgreSQL, MySQL, MongoDB).



Knowledge of service mesh technologies (e.g., Istio).



Experience with chaos engineering or resilience testing.



Familiarity with FinOps practices and cloud cost governance at scale.



Experience with Technical Lifecycle Management (TLM) - upgrades, deprecations, migrations.



Knowledge of AI-assisted DevOps tools and willingness to adopt AI4DevOps practices.



Familiarity with Jira and Confluence for tracking and documentation.



Experience in automotive or enterprise-scale environments.


Desired Skills:

  • DevOps
  • Cloud Infrastructure & Operations
  • Infrastructure engineering

Apply Now

Mediro Application Consulting

About the agency

Quality Placements Built on Trust Whether you are looking for a job or need to acquire top talent, Mediro IT RECRUIT is here to assist. We are technical recruiters who care. Our strength lies in fostering connections between candidates seeking employment and companies looking to employ. Our team consists of high achievers, strong individual contributors, and leaders who change lives through personal connections. With a community of professional recruiters, talent pooling and an internal referral programme, we provide multiple candidate recommendations within seven days, regardless of industry and across South Africa. The world of work is rapidly changing; people want to learn and grow, however work-life balance, equity and flexibility continue to play a major role. Companies across industries constantly require modernised and specialised skills. As a result, we strive to be your most valued IT recruitment partner by understanding individual and company needs and delivering the right resource solutions to build a workplace for the future. CONNECT with us: www.itrecruit.co.za #ResourceSolutions #talentacquisition #ITskills #itrecruitment #ITplacements #ITJobs

Receive a daily digest of all new jobs matching this job. Your information is safe with us and you can cancel any time.

Expires in 28 days

Email me jobs similar to: Reliability Engineer (Expert) 3336

Receive a daily digest of all new jobs matching this job: Senior IT Auditor. Your information is safe with us and you can cancel at any time.