About the position
Storage Engineer
The HCI and Storage Engineer will be primarily responsible for installing, monitoring, testing and
maintaining both cloud and on-premises infrastructure solutions, with a specific emphasis on Hyper-
Converged Infrastructure (HCI), VxRail, UNITY Storage and VNX(e) Hardware and software
This role will involve providing specialised technical support and guidance through high-level analysis,
diagnosis and problem-solving, ensuring that the HCI components of the infrastructure are optimised for
efficient operations.
In addition to HCI expertise, the HCI and Storage Engineer will play a crucial role in qualifying the product
fit, overseeing the installation and implementing IT systems to meet client requirements. Your deep
understanding of Hyper-Converged Infrastructure and Storage will enable you to design, deploy and
manage robust and scalable solutions that align with industry best practices and standards.
Overall, this position requires a combination of technical proficiency in HCI and Storage infrastructure to
deliver comprehensive and cutting-edge storage and Hyper converged infrastructure solutions for the
company clients.
Minimum Desired Qualifications
- Matric Senior Certificate
- VMWare Certified Professional – Data Center Virtualisation 2024
- Huawei HCIA Storage
- Huawei HCIP Storage
- PowerEdge Operate
- PowerEdge Foundations V2
- Technology Architect, Midrange Storage Solutions
Minimum Desired Experience
- 3 – 5 Years’ Experience and professional knowledge in HCI, Storage, storage management tools, HCI Management Tools
Minimum Desired Competencies
- Strong verbal and written communication skills with ability to coordinate work activities with remote teams
- Strong analytical and conceptualization skills (e.g., the ability to "join the dots", and communicate this at multiple levels)
- Working effectively within local and distributed teams
- Excellent planning, interpersonal, problem solving, leadership skills are required
List Of Duties and Responsibilities:
- Responsible for installations, maintenance, corrective maintenance, monitoring the environment, testing, temporary resolutions managing tickets from end-to-end.
- May be required to travel to the associated provinces when required.
- Attend meetings with the client and other stakeholders.
- Responsible for drawings, documentation, change requests, quality control and planning prior to the commencement of any works.
- To be primarily utilized within the operational environment and may be used within the project environment
- Will be required to be on Standby
Support Services
- Support the in-scope operating system, system management software and operating system utilities, including minor upgrades (such as a release upgrade)
- Manage the operating system configuration including initial configuration, modifying configuration files, system configuration documentation and access to system configuration files.
- Monitor and reduce operating system log files to prevent file systems from overfilling.
- Manage Operating System Processes (e.g., investigate continuously running system subtasks, or daemons) including refreshing processes as required, establishing start-up sequences, maintaining system clock synchronization, and changing process priorities as appropriate.
- Recommend operating system updates and configuration modification to the client’s IT Infrastructure Engineer, as required.
- Apply operating system patch updates, as required.
- Maintain tools for remote management and alert monitoring.
- Maintain operational support procedures.
- Maintain the hardware and software configuration information.
- Evaluate planned changes to the server environment and advise of any requirements to support such changes.
- Adhere to standard security processes and procedures.
- Provide health check and trending reports that include best practices as prescribed by the OEM.
Performance Management
- Manage incidents, problems, changes and other service requests pertaining to hardware, software and monitoring.
- Manage thresholds and alerts for usage of IT resources.
- Analyze performance service level breaches, alerts, trends and root causes to restore service.
- Track and tune proactively performance through trend and exception reporting to avoid possible service level breaches.
- Tune reactively to restore service for performance incidents and root causes.
- Provide corrective action to resolve system performance problems and provide recommendations to prevent possible future incidents.
- Recommend changes to maintain agreed upon system performance levels.
- Implement after hours’ changes as approved through a formal change management process.
- Define performance related metrics and data collection, summarization, and usage requirements.
- Collect, summarize and store performance data (Standard Performance Data Management).
- Define performance alert thresholds to support agreed upon service levels.
- Provide Standard Performance Reporting.
- Provide Ad hoc Performance Reporting for analysis of incidents to restore service.
Desired Skills:
- HCI
- Huawei HCIA Storage
- Huawei HCIP Storage