Cloud Management Engineer (SP6) - Group Information Technology: Chief Technology Office
Position summary
Introduction
Job description
TRAVEL
This position requires regular travel to each of the countries where Capricorn Group has offices and/or subsidiaries, for onsite support and implementations as well as for training purposes.
All of the following tasks apply to Capricorn Group and its subsidiaries:
Daily System Check: The role requires daily health checks of the IT infrastructure environment, which is made up of hardware- and software-related components. All checks will be signed off to comply with banking audit requirements.
Performance and Availability: The role will provide inputs to facilitate consistent system performance and uptime of all virtual, cloud and containerized environments.
Problem Management: The role will be required to troubleshoot in support of the broader problem management process, including proactive problem identification and root cause analysis for all problems within the IT infrastructure environment.
Incident Management: The role will be responsible for top-tier incident escalations and remediation for incidents where a high degree of expertise with virtualization, storage and containerized technologies is required.
Change Management: The role will be responsible for implementing changes in the production environment, in coordination with the Lead Cloud Management Engineer and in accordance with change management procedures, including assessing risk and determining test and fail-back plans.
Monitoring: Daily use and maintenance of monitoring tools including hardware and operating system monitoring.
Automation: The role will develop, maintain and improve scripts and tools used to automate repeatable tasks within all areas of responsibility.
System Deployment and Maintenance: The role will be responsible for building and deploying new server infrastructure, software and customer environments into datacenters, as well as analyzing and implementing new deployment practices. The role will also be responsible for assisting with regularly scheduled maintenance activities, including patching and system optimization of the IT infrastructure.
Support: The role will provide support on virtualization, cloud and containerized environments.
Disaster Recovery: The role will be responsible for maintaining and conducting disaster recovery scenarios within the virtual and containerized environments, in accordance with business procedures based on industry standards.
Backups: The role will be responsible for assisting the SRE teams with the implementation, monitoring and maintenance of all backup systems within the IT infrastructure environment.
Research: Investigate new technologies to ensure that the Group stays in line with virtualization, containerization and cloud technology trends.
Documentation: The role requires documenting the entire virtual, containerized and cloud environments, and keeping that documentation up to date, to comply with the required audit, ISO and PCI-DSS standards.
Security & Risk: The role requires applying and implementing strict security standards on all systems within the virtualization, containerized and cloud environments. It also requires reporting and/or escalating any risks related to those technologies that will or may result in an outage and possible financial loss to the Group.
CORE COMPETENCIES
- Reliable and Responsible – Consistently dependable and accountable for tasks and outcomes
- Respectful – Treats others with dignity and professionalism
- Proactive and Self-Initiating – Takes initiative without needing direction
- Integrity and Ethics – Upholds honesty and ethical standards
- Punctual and Disciplined – Manages time effectively and adheres to schedules
- Collaborative – Works well with others to achieve shared goals
- Team Player – Supports team dynamics and contributes positively
- Effective Communicator – Demonstrates strong oral and written communication skills
- Professional – Maintains high standards of conduct and presentation
- Competent – Applies knowledge and skills effectively in role
- Logical and Analytical – Thinks critically and solves problems with reason
- Planning and Organizational Skills – Structures tasks and priorities efficiently
- Process-Oriented – Follows and improves systems for consistent results
Minimum requirements
EXPERIENCE/SKILLS & KNOWLEDGE
Required Skills:
- 5 years' hands-on experience as a Systems Administrator (or equivalent role)
- 3 years' hands-on experience in a virtual server environment
- Possess strong knowledge of Microsoft Solutions including Windows, Active Directory, Exchange and related technologies
- Possess strong knowledge of Virtualized environments
- Possess strong knowledge of Linux server administration and related software
- Possess knowledge of application containerization and orchestration
- Possess knowledge of capacity planning within a virtualized environment
- Possess scripting skills for virtualization and Linux systems
- Knowledge of best practices for virtualized networking and a thorough understanding of physical networking
- Analytical and problem-solving skills
- Possess excellent written and oral communication skills
- Demonstrated ability to work independently while also providing mentoring and guidance to a team in a rapidly growing and dynamic environment
Preferred Skills and Qualifications:
- Relevant certifications in virtualization technologies
- Experience with Linux System Administration
- Experience with application containerization and orchestration technologies
- Experience with disaster recovery technologies
- Experience with Backup & Recovery
- Experience with Storage Area Networks (SAN)
