Job Description
The role involves ensuring that the following duties are performed with quality:
- Work with business units and understand business flow of each application and their interdependencies and develop a DR plan for all business applications.
- Develop recovery priorities, timelines, and strategy for proper sequence of recovery components.
- Develop and understand all testing necessary for a successful DR execution.
- Configure/update infrastructure components, objects, and respective business functions, groups, and various replication.
- Configure monitoring, alerts, and reports as per RPO/RTO/SLA defined by the business.
- Configure failover/failback and switch over workflows for applications and update existing workflows and verify their functionality with runbooks.
- Coordinate with the business team and ensure runbooks are updated and reflect the current infrastructure.
- Lead planned DR Drill exercises and crisis management and provide an RPA report of the drills to the BCM team and external auditors.
- Coordinate and manage licenses for infrastructure components with the vendor and reclaim licenses for decommissioned and infrastructure components not in use.
- Onboard critical applications in the CP tool.
General Responsibilities
- Ensure that the ITSM/Service Now tool is used with quality regarding incidents and service requests.
- Ensure that all systems are secured and configured as per defined standards and policies.
- Able to troubleshoot execution issues.
- Verify connectivity to all infrastructure components and coordinate with application/infrastructure teams to ensure connectivity and synchronization among sites.
- Conduct regular meetings with application owners and review workflows with the team.
- Review all alerts in the CP tool and notify application alerts with the respective team.
- Ensure SLA/RPO and RTO of all infrastructure components are met.
- Participate in business continuity meetings and assist application teams in following best practices.
- Provide periodic status reports of infrastructure components to reflect the current conditions and support for the recovery objectives set by the organization.
Skills and Competencies
- Expertise in disaster recovery management in Perpetuuiti continuity patrol.
- Good knowledge of change management frameworks such as ITIL.
- Good knowledge of network infrastructure.
- Good knowledge of storage replication.
- Good knowledge of databases.
- Good leadership skills, including people management, selection, and development skills.
- Very good analytical, planning, forecasting, execution, and problem-solving skills.
- Flexible and able to work under pressure.
- Respect and promote trust and confidentiality.
- Results-oriented while ensuring high quality of work and able to think out of the box.
- Strong level of customer service orientation and professionalism in all interactions.
- Able to manage a multi-cultural environment and promote teamwork and knowledge sharing to achieve goals and deliverables.
#J-18808-Ljbffr