Know ATS Score
CV/Résumé Score
  • Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role: Platform Support Engineer.
United Arab Emirates Jobs Expertini

Urgent! Platform Support Engineer Job Opening In Abu Dhabi Emirate – Now Hiring Open Innovation AI

Platform Support Engineer



Job description

Company Description

Open Innovation AI is a global technology company that specializes in developing advanced solutions for managing AI workloads.

Its flagship product, the Open Innovation Cluster Manager (OICM), orchestrates complex AI tasks efficiently across diverse infrastructures.

The platform is hardware‑agnostic, optimized for various GPUs and accelerators hardware, and facilitates seamless integration and scalability for enterprise AI applications.

Open Innovation AI focuses on optimizing and simplifying AI workload management and making AI technologies accessible to organizations of all sizes.

With its innovative solutions, companies can reduce operational costs, accelerate time to value, and maximize their return on investment, ensuring that their AI strategies contribute directly to enhanced business outcomes.

Role Overview

The Platform Support Engineer (L1) – On‑Site is responsible for 24×7 shift‑based operational monitoring and first‑line support of the Open Innovation AI platform hosted in a secure, air‑gapped data centre environment.

The role ensures the stability and availability of platform services by executing defined operational procedures, performing daily system checks, and responding to alerts in accordance with runbooks and incident management processes, as well as customer support with platform operations.

Working as part of the on‑site support shift team, the engineer monitors platform health across GPU infrastructure, virtualization, and network components, performs basic administrative tasks, and escalates complex issues.

This position requires strict adherence to security controls and operational discipline within a fully isolated environment.

Responsibilities

  • Operate as part of a 24×7 on‑site shift rotation, ensuring continuous monitoring and operational support for all platform components within the secure, air‑gapped environment.

  • Monitor dashboards, alerts, and logs across OICM, GPU infrastructure, virtualization (VMware/Kubernetes) , and network layers to detect and respond to incidents in real time.

  • Execute operational runbooks and standard procedures for user management, resource allocation, and daily system health checks.

  • Perform first‑level troubleshooting and incident triage, document findings, and escalated unresolved issues to L2 or engineering teams according to defined SLAs.
  • Maintain detailed shift logs, incident records, and handover notes to ensure seamless continuity between shifts.

  • Support scheduled maintenance activities including patching, configuration updates, and version upgrades under supervision from senior engineers.

  • Ensure strict compliance with air‑gap security protocols, including restrictions on connectivity, removable media, and external communications.

  • Participate in daily and weekly operational briefings with the Support Team Lead, contributing to performance reporting and service improvement actions.

  • Report recurring issues or anomalies and assist in refining operational documentation and runbooks.

Qualifications

  • 2–5 years of experience in IT operations, data‑centre support, or platform/system monitoring within enterprise or critical environments.

  • Working knowledge of Linux operating systems, with ability to perform basic administrative and diagnostic tasks.

  • Demonstrated troubleshooting capabilities in identifying, isolating, and resolving system or service issues within defined procedures.

  • Understanding of networking, virtualization and container orchestration technologies (VMware, Kubernetes, or similar).

  • Familiarity with monitoring and ticketing tools such as Grafana, Zabbix, ServiceNow, or Jira.

  • Strong attention to detail, reliability, and discipline in following defined procedures and documentation standards.

  • Ability to work effectively in rotational 24×7 shift schedules, including nights, weekends, and public holidays.

  • Good English communication and documentation skills.

  • (Preferred) Basic scripting knowledge (Bash, PowerShell, Python), exposure to GPU or high‑performance computing systems, and ITIL Foundation certification.

Reporting To

Senior Manager – Technical Operations

Seniority Level

Associate

Employment Type

Full‑time

Job Function

Product Management, Information Technology, and Consulting

Industries

IT Services and IT Consulting, Technology, Information and Media, and Software Development


#J-18808-Ljbffr


Required Skill Profession

Other General



Your Complete Job Search Toolkit

✨ Smart • Intelligent • Private • Secure

Start Using Our Tools

Join thousands of professionals who've advanced their careers with our platform

Rate or Report This Job
If you feel this job is inaccurate or spam kindly report to us using below form.
Please Note: This is NOT a job application form.


    Unlock Your Platform Support Potential: Insight & Career Growth Guide