- Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role: AI Infrastructure Engineer Opus.
Urgent! AI Infrastructure Engineer- Opus Job Opening In أبوظبي – Now Hiring AppliedAI
As an
Opus AI Infrastructure Engineer
, you will lead the optimization and scaling of AI pipelines that serve foundational models in live production environments.
You will focus on evolving real-time and batch inference systems for reliability, low latency, and seamless integration with product logic.
This senior engineering role operates at the core of AI delivery, requiring strong system design, infrastructure fluency, and a deep commitment to performance and operational excellence.
You will work across modern cloud environments and manage a diverse and evolving portfolio of LLMs, both proprietary and open-source.
You will play a key role in evaluating model trade-offs, adapting to rapid model iteration, and ensuring smooth transitions as providers update APIs, capabilities, and service tiers.
You will also coordinate directly with foundational model vendors to align roadmap requirements, performance issues, and deployment optimizations.
Key Responsibilities
AI Serving Pipeline Optimization
* Design, rewrite, and mature inference pipelines for real-time, streaming, and batch workloads
* Optimize throughput, latency, and reliability via architectural evolution and model-specific strategies
* Manage orchestration of heterogeneous LLMs with varying performance, cost, and response profiles
* Implement fallback logic, request routing, and intelligent retry systems for availability and graceful degradation
* Build tooling for profiling and benchmarking pipelines involving LLMs and agentic orchestration frameworks
* Adapt infrastructure and integrations to support rapidly changing LLM APIs, model versions, and provider behavior
* Design and deploy self-hosted LLM inference pipelines, including model loading, quantization, batching, and runtime optimization on GPU/TPU environments
Production Infrastructure & Runtime Efficiency
* Own the live AI execution layer: coordinate model calls, resource scheduling, and latency-critical paths
* Monitor and improve key metrics: latency, token throughput, error rates, and autoscaling responsiveness
* Deploy and scale LLM services across cloud environments (AWS, GCP, Azure, on-prem), optimizing for regional availability and regulatory constraints
* Ensure robust observability, failover, rollback, and health monitoring across all deployed models
* Collaborate with infra teams to maximize compute efficiency across CPU/GPU/TPU backends
Model Vendor Coordination & External Integrations
* Serve as a technical counterpart to foundational model providers, communicating product needs, debugging issues, and tracking performance updates
* Maintain high reliability across provider transitions, including model deprecations, quota shifts, and new capability rollouts
* Evaluate and experiment with emerging models across different providers, providing comparative benchmarks and integration plans
System Integration & Engineering Excellence
* Integrate pipelines cleanly with APIs, orchestration layers, and application logic
* Refactor legacy systems for modularity, observability, and performance
* Promote reusable, maintainable infrastructure via tooling and shared abstractions
* Uphold engineering standards through code reviews, performance audits, and technical mentorship
Qualifications
Education
* Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field
Experience
* 5+ years in backend, ML, or infrastructure engineering with a focus on live AI systems
* Demonstrated experience building and scaling real-time inference infrastructure
* Proven track record in latency optimization, fault tolerance, and production observability
Skills
* Proficient in Python (optionally Go or Rust); strong software design and debugging skills
* Experience with orchestration and serving tools
* Deep familiarity with containerization, Kubernetes, and cloud-native deployment (EKS, GKE, etc.)
* Hands-on with observability stacks (Prometheus, Grafana, etc.)
* Understanding of inference-level optimizations: batching, quantization, caching, and sharding
* Operational experience with LLMs (OpenAI, Anthropic, open-weight models) in both hosted and self-managed setups
* Experience building and maintaining self-hosted inference stacks using frameworks such as vLLM, HuggingFace Transformers, or DeepSpeed-Inference
* Familiarity with agentic AI systems and tooling (LangGraph, Semantic Kernel, CrewAI)
* Cross-cloud deployment experience (AWS, GCP, Azure) and awareness of compliance/latency trade-offs
* Comfortable managing technical communication with external vendors and adapting to fast-moving dependencies
✨ Smart • Intelligent • Private • Secure
Practice for Any Interview Q&A (AI Enabled)
Predict interview Q&A (AI Supported)
Mock interview trainer (AI Supported)
Ace behavioral interviews (AI Powered)
Record interview questions (Confidential)
Master your interviews
Track your answers (Confidential)
Schedule your applications (Confidential)
Create perfect cover letters (AI Supported)
Analyze your resume (NLP Supported)
ATS compatibility check (AI Supported)
Optimize your applications (AI Supported)
O*NET Supported
O*NET Supported
O*NET Supported
O*NET Supported
O*NET Supported
European Union Recommended
Institution Recommended
Institution Recommended
Researcher Recommended
IT Savvy Recommended
Trades Recommended
O*NET Supported
Artist Recommended
Researchers Recommended
Create your account
Access your account
Create your professional profile
Preview your profile
Your saved opportunities
Reviews you've given
Companies you follow
Discover employers
O*NET Supported
Common questions answered
Help for job seekers
How matching works
Customized job suggestions
Fast application process
Manage alert settings
Understanding alerts
How we match resumes
Professional branding guide
Increase your visibility
Get verified status
Learn about our AI
How ATS ranks you
AI-powered matching
Join thousands of professionals who've advanced their careers with our platform
Unlock Your AI Infrastructure Potential: Insight & Career Growth Guide
Real-time AI Infrastructure Jobs Trends in أبوظبي, United Arab Emirates (Graphical Representation)
Explore profound insights with Expertini's real-time, in-depth analysis, showcased through the graph below. This graph displays the job market trends for AI Infrastructure in أبوظبي, United Arab Emirates using a bar chart to represent the number of jobs available and a trend line to illustrate the trend over time. Specifically, the graph shows 3210 jobs in United Arab Emirates and 181 jobs in أبوظبي. This comprehensive analysis highlights market share and opportunities for professionals in AI Infrastructure roles. These dynamic trends provide a better understanding of the job market landscape in these regions.
Great news! AppliedAI is currently hiring and seeking a AI Infrastructure Engineer Opus to join their team. Feel free to download the job details.
Wait no longer! Are you also interested in exploring similar jobs? Search now: AI Infrastructure Engineer Opus Jobs أبوظبي.
An organization's rules and standards set how people should be treated in the office and how different situations should be handled. The work culture at AppliedAI adheres to the cultural norms as outlined by Expertini.
The fundamental ethical values are:The average salary range for a AI Infrastructure Engineer Opus Jobs United Arab Emirates varies, but the pay scale is rated "Standard" in أبوظبي. Salary levels may vary depending on your industry, experience, and skills. It's essential to research and negotiate effectively. We advise reading the full job specification before proceeding with the application to understand the salary package.
Key qualifications for AI Infrastructure Engineer Opus typically include Other General and a list of qualifications and expertise as mentioned in the job specification. Be sure to check the specific job listing for detailed requirements and qualifications.
To improve your chances of getting hired for AI Infrastructure Engineer Opus, consider enhancing your skills. Check your CV/Résumé Score with our free Resume Scoring Tool. We have an in-built Resume Scoring tool that gives you the matching score for each job based on your CV/Résumé once it is uploaded. This can help you align your CV/Résumé according to the job requirements and enhance your skills if needed.
Here are some tips to help you prepare for and ace your job interview:
Before the Interview:To prepare for your AI Infrastructure Engineer Opus interview at AppliedAI, research the company, understand the job requirements, and practice common interview questions.
Highlight your leadership skills, achievements, and strategic thinking abilities. Be prepared to discuss your experience with HR, including your approach to meeting targets as a team player. Additionally, review the AppliedAI's products or services and be prepared to discuss how you can contribute to their success.
By following these tips, you can increase your chances of making a positive impression and landing the job!
Setting up job alerts for AI Infrastructure Engineer Opus is easy with United Arab Emirates Jobs Expertini. Simply visit our job alerts page here, enter your preferred job title and location, and choose how often you want to receive notifications. You'll get the latest job openings sent directly to your email for FREE!