Software Development Manager, AI Inference Technology, Neuron SDK

Seattle, Washington | Remote-Friendly | $166,400 - $287,700

We're working with Annapurna Labs (part of Amazon) on this exciting opportunity.

Join a pioneering team at Annapurna Labs, an AWS company, dedicated to optimizing cutting-edge AI inference technology for cloud-scale machine learning accelerators like Trainium and Inferentia. This role offers the chance to lead expert AI engineers in delivering fundamental inference building blocks and libraries, directly impacting the performance of large language models for global customers. You'll be at the forefront of innovation, navigating dynamic priorities and shaping the future of AI inference.

Key Responsibilities

  • Guide AI engineers to build fundamental inference technology building blocks and libraries.
  • Optimize LLMs such as Llama and GPT OSS to run efficiently on Trainium and Inferentia devices.
  • Develop and optimize attention kernels and deliver them in the Neuronx_Distributed Inference Libraries.
  • Define the building blocks for the latest LLMs in collaboration with senior management and technical leaders.
  • Manage changing priorities as new models and technologies emerge, adapting the team's work accordingly.
  • Dive deep to help the team solve complex technical challenges.

What You'll Need

  • 3+ years of engineering team management experience.
  • Established background in optimizing LLMs.
  • Experience delivering high-performance models using distributed inference libraries.
  • Ability to manage demanding, fast-changing priorities.
  • Strong technical ability to understand and deliver within a vertically integrated system stack.
  • Proficiency with the PyTorch inference library and the Neuron compiler, runtime, and collectives.

Apply via Haystack today!

About Haystack

Haystack combines AI and expert vetting to deliver world-class tech candidates who are engaged, aligned, and ready to interview. We're trusted by over 100,000 UK-based techies working in Software Engineering, Data, Design, DevOps, Cloud, Tech Management, Testing, Product & Delivery, Architecture and more. Hundreds of employers, from startups and scale-ups like Atom Bank, DuckDuckGo and Goodlord to established enterprises like American Express, Dunelm and AWS, use Haystack to connect with qualified tech talent that they can't find anywhere else.

Please let Haystack know you found this job on ManagerTrack. This helps us grow!

About the job

  • Apply before: Jan 24, 2026
  • Posted on: Dec 25, 2025
  • Job Type: Full-time
