
[Remote] Generative AI Inference Engineer

Work from home Full-time role Hiring

Note: This is a remote position open to candidates in the USA. Stability AI is seeking a passionate Generative AI Inference Engineer to join its Inference team, focusing on creative applications of generative AI models. The role involves leading the design and development of customer-facing multi-modal ML inference systems and optimizing inference techniques for generative models.

Responsibilities

  • Lead the design and development of customer-facing multi-modal ML inference systems
  • Work with the Platform and Inference teams to build inference systems for the next generation of models, covering areas such as optimization, model tuning, and deployment
  • Partner with leading cloud providers to deliver hosted Stability AI inference solutions
  • Be a strategic thought partner for leaders across the organization on driving business impact through machine learning
  • Be part of the team that brings new Stability models and pipelines into existence
  • Prototype and productionize inference platform improvements and new features

Skills

  • 7+ years productionizing machine learning systems, including inference pipeline development
  • Expert-level knowledge of writing and running Python services at scale
  • 5+ years working with the Python scientific stack, PyTorch, and at least one high-performance inference framework (e.g. Triton or TensorRT)
  • Deep understanding of diffusion architectures
  • Experience profiling and optimizing deep neural networks on NVIDIA GPUs, using profiling tools such as NVIDIA Nsight
  • Experience with Python-based image manipulation, encoding, and decoding frameworks, such as OpenCV
  • Experience deploying to cloud orchestration systems such as Kubernetes and cloud providers such as AWS, GCP, and Azure
  • Experience with Docker
  • Ability to rapidly prototype solutions and iterate on them with tight product deadlines
  • Strong communication, collaboration, and documentation skills
  • Experience with the open-source ML ecosystem (HuggingFace, W&B, etc.)

Company Overview

  • Stability AI is an artificial intelligence company focused on developing open-source generative AI models. It was founded in 2019 and is headquartered in London, England, with a workforce of 51-200 employees. Its website is https://stability.ai.