← all jobs

[Remote] Infrastructure Software Engineer, Fleet & Automation

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Nscale is a GPU cloud company focused on AI infrastructure, providing high-performance solutions for AI development. As an Infrastructure Software Engineer for Fleet & Automation, you will ensure the performance and scalability of AI and High-Performance Computing environments by building and maintaining automation and control systems.

Responsibilities

  • Perform technical architecture, roadmap and implementation for workflow automation systems, driving architecture decisions that balance automation complexity, reliability, and maintainability
  • Identify and resolve performance and scalability issues
  • Establish technology and product direction in collaboration with other tech leads, managers, and senior leadership
  • Own end-to-end delivery of device provisioning, validation, testing, and remediation workflows at scale
  • Design and build workflow orchestration systems for hardware lifecycle management, including GPU nodes and network switches
  • Partner with Infrastructure, Platform, and SRE teams to translate operational needs into robust, scalable automation
  • Establish engineering standards for reliability, observability, and operational excellence across all services
  • Help set up engineering best practices in collaboration with the broader engineering team
  • Build production-grade Python systems for hardware lifecycle automation, leveraging AI tools to accelerate delivery
  • Assess impact to team software stack from new hardware product programs and explore AI driven process improvement and automation
  • Collaborate with cross-functional teams (product, design, operations, infrastructure) to build efficient, interoperable, and maintainable automated systems

Skills

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 5+ years relevant experience building large-scale infrastructure applications or similar experience
  • Experience in utilizing languages such as C, C++, Java, and scripting languages such as Python for API design and unit testing techniques
  • Deep understanding of Linux operating systems, networking fundamentals (TCP/IP, BGP), and familiarity with configuration management tools (e.g., Ansible, Terraform)
  • Experience building, running and debugging large-scale infrastructure, stateful and stateless services for distributed systems or networks, and experience with compute technologies, storage, or hardware architecture
  • Experience integrating with infrastructure tooling such as: DCIMs, NetBox, OpenStack, bare metal APIs (MAAS, Ironic, IPMI)
  • Master's degree or PhD in Engineering, Computer Science, or a related technical field
  • Experience designing, analyzing and improving efficiency, scalability, and performance of various system resources
  • Direct experience with AI/HPC infrastructure, including NVIDIA GPUs, InfiniBand or high-speed Ethernet fabrics, and related management software (e.g., NCCL, SLURM)
  • Experience with advanced observability and monitoring systems (Prometheus, Grafana, OpenTelemetry) for complex, high-cardinality telemetry data
  • Familiarity with cloud-native technologies (Kubernetes, Docker) and infrastructure-as-code principles
  • Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)
  • Familiarity with SLOs/metrics measurement, logs/telemetry/metrics integration with tools for enhanced operator experience

Benefits

  • Medical
  • Dental
  • Vision
  • Flexible paid time off
  • Parental leave
  • Retirement plan participation

Company Overview

  • Nscale builds AI data centers and provides GPU cloud infrastructure that companies use to train, run, and scale large AI models. It was founded in 2024, and is headquartered in London, England, GBR, with a workforce of 201-500 employees. Its website is https://www.nscale.com.
  • More open positions

    [Remote] Principal Network Architect- AI Infrastructure

    Work from home Full-time role

    [Remote] Senior Product Manager – Professional Standards

    Work from home Full-time role

    [Remote] Product Operations Manager - Remote

    Work from home Full-time role

    [Remote] Revenue Operations Data Analyst

    Work from home Full-time role

    [Remote] Family & Lifestyle Focused Content Writer

    Work from home Full-time role

    Customer Retention Shop Executive – Tele-Sales and Customer Support Expert

    Work from home Full-time role

    Business Analysis Manager

    Work from home Full-time role

    Account Executive

    Work from home Full-time role

    Global Account Manager, Telco

    Work from home Full-time role

    Remote Business Growth Specialist – Inbound & Outbound Sales Development Representative (SDR) | Full-Time | Remote Opportunity with careerzynith

    Work from home Full-time role

    H-2A & H-2B Immigration Paralegal

    Work from home Full-time role

    Experienced Bilingual Customer Service Representative - Seasonal, Remote (Spanish / English)

    Work from home Full-time role

    Remote Delta Customer Care Specialist – Airline Support & Passenger Experience (Work from Home, U.S.)

    Work from home Full-time role

    Associate Email Marketing Manager

    Work from home Full-time role

    [Remote] Uber AI Solutions - Expert Freelancer (Australia)

    Work from home Full-time role

    Entry-Level Online Chat Support Specialist – Remote Customer Service Role at careerzynith – No Experience Required

    Work from home Full-time role

    Product Manager - Uprating, Plant Performance & Long Term Operations

    Work from home Full-time role

    Remote Part‑Time Data Entry Specialist – Precision‑Focused Data Management for careerzynith Global Logistics

    Work from home Full-time role

    TERMS OF REFERENCE - Brand Broker — IP Licensing

    Work from home Full-time role

    HRIS Analyst (Onsite position, remote work not available)

    Work from home Full-time role

    Remote Data Entry Specialist – Work From Home Opportunity with Comprehensive Training and Growth Path at careerzynith

    Work from home Full-time role