Back to all roles

Member of Technical Staff, Inference (Bay Area, Remote)

Remote-first Full-time Now hiring

What You’ll Do Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation) Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks What You’ll Bring Deep experience in distributed systems, ML infrastructure, or high-performance serving (8+ years) Production-grade expertise in Python, with strong background in systems languages (C++/Rust/Go) Low-level performance mastery: CUDA, Triton, kernel optimization, quantization, memory and compute scheduling Proven track record scaling inference workloads in both throughput-oriented cluster environments and latency-critical on-device deployments System-level mindset with a history of tuning hardware–software interactions for maximum efficiency, throughput, and responsiveness Apply To This Job

More remote roles

Member of Technical Staff, Training (Bay Area, Remote)

Remote-first Full-time

Marketing Analyst (Attribution Focus) (Promova)

Remote-first Full-time

Student and Family Experience Manager (Immediate Opening)

Remote-first Full-time

Customer Sales Representative (remote work)

Remote-first Full-time

Account Manager Industrial Markets Region: France - Africa

Remote-first Full-time

VP of Engineering

Remote-first Full-time

Member of Technical Staff, Foundation Models (Bay Area)

Remote-first Full-time

Member of Technical Staff, Data Agent (Bay Area, Remote)

Remote-first Full-time

Member of Technical Staff, Platform (Bay Area, Remote)

Remote-first Full-time

Account Manager Industrial Markets Region: Europe - Middle Eas

Remote-first Full-time

Customer Service Representative (Bilingual Spanish Required)

Remote-first Full-time

Accounts Receivable Manager - Cigna HealthCare - Remote

Remote-first Full-time

Experienced Customer Service Representative – Remote Opportunity with Comprehensive Benefits and Career Growth

Remote-first Full-time

Telehealth BCBA: Weekends!

Remote-first Full-time

Zachary Piper Solutions – Health IT Business Analyst (100% Remote) – Virginia

Remote-first Full-time

Dir, Dental & Vision Pricing

Remote-first Full-time

Immediate Hiring: Make Money Online Part-Time Data Entry Jobs for arenaflex

Remote-first Full-time

[Remote] Project Manager

Remote-first Full-time

Chat Support Remote Agent (Part Time, Entry Level)

Remote-first Full-time

Remote Referral Coordinator (OPO & or transplant center experience is preferred) - Ideal Candidate Locations: Florida, Georgia, Virginia, or Texas

Remote-first Full-time