Machine Learning - Model Serving Job at Alexander Chapman, San Francisco, CA

  • Alexander Chapman
  • San Francisco, CA

Job Description

We are working with a company building intuitive, voice-first AI systems that blend natural interaction with powerful model performance. Founded by leaders from Meta, Oculus, and Google, they’re creating a new class of consumer devices powered by speech, vision, and LLMs.

The Role

You’ll help optimize and scale the inference stack, working across model serving, performance tuning, and deployment to support real-time, multimodal AI.

What You’ll Do

  • Improve serving systems for LLMs, speech, and vision models.
  • Optimize throughput, latency, and cost using advanced techniques like batching, caching, and kernel tuning.
  • Extend frameworks like vLLM or SGLang to push the limits of performance.
  • Collaborate with training teams to deploy faster, lighter models.
  • Experiment with compilers and hardware backends to boost efficiency.

What We’re Looking For

  • Strong experience with PyTorch or similar ML frameworks.
  • Deep knowledge of model serving and systems performance.
  • Skilled in low-level debugging, bottleneck analysis, and server optimization.
  • Familiar with vLLM, Ray, or deploying inference workloads at scale.
  • Comfortable owning complex infrastructure projects end to end.
  • Background in computer science or a related field from a top-tier university (e.g. Stanford, MIT, Ivy League).
  • Experience at a top tech company (e.g. FAANG) or a successful, high-growth startup.

They’re looking for curious, impact-driven engineers ready to push what’s possible with real-time AI.
