T

Software Engineer, Distributed Systems

The San Francisco Compute Company
Full-time
On-site
San Francisco, California, United States

About

We're the San Francisco Compute Company, and we're building the first real-time trading platform for compute resources. Over the next decade, we anticipate thousands of startups and labs will be training and serving large models. These organizations need substantial computing power, and we're creating a platform where that compute can be traded efficiently. Our success will enable companies to scale to tens of thousands of accelerators for hours at a time without building their own infrastructure. This breakthrough will dramatically expand access to large model training, making the most transformative technology of our era available to a far wider range of organizations.

The Role

As a distributed systems software engineer, you’ll be working on our in-house resource orchestration system. This system coordinates state and access to hundreds (soon thousands) of GPU compute nodes in multi-tenant clusters spanning across multiple data centers. Some responsibilities of the role include:

  • Design of distributed system architectures that enable high availability fault tolerant state management

  • Deployment automation and performance optimization of virtual machines running on bare metal that utilize GPU passthrough

  • Design and deployment of multi-tier high performance network attached storage systems

About You

  • You have built fault tolerant distributed systems before that can manage hardware resources at scale

  • You enjoy creating self-correcting systems that contribute to hardware health and reliability

  • You have experience with Linux virtualization (Cloud Hypervisor, QEMU, libvirt, virtiofs, sr-iov, PCIe passthrough)

  • You appreciate and value good documentation

Some Nice to Haves

  • Experience with Rust (our VM orchestrator is written in Rust)

  • Experience with etcd

  • Experience with high performance storage systems (WEKA, VAST, Ceph, etc.)

Benefits

Unlimited office book budget

You can buy as many books for the office as you want. You’re encouraged to spend time during the workday reading!

Generous equity grant

Team members are offered a competitive salary along with equity in the company

Retirement matching

We match 401(k) plans up to 4%

Medical, dental & vision

We offer competitive medical, dental, vision insurance for employees and dependents and cover 100% of premiums

Time off

We offer unlimited paid time off as well as 10+ observed holidays

Parental leave

We offer biological, adoptive, and foster parents paid time off to spend quality time with family

Daily lunch

We cover lunch daily for employees

Visa Sponsorships

Yes, we sponsor visas and work permits

The San Francisco Compute Company is committed to maintaining a workplace free from discrimination and harassment.

We make employment decisions based on business needs, job requirements, and individual qualifications, without regard to race, color, religion, belief, national origin, social or ethical origin, age, physical, mental, or sensory disability, sexual orientation, gender identity or expression, marital status, civil union or domestic partnership status, past or present military service, HIV status, family medical history or genetic information, family or parental status including pregnancy, or any other status protected by law.

We welcome the opportunity to consider qualified applicants with prior arrest or conviction records. Our commitment to diversity includes hiring talented individuals regardless of their criminal history, in accordance with local, state, and federal laws, including San Francisco’s Fair Chance Ordinance and California’s ban-the-box laws.

If you require reasonable accommodation for any reason, please reach out to us at team@sfcompute.com.