We're the San Francisco Compute Company, and we're building the first real-time trading platform for compute resources. Over the next decade, we anticipate thousands of startups and labs will be training and serving large models. These organizations need substantial computing power, and we're creating a platform where that compute can be traded efficiently. Our success will enable companies to scale to tens of thousands of accelerators for hours at a time without building their own infrastructure. This breakthrough will dramatically expand access to large model training, making the most transformative technology of our era available to a far wider range of organizations.
As a distributed systems software engineer, youβll be working on our in-house resource orchestration system. This system coordinates state and access to hundreds (soon thousands) of GPU compute nodes in multi-tenant clusters spanning across multiple data centers. Some responsibilities of the role include:
Design of distributed system architectures that enable high availability fault tolerant state management
Deployment automation and performance optimization of virtual machines running on bare metal that utilize GPU passthrough
Design and deployment of multi-tier high performance network attached storage systems
You have built fault tolerant distributed systems before that can manage hardware resources at scale
You enjoy creating self-correcting systems that contribute to hardware health and reliability
You have experience with Linux virtualization (Cloud Hypervisor, QEMU, libvirt, virtiofs, sr-iov, PCIe passthrough)
You appreciate and value good documentation
Experience with Rust (our VM orchestrator is written in Rust)
Experience with etcd
Experience with high performance storage systems (WEKA, VAST, Ceph, etc.)
You can buy as many books for the office as you want. Youβre encouraged to spend time during the workday reading!
Team members are offered a competitive salary along with equity in the company
We match 401(k) plans up to 4%
We offer competitive medical, dental, vision insurance for employees and dependents and cover 100% of premiums
We offer unlimited paid time off as well as 10+ observed holidays
We offer biological, adoptive, and foster parents paid time off to spend quality time with family
We cover lunch daily for employees
Yes, we sponsor visas and work permits
The San Francisco Compute Company is committed to maintaining a workplace free from discrimination and harassment.
We make employment decisions based on business needs, job requirements, and individual qualifications, without regard to race, color, religion, belief, national origin, social or ethical origin, age, physical, mental, or sensory disability, sexual orientation, gender identity or expression, marital status, civil union or domestic partnership status, past or present military service, HIV status, family medical history or genetic information, family or parental status including pregnancy, or any other status protected by law.
We welcome the opportunity to consider qualified applicants with prior arrest or conviction records. Our commitment to diversity includes hiring talented individuals regardless of their criminal history, in accordance with local, state, and federal laws, including San Franciscoβs Fair Chance Ordinance and Californiaβs ban-the-box laws.
If you require reasonable accommodation for any reason, please reach out to us at team@sfcompute.com.