About the Role
About the Role
Our team focuses on enabling custom models and dedicated inference on Together. We are responsible for building a container platform, optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience with great tooling. We often focus on video or audio generation across the stack: CUDA kernels, pytorch optimization, inference engines, contain
Company
Together AI →Job Details
- Location
- San Francisco
- Work Type
- On-site / Hybrid
- Posted
- 2 weeks ago