Job Description
Note: This is a remote position open to candidates in the USA.

Sully.ai is a cutting-edge company formed by a team from OpenAI, DeepMind, NASA, and more, focused on transforming healthcare through advanced AI technologies. They are seeking a Research Engineer, Applied ML/AI to bridge research and scalable production systems, owning the training and inference toolchains while optimizing model performance in healthcare applications.

Responsibilities
• Own the full training, fine-tuning, and inference toolchains for Sully's applied ML stack.
• Translate research repos into production-ready services behind stable APIs.
• Ship multimodal features (text, audio, vision) that enhance agent performance.
• Optimize inference pipelines for cost, throughput, and latency.
• Build evaluation systems that integrate into CI/CD, blocking weak checkpoints.

Skills
• Strong engineering background with experience in distributed systems and large-scale model training/serving.
• Hands-on experience with multimodal ML (audio, vision, text).
• Production ML hygiene: versioning, metrics, observability, reproducibility.
• Proven track record of shipping ML systems into production.
• Experience with model optimization techniques (quantization, caching, pruning).
• Background in healthcare, medical AI, or other high-stakes regulated environments.
• Contributions to open-source ML frameworks or libraries.

Benefits
• Competitive Compensation
• Equity

Company Overview
• Sully.ai offers an AI Medical Assistant that automates clinical tasks and integrates with EHR systems. It was founded in 2023 and is headquartered in San Francisco, California, USA, with a workforce of 51-200 employees.