AI Software Engineer (Platform Software)

🌍 Remote, USA πŸš€ Full-time πŸ• Posted Recently

Job Description

About the Job β€’ FuriosaAI is looking for passionate AI Software Engineers to join our Platform Team. You will participate in the research and development of models optimized for our NPU accelerator. β€’ Our team builds the production-grade, streamlined AI software that makes up our SDK. This includes the runtime, LLM serving framework, and PyTorch models/extensions. β€’ Your work on these critical parts of the SDK will directly enable AI developers to efficiently deploy optimized AI models on FuriosaAI NPUs. Responsibilities β€’ Develop and optimize DNN model implementations in PyTorch for FuriosaAI's Tensor Contraction Processor (TCP) architecture β€’ Analyze the features, implementations, CUDA and Triton kernels of existing AI model inference frameworks such as vLLM, TensorRT-LLM, and DeepSpeed-MII β€’ Research and implement generative AI models, parallelism strategies, and inference techniques to improve performance and efficiency β€’ Collaborate closely with the compiler team to optimize and enable models. Minimum Qualifications β€’ BS degree in Computer Science, Engineering, or a related field, or equivalent industry experience β€’ Proficiency in Python programming skill β€’ Experience in developing AI models in DNN frameworks (e.g., PyTorch) β€’ Solid understanding of machine learning, deep learning, natural language processing (NLP), and/or generative AI models β€’ Strong communication skills with the ability to collaborate effectively across cross-functional teams Preferred Qualifications β€’ Hands-on experience with PyTorch 2.0 technologies (e.g., TorchDynamo) or DNN compiler technologies, such as Triton and MLIR β€’ Proficiency in C++/CUDA or Rust programming skills β€’ Hands-on experience deploying and optimizing large-scale ML models in production β€’ Hands-on experience in model training and fine-turning of pre-trained models β€’ Experience in LLM inference frameworks: vLLM, TensorRT-LLM, and DeepSpeed-MII β€’ Strong background in model quantizations and model evaluations β€’ Strong background in machine learning, generative AI, and model evaluation techniques β€’ Proven track record of contributing to open-source projects Contact β€’ [email protected] Apply tot his job

Ready to Apply?

Don't miss out on this amazing opportunity!

πŸš€ Apply Now

Similar Jobs

Recent Jobs

You May Also Like