Job Description
Description
We are seeking a self-disciplined and experienced Data Scientist who can bridge the gap between data science and ML engineering, with a focus on using data-driven approaches to optimize LLM performance.

Responsibilities
• Design and implement data structures to improve LLM efficiency and performance
• Analyze large datasets to extract insights for model optimization
• Develop and maintain data pipelines for LLM training and evaluation
• Collaborate with ML engineers to implement data-driven improvements in model architectures
• Conduct experiments to validate hypotheses and quantify improvements
• Participate in team meetings and provide data-driven insights to guide decision-making

Requirements
• 5+ years of relevant experience in Data Science or ML-related roles
• Strong background in statistics, mathematics, and computer science
• Expertise in data structures and algorithms, particularly as applied to ML and LLMs
• Proficiency in Python
• Experience with data analysis libraries (e.g., Pandas, NumPy) and visualization tools (e.g., Matplotlib, Seaborn)
• Familiarity with deep learning frameworks (TensorFlow, PyTorch) and LLM technologies (Hugging Face, AWS Bedrock)
• Experience with version control systems (Git)
• Strong analytical and problem-solving skills
• Ability to work independently and collaboratively in a remote environment
• Excellent time management skills to meet project deadlines
• Authorized to work in the USA

Preferred Qualifications
• Advanced degree in Data Science, Computer Science, or related field
• Experience with vector databases and embedding techniques
• Knowledge of cloud computing platforms (e.g., AWS, GCP, Azure)
• Familiarity with MLOps practices and tools

Benefits
• Flexible schedule
• Competitive salary
• Stock options