Job Description
- Role Description
- Design challenging, real-world STEM problems to evaluate model reasoning and problem-solving.
- Implement tasks using Python in an agentic development environment.
- Analyze model/agent behavior to diagnose reasoning gaps and improve performance.
- Develop reproducible and testable deliverables with clear specifications and deterministic tests.
- Collaborate with AI research teams to enhance model outputs and training data quality.
- Work independently and asynchronously to meet deadlines and project goals.
- Qualifications
- Deep expertise in data science, machine learning, finance, and/or Python-based coding.
- Current PhD student or recent PhD graduate (top-20 U.S.-based school).
- Strong research background in frontier STEM topics.
- Ability to engage reliably for 30+ hours/week, primarily on weekdays.
- Demonstrated technical output such as high-quality open-source contributions.
- Comfort reading and reasoning about agent behavior traces to diagnose failure modes.
- Requirements
- Familiarity with agentic frameworks and OSS ecosystems such as LangChain, MetaGPT, AutoGen, AutoGPT, CrewAI, LlamaIndex, BabyAGI, SuperAGI, CAMEL, AgentGPT, and Dify.
- Benefits
- Compensation: $50–$100/hour
- Commitment: 30+ hours/week
- Type: Contract
- Application Process
- Upload resume
- AI interview based on your resume
- Submit form
- Resources & Support
- For details about the interview process and platform information, please check: Interview Process
- For any help or support, reach out to:
- [email protected]
- PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
Apply to this job