Research Scientist - Model Evaluation Job at Lumicity, Santa Rosa, CA

  • Lumicity
  • Santa Rosa, CA

Job Description

AI Benchmarking & Evaluation Engineer

Join a team at the forefront of AI model evaluation, setting the standard for how large language models are tested and validated. In this role, you'll assess the latest AI models, design new benchmarks, and develop advanced evaluation methodologies. You'll work closely with engineers, AI researchers, and enterprise clients to ensure cutting-edge AI systems meet the highest standards. This role is a bridge between research and practical implementation and will suit someone who enjoys taking academic papers and creating working models.

Key Responsibilities:

  • Analyze and benchmark newly released AI models (DeepSeek, Gemini, etc.)
  • Develop and implement novel evaluation frameworks
  • Build datasets, manage labeling processes, and publish findings
  • Enhance automated evaluation techniques for AI-generated content
  • Collaborate with top AI labs and enterprise partners to refine best practices

Who You Are:

  • MSc or PhD from a leading Computer Science or Machine Learning school
  • At least 3 years of experience in applied AI, with a focus on benchmarking or model evaluation
  • Strong background in designing evaluation methodologies
  • Passion for advancing AI assessment standards
  • Solid skills in Python, PyTorch/TensorFlow, and Django

Make a real impact in AI research and development. Apply today!
