Research Scientist - Model Evaluation Job at Lumicity, Santa Rosa, CA

dE91M3ZjNWxza3BUMG52d2d6U2xDdVR2NVE9PQ==
  • Lumicity
  • Santa Rosa, CA

Job Description

AI Benchmarking & Evaluation Engineer

Join a team at the forefront of AI model evaluation, setting the standard for how large language models are tested and validated. In this role, you'll assess the latest AI models, design new benchmarks, and develop advanced evaluation methodologies. You'll work closely with engineers, AI researchers, and enterprise clients to ensure cutting-edge AI systems meet the highest standards. This role is a bridge between research and practical implementation and will suit someone who enjoys taking academic papers and creating working models.

Key Responsibilities:

  • Analyze and benchmark newly released AI models (DeepSeek, Gemini, etc.)
  • Develop and implement novel evaluation frameworks
  • Build datasets, manage labeling processes, and publish findings
  • Enhance automated evaluation techniques for AI-generated content
  • Collaborate with top AI labs and enterprise partners to refine best practices

Who You Are:

  • MSc or PhD from leading Computer Science or Machine Learning school
  • At least 3 years of experience in applied AI, with a focus on benchmarking or model evaluation
  • Strong background in designing evaluation methodologies
  • Passion for advancing AI assessment standards
  • Solid Python, PyTorch/TensorFlow and Django

Make a real impact in AI research and development—apply today!

Job Tags

Similar Jobs

URBANSPACE Management

Brand Ambassador Job at URBANSPACE Management

Job Title: Brand Ambassador (Seasonal, Part-Time) Location: New York City (Urbanspace Union Square Holiday Market) Dates: November 10 ...  ...available to work between November 10 and December 24, including weekends and some evenings Details: Position type: Part-time,... 

Dexian

Receptionist 2 Job at Dexian

Title: Receptionist 2 Duration: 3 months (contract to Hire) Location: Huntsville, Alabama, 35806 Pay Rate: $16.00 - $17.75 hourly Shift: Monday - Friday , 7:30am - 4:00pm Job Description - Job Summary : The primary responsibilities of this position...

NBCUniversal

Production/ Broadcast Engineer - NBC Sports Job at NBCUniversal

 ...global theme park destinations, consumer products, and experiences. We own and operate leading entertainment and news brands, including NBC, NBC News, NBC Sports, Telemundo, NBC Local Stations, Bravo, and Peacock, our premium ad-supported streaming service. We produce and... 

Hunting Lebanese

Warehouse Stock Keeper Job at Hunting Lebanese

Job DescriptionHandles inventory physical count of raw material and finished productsData entry of stockRequirements:2 years of relevant experience in stock controlBachelor's degree is a plusComputer literateGood English communicationHard worker... 

Energy Efficient Replacements LLC

Exterior Sales Professional Job at Energy Efficient Replacements LLC

 ...include engaging with potential clients, conducting in-person consultations, providing expert advice on energy-efficient exterior upgrades...  ...motivated work ethic, and manage time effectively in both office and remote settings. Reliable transportation and a valid driver's...