AI Benchmarking & Evaluation Engineer
Join a team at the forefront of AI model evaluation, setting the standard for how large language models are tested and validated. In this role, you'll assess the latest AI models, design new benchmarks, and develop advanced evaluation methodologies. You'll work closely with engineers, AI researchers, and enterprise clients to ensure cutting-edge AI systems meet the highest standards. This role is a bridge between research and practical implementation and will suit someone who enjoys taking academic papers and creating working models.
Key Responsibilities:
Who You Are:
Make a real impact in AI research and development—apply today!
...Job Title: Embedded Tester / Software Integration Test Engineer Location: Alameda, CA (Onsite Day 1) Experience: 45 Years Employment Type: Contract Work Authorization: USC / GC / GC EAD only Position Overview We are looking for an Embedded...
...FULL-SERVICE SHOPPER Start earning quickly with a flexible schedule Shopping with Instacart is more than grocery delivery. Shoppers help make our world go round. They make money, make moves, and make shopping lists come true. They make good time, make life easier,...
...They also offer additional services such as LEED Reporting and delivery of building materials. Service areas include Manhattan, Brooklyn... ...will be responsible for coordinating and dispatching drivers, managing schedules, tracking shipments, and ensuring timely delivery...
...Catering Delivery Drivers Needed! Earn an average of $36per delivery! Deliveries begin on October 13th!Catering deliveries will be completed within 10 miles of Yorktown. Live, dedicated driver support is available to help when you need it, via chat or phone...
...MEDICAL GROUP ADMINISTRATOR (Turnaround-Focused | System-Level Role) Client Organization Health System: Central Maine Healthcare (acquired by Prime Healthcare) Ownership Post-Close: Not-for-Profit Primary Location: Lewiston, Maine System Scope: Central...