← all jobs

[Remote] LLM - AI Quality Analyst (Personalization) - Dutch

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Careerflow.ai is seeking an AI Quality Analyst to evaluate a new personalization feature for Gemini. The role involves assessing how well the model utilizes personal information to generate relevant responses, requiring a combination of creativity and analytical skills.

Responsibilities

  • Designing and executing multi-turn conversational prompts (typically 1-5 turns) that require the AI to utilize your personal information and experiences
  • Evaluating model responses based on your intent from the starting prompt, checking if the personalization was appropriately applied
  • Analyzing responses for Grounding issues, ensuring claims about you are supported by evidence and not flawed inferences or hallucinations
  • Assessing Integration quality to ensure personal data is woven naturally into the response without robotic 'overnarrating'
  • Rigorously evaluating and stack-ranking two model responses side-by-side (SxS) to determine which is overall more helpful, easy to use, and enjoyable
  • Writing clear, defensible rationales for your comparisons, explicitly referencing where issues or positive aspects occurred in the conversation
  • Extracting and verifying 'Debug Info' from the model to confirm that chat summaries and data sources were properly utilized
  • Maintaining strict data hygiene by deleting evaluation conversations to prevent them from polluting your future chat history

Skills

  • Dutch Proficiency: Ability to read and write in Dutch with a high degree of comp, as Dutch is the focus language for this project
  • Schedule Flexibility: Full-time availability in your local time zone is required. We are staffing a global, 24-hour operations team
  • Exceptional Analytical Thinking: Demonstrate ability to evaluate nuanced and ambiguous AI responses, specifically assessing personalization quality
  • Creative Prompt Engineering: Experience in designing creative, multi-turn starting prompts based on personal context to thoroughly test the model's capabilities
  • Strong Evaluation Acumen: Understanding of personalization concepts, including the ability to identify incorrect personalization, poor inferences, and forced connections
  • Meticulous Attention to Detail: The ability to review Side-by-Side (SxS) model responses and spot subtle differences in naturalness and overnarrating
  • Excellent Written Communication: Superior ability to write clear, concise, and structured rationales for model rankings, explicitly referencing specific turn numbers
  • Feedback: Ability to provide constructive feedback and detailed annotations
  • Communication: Excellent communication and collaboration skills
  • Independence: Self-motivated and able to work independently in a remote setting
  • Technical Setup: Desktop/Laptop set up with a good internet connection
  • BS/BA degree or equivalent experience in a relevant field (e.g., Policy, Law, Ethics, Linguistics, Journalism, Computer Science, or a related analytical field)
  • Experience in data annotation, AI quality evaluation, content moderation, or a related role is strongly preferred

Company Overview

  • Careerflow - Career Copilot It was founded in 2022, and is headquartered in San Francisco, California, USA, with a workforce of 11-50 employees. Its website is https://www.careerflow.ai.
  • More open positions

    [Remote] Remote Customer Service Rep

    Work from home Full-time role

    [Remote] QA Tester (17557)

    Work from home Full-time role

    [Remote] Support and Services Operations Manager

    Work from home Full-time role

    [Remote] Junior Accountant

    Work from home Full-time role

    [Remote] Virtual Administrative Assistant

    Work from home Full-time role

    Director, UX Design (Freelance)

    Work from home Full-time role

    Remote Data Entry Specialist – Precision Data Management, Client Collaboration, and Fully Remote Work Opportunity

    Work from home Full-time role

    [Remote] Junior Full Stack Developer

    Work from home Full-time role

    [Remote] Packaging Engineer

    Work from home Full-time role

    Software Engineer, Platform - Dayton, OH, USA

    Work from home Full-time role

    Sales, Key Account Executive- Dallas (Remote)

    Work from home Full-time role

    Senior Software Engineer - Sales & Retail

    Work from home Full-time role

    Experienced Full Stack Customer Service Representative – Remote Customer Support in Brooklyn, NY

    Work from home Full-time role

    Experienced Customer Service Representative - Phones and Chat - Dallas, TX: Join careerzynith's Growing Digital Mailbox Industry

    Work from home Full-time role

    Remote Medical Billing Representative | WFH

    Work from home Full-time role

    Delta remote jobs (Virtual Assistant) US

    Work from home Full-time role

    Remote Dental Insurance Biller

    Work from home Full-time role

    Associate/Entry Level Consultant (Healthcare Consulting) - Project Management & Financial Analysis

    Work from home Full-time role

    911 Dispatcher/Communications Officer

    Work from home Full-time role

    Medical Records Coder 4 job at Inova in VA, MD, DC, DE, FL, GA, NC, OH, PA, SC, TN, TX, WV

    Work from home Full-time role

    Operations Manager - Senior | Health and Fitness | Remote

    Work from home Full-time role