← all jobs

Post-Training Research Scientist (LLMs) — Experimental Track

Work from home Full-time role Hiring

About us Vetto is a global talent platform connecting top-tier professionals to high-impact AI projects around the world. Our mission is to build trust, quality, and long-term value in the AI ecosystem - for both exceptional talents and companies operating at the frontier of technology. About the role This role sits at the heart of Vetto’s mission: using high-quality human data to build AI systems that make the world better. You’ll take raw expert signals and turn it into tangible model improvement, experimenting rapidly and carving new paths in post-training. With full autonomy and no production constraints, you’ll have the freedom to try unconventional ideas and see their impact quickly.

Key Responsibilities

Design and run post-training experiments on frontier and open-weight LLMs (SFT, preference-based methods, rubric-driven training) Translate raw annotation artifacts (multi-step solutions, evaluations, adversarial prompts) into training-ready datasets. Prototype new reward signals beyond pairwise preferences (rubrics, constraints, structured critics). Analyze failure modes; propose data-centric fixes (sampling, curriculum, counterfactuals). Build lightweight training/eval pipelines; iterate quickly. Produce short internal memos: what worked, what didn’t, why. About you We’re looking for a researcher who thrives with autonomy, is hands-on, and brings a strong execution mindset and startup mentality. You are opinionated about data quality, pragmatic about tradeoffs, and comfortable moving quickly with incomplete information. You have strong experimental instincts — you can design, run, and interpret messy experiments and extract meaningful insights from them. Minimum Qualification PhD (or equivalent experience) in ML/AI, applied math, stats, or adjacent. Hands-on experience with LLM post-training (at least one of SFT/DPO/RLHF/RLVR). Solid Python + PyTorch/JAX; comfortable with training infra basics. Fluent English Preferred Qualification Worked with rubric-based evaluation or tool-augmented tasks. Experience mixing synthetic and human data. Familiarity with failure analysis and dataset audits. Work Model We operate remote-first. We focus on outcomes, not where the work is done. To support flexibility and personal choice, we maintain offices in select locations as an optional resource for the team. Location: Flexible (EU-friendly time zones preferred) Type: Full-time or long-term contract Equal Employment Opportunity Vetto is proud to be an equal opportunity employer and values diversity at our company. We do not discriminate on the basis of race, color, religion, national origin, sex, sexual orientation, gender identity, age, disability, veteran status, or any other protected characteristic. Type: Full-time or long-term contract

More open positions

Senior Product Manager

Work from home Full-time role

Senior Product Marketing Manager

Work from home Full-time role

Senior Performance Marketing Manager, New Channel Expansion

Work from home Full-time role

Travel Expert | Reviewer

Work from home Full-time role

Lead Technical Partner Enablement Engineer

Work from home Full-time role

Remote Live Chat Assistant (Entry-Level) - Unlock a World of Opportunities at careerzynith

Work from home Full-time role

[Remote] Data Collection Contributor

Work from home Full-time role

Remote Part-Time Data Entry Specialist – Precision Data Management & Quality Assurance at careerzynith

Work from home Full-time role

Customer Service Representative – Frontline Client Support & Relationship Management at careerzynith

Work from home Full-time role

Revenue Cycle Policy Analyst (Staff Consultant II) Remote / Telecommute Jobs

Work from home Full-time role

Backend Engineer (Infra), Core Sending

Work from home Full-time role

Inbound Answering Service Operator - Up to $16/hr

Work from home Full-time role

[Remote] Operations & AI Automation Associate

Work from home Full-time role

Steuerfachangestellter / Steuerfachwirt / Bilanzbuchhalter (alle m/w/d) 35.000 - 60.000 € Jahresgehalt

Work from home Full-time role

Remote Clerical Jobs Hiring ASAP 405-504-3140 OKC, OK

Work from home Full-time role

Insurance Producer - Springfield, MA

Work from home Full-time role

Technical Writer - Talent Acquisition Team

Work from home Full-time role

Real Estate Assistant / Transaction Coordinator

Work from home Full-time role

Inside sales specialist

Work from home Full-time role

Enterprise Account Executive

Work from home Full-time role

High-Volume Recruiter (Remote)

Work from home Full-time role