[Remote] Research Intern (LLM)

Work from home Full-time role Hiring

Note: This is a remote position open to candidates in the USA. 2077AI Open Source Foundation is looking for a Research & Evaluation Intern to help build advanced QA datasets and evaluate large language models. This role is ideal for students passionate about LLMs, evaluation science, and the intersection of research and applied data work.

Responsibilities

  • Design and construct high-quality, sufficiently challenging QA datasets (graduate/PhD level) inspired by GPQA, HLE, and AI4Sci families, collaborating with a global network of talented researchers
  • Evaluate large language models on reasoning, factuality, and problem-solving benchmarks
  • Develop review pipelines and quality-control criteria for expert-level question generation
  • Analyze model outputs, conduct error taxonomy studies, and summarize insights for internal reports and research papers
  • Collaborate with the 2077AI Foundation’s open-source benchmark teams on public dataset releases

Skills

  • Strong background in computer science, data engineering, artificial intelligence, or related fields, with hands-on experience in large-scale data systems
  • 1+ years of experience with LLMs, prompt engineering, and evaluation frameworks (e.g., LM Eval Harness, OpenCompass)
  • Excellent written and verbal English skills and analytical reasoning
  • Strong execution and team management skills—able to translate high-level objectives into actionable plans and drive team outcomes
  • Experience with formal methods, chain-of-thought evaluation, or curriculum generation
  • Relevant publications in top conferences

Company Overview

  • The 2077AI Foundation is at the forefront of AI data standardization and progression. It is headquartered in Singapore and has a workforce of 51-200 employees. Its website is https://www.2077ai.com/.