See all roles

[Remote] Applied AI Inference Engineer

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Baseten is a company that powers mission-critical inference for leading AI companies. The Applied AI Inference Engineer will partner with customers to architect, build, and deploy high-scale production AI applications, translating business goals into reliable services with clear outcomes.

Responsibilities

  • Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects
  • Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring). This involves working with customers’ engineering teams at every stage of the customer journey including: sales, implementation, and expansion
  • Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers
  • Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs
  • Own products and customer projects end-to-end, functioning as both an engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution
  • Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity
  • Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates

Skills

  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field
  • 1+ years of professional work experience in a fast-paced, high-growth environment
  • Demonstrated experience with one or more general-purpose programming languages in a production-level environment, with a strong preference for Python
  • Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment
  • Strong communication skills, particularly on complex technical topics
  • Experience in building or optimizing AI/ML projects is highly valued

Benefits

  • Competitive compensation, including meaningful equity.
  • 100% coverage of medical, dental, and vision insurance for employee and dependents
  • Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
  • Paid parental leave
  • Fertility and family-building stipend through Carrot
  • Company-facilitated 401(k)
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Company Overview

  • Baseten is an AI infrastructure company that integrates machine learning into business operations, production, and processes. It was founded in 2019, and is headquartered in San Francisco, California, USA, with a workforce of 201-500 employees. Its website is https://www.baseten.co.
  • Company H1B Sponsorship

  • Baseten has a track record of offering H1B sponsorships, with 1 in 2026, 6 in 2025, 8 in 2024, 1 in 2023, 1 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    You might like

    Junior SRE (Endpoint focus)

    Work from home Full-time role

    AI Solutions Engineer

    Work from home Full-time role

    Software Engineer II, Machine Learning (Feature Platform)

    Work from home Full-time role

    Software Engineer II, Machine Learning (Feature Platform)

    Work from home Full-time role

    Productivity Platforms Developer

    Work from home Full-time role

    Associate Network Engineer

    Work from home Full-time role

    Junior Software Engineer (Backend + AI)

    Work from home Full-time role

    Software Engineer II, Machine Learning (Feature Platform)

    Work from home Full-time role

    Software Engineer (Secret) (4611)

    Work from home Full-time role

    Support Products Analyst

    Work from home Full-time role

    Experienced Customer Support Specialist – Online Live Chat Assistant (Entry Level / Immediate Start)

    Work from home Full-time role

    Primary Level Translators

    Work from home Full-time role

    NYS Regents Earth & Space Sciences Tutor (REQUIRED: Current NYS ESS High School Teacher)

    Work from home Full-time role

    Part-time Instructor: Music

    Work from home Full-time role

    Experienced Chat Support Representative - Work from Home Opportunity at arenaflex

    Work from home Full-time role

    Experienced Full Stack Data Entry Specialist – Remote Opportunity with arenaflex

    Work from home Full-time role

    Practice Advisor

    Work from home Full-time role

    Lead Solutions Engineer (Remote)

    Work from home Full-time role

    IT Cybersecurity Advisor

    Work from home Full-time role

    Experienced Full Stack Customer Service Representative – Work-From-Home, Paid Training, and Career Growth Opportunities at arenaflex

    Work from home Full-time role