See all roles

[Remote] Agentic AI Data Engineer (Scientific Data)

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. BayOne Solutions is seeking an Agentic AI Data Engineer to build and manage data ingestion pipelines for scientific data. The role involves collaborating with various teams to establish data standards and ensuring the integrity and usability of datasets.

Responsibilities

  • Build an agentic data ingestion pipeline
  • Triage and prioritize incoming requests to ingest specific datasets
  • Clean and organize the data. Build the first pass cleaning and organization steps into the agentic flow
  • Validate cross-modal linkage. Add automated checks that catch when ingested data does not connect correctly and flag low quality or mismatched records
  • Version every dataset. Retain and make prior versions addressable
  • Preserve raw data and provenance. Make agent workflows log validation and transformation steps so lineage is traceable
  • Make agents usable across teams. Move beyond bespoke steps towards agents that teams can reliably use as a shared, deployed service
  • Collaborate with AI, software engineering, and computational biology groups to co-define data standards and conventions

Skills

  • Agentic AI engineering: Demonstrated experience building multi-agent workflows or LLM workflows using tools/frameworks such as LangGraph or LlamaIndex, including tool/function calling and asynchronous task execution
  • Python data engineering: Strong Python for data manipulation, working with APIs and databases, and handling heterogeneous data formats
  • Data versioning and provenance: Familiarity with dataset versioning approaches (e.g. DVC, lakeFS, or equivalent)
  • Working knowledge of scientific data structures: Comfortable or willingness to learn common omics data formats like AnnData, H5AD, TileDB
  • Basic understanding of omics: No deep bioinformatics expertise required; just a basic understanding of different modalities (e.g. what is RNA-seq vs scRNA-seq vs WES; genomics vs transcriptomics vs proteomics vs metabolomics)
  • Unit testing: Comfortable writing unit and functional tests to ensure data processing workflows are reliable and reproducible
  • Education: Degree in a technical field or equivalent practical experience
  • Experience deploying agent workflows as a shared service (e.g., FastAPI or MCP endpoints)
  • Exposure to cloud (AWS, GCP) and containerization (Docker)
  • Familiarity with workflow managers such as Nextflow or Snakemake

Company Overview

  • BayOne Solutions provides computer programming services. It was founded in 2012, and is headquartered in Pleasanton, California, USA, with a workforce of 501-1000 employees. Its website is https://bayone.com/.
  • Company H1B Sponsorship

  • BayOne Solutions has a track record of offering H1B sponsorships, with 23 in 2025, 25 in 2024, 20 in 2023, 30 in 2022, 20 in 2021, 37 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    You might like

    [Remote] Mechanical Design Engineer - Combined Cycle Power Plant

    Work from home Full-time role

    [Remote] Staff Solution Consultant

    Work from home Full-time role

    [Remote] Account Specialist | Remote

    Work from home Full-time role

    [Remote] Global Account Manager - East Coast

    Work from home Full-time role

    [Remote] Director of Business Development, Navy & Marine Corps

    Work from home Full-time role

    [Remote] Senior Technical Project Manager

    Work from home Full-time role

    [Remote] Customer Service Representative - Behavioral Health

    Work from home Full-time role

    [Remote] Marketing - Merchandising Product Manager

    Work from home Full-time role

    [Remote] Client Growth Executive I

    Work from home Full-time role

    [Remote] Staff Software Engineer, Platform

    Work from home Full-time role

    Content Creator

    Work from home Full-time role

    Experienced Call Center Customer Service Representative – Delivering Exceptional Arenaflex Customer Experiences

    Work from home Full-time role

    Seasonal Reader - Admissions - College of Arts & Sciences - Temp/Part-Time

    Work from home Full-time role

    [Remote-Position] Case Manager RN - Field (Passaic County, NJ)

    Work from home Full-time role

    F-35 Principal / Sr. Principal SLO Field Support Engineer (FSE) - R10189921

    Work from home Full-time role

    Experienced Data Entry and Administrative Support Professional – Remote Work Opportunities for Part-Time and Full-Time Positions

    Work from home Full-time role

    Job Title: Data Analytics Systems Administrator

    Work from home Full-time role

    Startup Finance & Equity Writer

    Work from home Full-time role

    Data Modeler

    Work from home Full-time role

    Experienced Customer Care Resolution Coordinator – Work From Home | Remote Customer Support Specialist

    Work from home Full-time role