[Remote] Artificial Intelligence Engineer
Note: The job is a remote job and is open to candidates in USA. Creospan Inc. is a growing tech collective that offers innovative solutions to propel businesses into a better tomorrow. They are seeking a Software Engineer III who will design data engines and evaluation frameworks to support AI research and integrate breakthrough technologies into widely-used products.
Responsibilities
- Design and develop multimodal datasets for training and evaluating AI models across image, video, audio, and text modalities
- Build and execute model evaluation frameworks, benchmarks, and performance metrics
- Fine-tune machine learning models using supervised and preference-based training techniques
- Utilize multimodal LLMs to generate, augment, balance, and improve training datasets
- Develop internal annotation tools and user interfaces using React and TypeScript
- Build and maintain scalable data ingestion and processing pipelines
- Implement reliable large-scale data movement, batching, deduplication, retry logic, and parallel processing
- Collaborate closely with AI researchers, scientists, and engineers to advance state-of-the-art AI systems
Skills
- 5+ years of professional software engineering experience
- Strong understanding of machine learning fundamentals, including:
- Model fine-tuning (SFT, preference tuning)
- Prompt engineering
- Model evaluation methodologies
- Understanding of model failure modes
- Experience building multimodal datasets involving image, video, audio, and text data
- Strong Python programming skills
- Hands-on experience with:
- PyTorch
- Hugging Face ecosystem
- Model training and inference workflows
- Experience building production-grade web applications using:
- React
- TypeScript
- Strong SQL skills and experience developing large-scale data pipelines
- Experience designing reliable data processing systems including batching, deduplication, retries, and parallel execution
- Master's or PhD in Computer Science, Artificial Intelligence, Machine Learning, or related field
- Experience working on multimodal large language models (MLLMs)
- Research publications or significant contributions to AI research projects
- Open-source contributions to AI or machine learning projects
- Experience working in mid-sized technology companies or large-scale technology organizations
Company Overview