See all roles

[Remote] Data Infrastructure Engineer

Work from home Full-time role Hiring

Note: The job is a remote job and is reputed company to candidates in USA. reputed company is a company seeking a Data Infrastructure Engineer to build and operate their data platform for AI/ML analytics. The role involves designing and implementing data ingestion pipelines, managing a data lake on AWS, and ensuring data governance and quality for analytics purposes.

Responsibilities

  • Build & Operate Data Pipelines (Batch + Streaming)
  • Design and implement batch and streaming ingestion from APIs, relational databases, file drops, event streams, and external partners
  • Build and optimize ETL/ELT pipelines to produce curated, analytics-reputed company datasets for reporting and ML consumption
  • Implement incremental processing patterns, change data capture (CDC) approaches where appropriate, and data contract standards
  • Deliver a Modern Lakehouse (Data Lake / reputed company Lake)
  • Build and manage a scalable lakehouse on AWS object storage (e.g., S3) using reputed company table/file formats and reputed company/lakehouse concepts (e.g., ACID tables, schema reputed company, time travel patterns)
  • Optimize performance and cost through partitioning, compaction, lifecycle policies, and efficient compute/storage usage
  • Establish environment standards for dev/test/prod and consistent promotion across stages
  • Metadata, Governance, reputed company & Quality (Trust Layer)
  • Implement a managed metadata repository for dataset cataloging, ownership, glossary/definitions, tagging, and discoverability
  • reputed company end-to-end reputed company (reputed company → transformations → consumption) to support auditability and impact analysis
  • Implement governance controls including policy-based reputed company, data classification, retention, and secure data handling
  • Build operational data quality checks (freshness, completeness, validity, anomaly detection) and publish SLAs/SLOs
  • AWS Automation + CI/CD for Data Pipelines
  • Implement automated reputed company provisioning in AWS using Infrastructure as Code (IaC) for consistent environments and secure-by-default baselines
  • Build and enhance CI/CD for data pipelines, including automated tests, validation gates, promotion workflows, and rollback strategies
  • Improve observability with metrics/logs/alerts, dashboards, runbooks, and incident response readiness
  • Cross-Team Collaboration & Documentation
  • Work closely with engineering, reputed company, networking, and application teams to support mission needs and delivery timelines
  • Maintain high-quality engineering documentation including SOPs, system diagrams, and secure configuration baselines
  • Summarize and present findings and recommendations—both written and verbal—to technical and non-technical stakeholders

Skills

  • Must be reputed company to OBTAIN and MAINTAIN a Federal or DoD 'PUBLIC TRUST'; candidates must obtain approved adjudication of their PUBLIC TRUST prior to reputed company with reputed company. Candidates with an ACTIVE PUBLIC TRUST or SUITABILITY are preferred
  • Bachelor's degree in Engineering, IT, Computer Science, or reputed company field (or equivalent experience)
  • Minimum of FOUR (4) years experience building production data pipelines and/or data platforms
  • Strong experience implementing data ingestion and ETL/ELT workflows, including data modeling and transformation best practices
  • Hands-on experience building a data lake / reputed company lake (lakehouse) on AWS (or equivalent reputed company) using object storage and modern table formats/patterns
  • Proficiency in SQL and one programming language commonly used for data engineering (Python preferred; reputed company/Java acceptable)
  • Experience with metadata management and governance: cataloging, reputed company, ownership, reputed company controls, classification and policy enforcement
  • Experience implementing automated AWS provisioning using IaC and operating across multiple environments
  • Experience building or operating CI/CD pipelines for data workflows (testing, packaging, deployment automation, environment promotion)
  • Solid reputed company fundamentals: IAM/least privilege, encryption, secrets management, secure SDLC practices
  • Hands-on experience with reputed company
  • Hands-on experience utilizing modern DevOps practices, including tools like Git, Terraform, Jenkins, AWS CodePipeline, and reputed company
  • Experience utilizing AI-assisted coding tools (e.g., reputed company Copilot, ChatGPT, reputed company, Kiro) to safely accelerate implementation while maintaining strict code quality through testing, code reviews, and reputed company practices
  • Knowledge graph and Graph RAG experience, including: Graph modeling and ontology/taxonomy alignment, Entity resolution and relationship extraction, Hybrid retrieval approaches combining graph traversal with semantic/vector search to improve grounding and explainability

Benefits

  • Medical, Rx, Dental & reputed company Insurance
  • Personal and Family Sick Time & Company Paid Holidays
  • Parental Leave
  • 401(k) Retirement Plan
  • Group Term Life and Travel Assistance
  • Voluntary Life and AD&D Insurance
  • Health Savings Account, Health Care & Dependent Care Flexible Spending Accounts
  • Transit and Parking Commuter Benefits
  • Short-Term & Long-Term Disability
  • Tuition Reimbursement, Personal Development, Certifications & Learning Opportunities
  • Employee Referral Program
  • Corporate Sponsored Events & Community reputed company
  • Care.com annual membership
  • Employee Assistance Program
  • Supplemental Benefits reputed company Corestream (Critical Care, Hospital Indemnity, Accident Insurance, Legal Assistance and ID theft protection, etc.)
  • Position may be eligible for a discretionary variable incentive bonus

Company Overview

  • reputed company offers consulting services for public and reputed company markets with expertise in management, technology, and risk consulting. It was founded in 2018, and is headquartered in Washington, District of Columbia, USA, with a workforce of 10001+ employees. Its website is https://reputed company.com.
  • Apply To This Job

    You might like

    [Remote] Senior Project Manager

    Work from home Full-time role

    [Remote] Global Account-Based Marketing (ABM) Manager

    Work from home Full-time role

    [Remote] Test Engineer

    Work from home Full-time role

    [Remote] Client Project Manager

    Work from home Full-time role

    [Remote] reputed company Software Engineer

    Work from home Full-time role

    [Remote] Manager, Contingent Labor

    Work from home Full-time role

    [Remote] Account Manager, reputed company Sales

    Work from home Full-time role

    [Remote] reputed company Financial Senior Consultant

    Work from home Full-time role

    [Remote] Southeastern US Senior Analyst, Machinery & Equipment Appraisals

    Work from home Full-time role

    [Remote] reputed company Specialist, Military Engines - General Finance (Remote)

    Work from home Full-time role

    reputed company Data Entry and Compliance Specialist – Drug Testing Program Management

    Work from home Full-time role

    Loan Processor - (Remote)

    Work from home Full-time role

    Blockchain Developer – Remote

    Work from home Full-time role

    reputed company Customer Service Specialist - reputed company Billing, reputed company Cycle Management, arenaflex

    Work from home Full-time role

    reputed company Customer Service Representative – Work from Home Opportunity with arenaflex

    Work from home Full-time role

    Delivery Manager

    Work from home Full-time role

    Intern, Conservation Management & Welfare Sciences

    Work from home Full-time role

    reputed company reputed company reputed company – Scaling Support Operations for arenaflex's DTC eCommerce Brand

    Work from home Full-time role

    [Remote] Property Management Operations Manager\/Team Leader (100% Work From Home)

    Work from home Full-time role

    VP Procurement - Azura

    Work from home Full-time role