Lead Data Engineer – Big Data Architecture & Analytics Solutions Lead at arenaflex
About arenaflex – Pioneering Data‑Driven Entertainment Experiences
arenaflex is a global leader in creating immersive digital experiences that captivate audiences worldwide. With a legacy of innovation in streaming, interactive media, and next‑generation entertainment platforms, arenaflex continuously pushes the boundaries of what’s possible through data‑powered storytelling. Our mission is to transform raw information into actionable insights that fuel creative decisions, enhance user engagement, and drive sustainable growth across all of our brands.
Why This Role Matters
As the Lead Data Engineer for arenaflex’s Information and Investigation group, you will be at the heart of a high‑impact team that builds the data foundations for our web‑based features, advertising platforms, and emerging business ventures. This is not a simple data entry position; it is a strategic, technically demanding role that empowers the organization to harness massive data volumes, deliver real‑time analytics, and support data‑driven product innovation.
Role Overview
In this full‑time, 8‑hour‑per‑day position based in San Francisco, USA, you will lead the design, development, and operational excellence of large‑scale data pipelines and warehouses. You will collaborate with product owners, data scientists, and architecture teams to translate business requirements into robust, scalable data solutions that meet stringent Service Level Agreements (SLAs) and uphold the highest standards of data quality.
Key Responsibilities
- Architectural Leadership: Define and evolve arenaflex’s data platform strategy, focusing on high‑performance data warehousing, ETL pipelines, and real‑time streaming architectures.
- Pipeline Development: Design, build, and maintain end‑to‑end data ingestion pipelines in both cloud (AWS) and on‑premise environments, leveraging technologies such as Hadoop, Snowflake, and Apache Flink.
- Technology Stack Management: Champion the use of modern big‑data tools—including Hadoop, HDFS, Hive, Spark, PySpark, and Scala—while ensuring seamless integration with Python‑based analytics workflows.
- Collaboration & Delivery: Partner with Data Product Owners, Data Modelers, and Data Architects to implement data models, data marts, and analytical datasets that support downstream analytics and machine‑learning initiatives.
- Documentation & Governance: Maintain comprehensive technical documentation, data lineage, and metadata catalogs to support data governance, compliance, and audit requirements.
- Performance Optimization: Monitor, troubleshoot, and fine‑tune data pipelines to achieve optimal throughput, latency, and cost efficiency, ensuring SLA adherence for internal and external stakeholders.
- Agile Participation: Actively contribute to Scrum ceremonies, sprint planning, and retrospectives, fostering a culture of continuous improvement and rapid delivery.
- Mentorship: Guide junior engineers and data analysts, sharing best practices in data engineering, testing, and deployment automation.
Essential Qualifications
- Bachelor’s degree in Computer Science, Engineering, Information Systems, or a related discipline (or equivalent practical experience).
- 5+ years of hands‑on experience designing and operating large‑scale data platforms in a production environment.
- Deep expertise in SQL and the ability to craft complex queries for data extraction, transformation, and reporting.
- Proficiency with big‑data ecosystems: Hadoop, HDFS, Hive, Spark, PySpark, and familiarity with streaming frameworks such as Apache Flink or Kafka.
- Strong programming skills in Python and either Java or Scala, with a track record of building reliable, maintainable code.
- Experience with at least one major MPP or cloud data warehouse technology (e.g., Snowflake, Amazon Redshift, Google BigQuery).
- Hands‑on experience building and maintaining data pipelines using orchestration tools (e.g., Airflow, Luigi) and data integration platforms.
- Solid understanding of data modeling principles, dimensional modeling, and data lake architecture.
- Familiarity with Agile and Scrum methodologies, and the ability to thrive in fast‑paced, cross‑functional teams.
- Excellent problem‑solving abilities, strong communication skills, and a collaborative mindset.
Preferred Qualifications & Additional Skills
- Master’s degree or advanced certifications in data engineering, cloud architecture, or related fields.
- Hands‑on experience with AWS services such as S3, EMR, EC2, Lambda, and Glue.
- Knowledge of containerization (Docker, Kubernetes) and CI/CD pipelines for data workloads.
- Exposure to data quality frameworks and data governance tools (e.g., Great Expectations, Collibra).
- Experience supporting data‑driven advertising or marketing analytics platforms.
- Demonstrated ability to lead technical initiatives from concept through production rollout.
Core Skills & Competencies
- Analytical Thinking: Ability to dissect complex business problems and translate them into scalable data solutions.
- Technical Leadership: Proven track record of influencing architecture decisions and driving technical excellence.
- Collaboration: Strong partnership skills with product, analytics, and engineering teams.
- Communication: Clear articulation of technical concepts to both technical and non‑technical audiences.
- Adaptability: Comfort working in a dynamic environment where priorities shift rapidly.
- Quality Focus: Commitment to data integrity, reliability, and compliance.
Compensation, Perks & Benefits
arenaflex offers a competitive hourly rate of $24 per hour, complemented by a comprehensive benefits package that includes:
- Health, dental, and vision insurance with generous employer contributions.
- 401(k) retirement plan with matching contributions.
- Paid time off, holidays, and flexible work arrangements.
- Professional development budget for certifications, conferences, and training.
- Wellness programs, on‑site fitness facilities, and mental‑health resources.
- Employee stock purchase plan and performance‑based bonuses.
- Opportunities to work on cutting‑edge technologies within a globally recognized entertainment brand.
Career Growth & Learning Opportunities
At arenaflex, your career trajectory is shaped by your ambition. As a Lead Data Engineer, you will:
- Gain exposure to multi‑petabyte data environments and industry‑leading analytics platforms.
- Lead cross‑functional initiatives that directly influence product strategy and revenue growth.
- Mentor emerging talent, building a pipeline of future data leaders.
- Participate in internal hackathons, innovation labs, and research collaborations.
- Transition into senior architecture, data‑science leadership, or product management roles as your interests evolve.
Work Environment & Culture at arenaflex
arenaflex fosters an inclusive, collaborative, and forward‑thinking culture where creativity meets technology. Our teams are empowered to experiment, fail fast, and iterate quickly. We celebrate diversity, encourage continuous learning, and prioritize work‑life balance. Whether you’re in a bustling San Francisco office or collaborating remotely, you’ll be part of a community that values curiosity, integrity, and impact.
How to Apply
If you are ready to lead the next generation of data infrastructure at a world‑class entertainment company, we want to hear from you. Click the link below to submit your application and start your journey with arenaflex.
Apply Job!
``` Apply for this job