ML Data Engineer #978695 Job at Dexian, Seffner, FL

d3pCUHVJWFhvd1ltcW1VVTJhb2Y0OGlwZEE9PQ==
  • Dexian
  • Seffner, FL

Job Description

Job Title: Data Engineer – AI/ML Pipelines

Work Model: Hybrid (on-site 3 days a week)

Location: Seffner, FL

Position Summary

The Data Engineer – AI/ML Pipelines plays a key role in building and optimizing the data infrastructure that powers enterprise analytics and machine learning initiatives. This position focuses on developing robust, scalable, and intelligent data pipelines—from ingestion through feature engineering to model deployment and monitoring.

The ideal candidate has hands-on experience supporting end-to-end ML workflows , integrating operational data from Warehouse Management Systems (WMS) and ERP platforms , and enabling real-time predictive systems . This is a highly collaborative role, working across Data Science, ML Engineering, and Operations to ensure that models are fed with clean, reliable, and production-ready data.

Key Responsibilities

ML-Focused Data Engineering

  • Build and maintain data pipelines optimized for machine learning workflows and real-time model deployment.
  • Partner with data scientists to prepare, version, and monitor feature sets for retraining and evaluation.
  • Design and implement feature stores, data validation layers, and model input pipelines that ensure scalability and reproducibility.

Data Integration from WMS & Operational Systems

  • Ingest, normalize, and enrich data from WMS, ERP , and telemetry platforms.
  • Model operational data to support predictive analytics and AI-driven warehouse automation use cases.
  • Develop integrations that provide high-quality, structured data to data science and business teams.

Pipeline Automation & Orchestration

  • Design, orchestrate, and automate modular pipelines using tools such as Azure Data Factory , Airflow , or Databricks Workflows .
  • Ensure pipeline reliability, scalability, and monitoring for both batch and streaming use cases.
  • Implement CI/CD practices for data pipelines supporting ML deployment.

Data Governance & Quality

  • Establish robust data quality frameworks, anomaly detection, and reconciliation checks.
  • Maintain strong data lineage, versioning, and metadata management to ensure reproducibility and compliance.
  • Contribute to the organization’s broader data governance and MLOps standards.

Cross-Functional Collaboration

  • Collaborate closely with Data Scientists, ML Engineers, Software Engineers , and Operations teams to translate modeling requirements into technical solutions.
  • Serve as the technical liaison between data engineering and business users for ML-related data needs.

Documentation & Mentorship

  • Document data flows, feature transformations, and ML pipeline logic in a reproducible, team-friendly format.
  • Mentor junior data engineers and analysts on ML data architecture and best practices.

Required Qualifications

Technical Skills

  • Proven experience designing and maintaining ML-focused data pipelines and supporting model lifecycle workflows .
  • Proficient in Python , SQL , and data transformation tools such as dbt , Spark , or Delta Lake .
  • Strong understanding of cloud-based data platforms (Azure, Databricks) and data orchestration frameworks.
  • Familiarity with ML pipeline tools such as MLflow , TFX , or Kubeflow.
  • Hands-on experience working with Warehouse Management Systems (WMS) or other operational logistics data.

Experience

  • 5+ years in data engineering , with at least 2 years supporting AI/ML systems .
  • Proven track record building and maintaining production-grade pipelines in cloud environments.
  • Demonstrated collaboration with data scientists and experience turning analytical models into operational data products.

Education

  • Bachelor’s degree in Computer Science, Data Science, Engineering , or related field (Master’s preferred).
  • Relevant certifications are a plus (e.g., Azure AI Engineer , Databricks ML Associate , Google Professional Data Engineer ).

Preferred Qualifications

  • Experience with real-time data ingestion technologies (Kafka, Kinesis, Event Hubs).
  • Exposure to MLOps best practices and CI/CD for ML and data pipelines.
  • Industry experience in logistics, warehouse automation, or supply chain analytics .

Job Tags

3 days per week,

Similar Jobs

Foxtrot Aviation Services

Working Operations Manager- First Shift Job at Foxtrot Aviation Services

 ...leading a team through the grueling, hands-on work of aircraft cleaning and CIC removal. Schedule:MondayFriday, 6:30 AM 3:30 PM (First Shift) Experience:13 years management or supervisory experience preferred Requirements:Must be willing and able to perform very... 

Optimum Staffing

Hazmat Tanker Driver Job at Optimum Staffing

 ...to/from dispatch location ~ Drive a Sleeper truck hauling a Tanker Hazmat & Tanker endorsement required ~ Light Touch Freight ...  ...~ Accrued PTO & Vacation Time!~ Referral Bonus of $1000 per driver ($500 on day 31; $250 on day 181 to both referrer and referee.)... 

Wyndham Destinations

Reservations Supervisor Job at Wyndham Destinations

 ...is simple: to put the world on vacation. Our vacation ownership brands, Club Wyndham, Worldmark, Margaritaville Vacation Club, and Shell Vacations Club, include more than 245 vacation club resort locations across the globe. Innovation and growth keep our work interesting... 

OSI Engineering, Inc.

People Operations Generalist (HR) Job at OSI Engineering, Inc.

 ...~37 years of HR operations experience (start-up or fast-paced environments a plus)~ Manage onboarding: send offer letters and hire packets, run background checks (including E-Verify), and complete I-9 verifications ~ Handle all offboarding logistics: termination... 

Ultimate Staffing

Mortgage Loan Servicing Admin Job at Ultimate Staffing

 ...Job Title: Mortgage Loan Servicing Admin Compensation: Compensation starting at $22/hour, DOE Location: Onsite/Spokane, WA Job...  ...skills, both written and verbal. Proficiency in Microsoft Office applications, particularly Excel and Word; experience with loan...