Which orchestration tools do you use?

Airflow for traditional scheduled DAGs, Dagster for modern asset-based pipelines, Prefect for Python-native workflows, Temporal for long-running stateful workflows. We pick based on team familiarity and pipeline complexity, not vendor preference.

What's your approach to feature stores?

For most teams: start with Postgres + materialized views (simple, cheap, sufficient). Scale to Feast or Tecton when you need real-time feature serving across multiple models. Skip feature stores entirely if you have only one model and one team. Premature abstraction kills velocity.

How do you monitor LLM and ML models in production?

We deploy LangSmith + Arize + OpenTelemetry for LLM observability (prompts, latency, token usage, hallucination rates). For traditional ML: Evidently AI or Arize for data drift, WhyLabs for ML health. Every BearPlex pipeline ships with dashboards and alerting before cutover.

Can you migrate legacy ETL to modern data pipelines?

Yes. Common migrations: Informatica/IBM DataStage → Airflow/Dagster, SSIS → dbt + Airflow, home-grown cron scripts → proper orchestration. We do parallel-run cutover (old and new systems run side-by-side until trust is established) to de-risk migrations in regulated environments.

What's your CI/CD for ML?

GitHub Actions or GitLab CI for the pipeline itself. MLflow or Weights & Biases for experiment tracking. Model registries (MLflow, SageMaker Model Registry) for versioning. Deployment via BentoML, Seldon Core, or native SageMaker/Vertex AI endpoints. Everything Git-versioned, everything reproducible.

Start a conversation

Data pipelines & MLOps

From raw to reliable.

Dashboards lie when pipelines drift. We build the ingestion, modelling, and quality gates that turn raw exports into data your team can bet a quarter on.

Talk to engineering

See the pipeline

Cycles, one pipeline, one quarter

2m 50s

Average train-and-deploy cycle

99.97%

Uptime across three pipelines

47 ms

P99 serving latency

From the run ledger of a typical automated deployment: one pipeline, one quarter.

The shape of the system

One graph, from source to decision.

A pipeline is a dependency graph, not a script. Every table knows what feeds it, every run knows what changed, and a failure anywhere stops exactly the branch it should.

Sourceslayer 1 of 4

Ingestion. Scheduled and streaming loads land raw with schema validated at the door, and the originals stay immutable so any run can be replayed.

Hover any node to see what that layer is responsible for. The pulse runs in dependency order because the pipeline does: nothing downstream rebuilds until its inputs pass.

The quality gate

Bad rows stop here, not in your board deck.

Every run validates its inputs before they move. Watch one malformed row hit the gate: the test names it, the quarantine holds it, and the clean rows carry on.

Tonight’s batch · orders

Quality gate: schema and column tests

Clean, flows on

Quarantine, held

empty

A wrong dashboard looks exactly like a right one.

That is the quiet failure mode of analytics: nothing crashes, the chart still renders, and the number is wrong. So tests run where the data enters, not where it is read. Schema at the door, nulls and uniqueness on every key, freshness on every source, and a dataset version pinned for every run so any result can be reproduced months later.

A stream of light diverted through an inspection gate into a holding basin while the main flow continues

The 9am contract

Built overnight. Trusted by nine.

Freshness is a contract, not a hope. The pipeline does its work while nobody is watching, and what your team opens at nine has already passed its tests. Here is a typical night on the loop.

MidnightNine