
    MLflow for Experiment Tracking
    The 2026 Skills Guide

    MLflow is among the most widely deployed open-source MLOps platforms at UK companies. This guide covers all four components — Tracking, Projects, Models, and the Model Registry — and how they fit into a production ML pipeline.

    Why ML Experiments Need Tracking

    ML development is fundamentally experimental. Training a model involves dozens of decisions — hyperparameters, data preprocessing steps, feature engineering choices, architecture variants — each of which affects model quality in ways that are often non-obvious. Without systematic tracking, it is impossible to answer basic questions: which configuration produced the best model? What was different between the model we deployed last month and the one we're deploying now? Why did model quality drop when we changed the training data pipeline?

    MLflow solves this by providing a structured store for every experiment run: the code version, environment, input parameters, output metrics, and the trained model artifact — all linked together and searchable. This transforms ML development from ad hoc experimentation into a reproducible, auditable engineering process.

    MLflow Tracking: Core API

    MLflow Tracking organises runs into experiments (typically one experiment per model or project). Within a run (a combined sketch follows this list):

    • mlflow.log_param(key, value) — Log a single hyperparameter (learning rate, batch size, model architecture). Parameters are fixed values that describe the run configuration; they don't change during training.
    • mlflow.log_metric(key, value, step=None) — Log a numeric metric, optionally at a specific training step. Metrics are time-series: calling log_metric with step allows tracking train_loss and val_loss curves across epochs. The MLflow UI plots these as interactive charts.
    • mlflow.log_artifact(local_path) — Log a file (confusion matrix image, feature importance plot, evaluation CSV) to the artifact store.
    • mlflow.log_dict(dictionary, artifact_file) — Log a Python dict as a JSON file. Useful for logging classification reports or nested config objects.
    • Context manager pattern — with mlflow.start_run() as run: automatically ends the run even if an exception occurs. run.info.run_id gives the run ID for later reference.
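
    Putting these calls together, a minimal sketch of a tracked run (the experiment name, parameter values, and logged dict are illustrative):

        import mlflow

        mlflow.set_experiment("churn-model")  # illustrative experiment name

        with mlflow.start_run() as run:
            # Fixed run configuration
            mlflow.log_param("learning_rate", 0.01)
            mlflow.log_param("batch_size", 64)

            # Time-series metric: one point per epoch
            for epoch in range(10):
                loss = 1.0 / (epoch + 1)  # placeholder for a real training loss
                mlflow.log_metric("train_loss", loss, step=epoch)

            # Dicts (and files, via log_artifact) land in the artifact store
            mlflow.log_dict({"classes": ["churn", "no_churn"]}, "labels.json")

        print(run.info.run_id)  # keep for later artifact fetching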

    Tags are free-form string key-value pairs stored with a run. Useful for: git commit SHA, developer name, data version, environment (dev/staging/prod), and any categorical metadata that isn't a hyperparameter.
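
    For example (tag values illustrative; the git call assumes the script runs inside a repository):

        import subprocess
        import mlflow

        with mlflow.start_run():
            sha = subprocess.check_output(["git", "rev-parse", "HEAD"]).decode().strip()
            mlflow.set_tags({
                "git_sha": sha,
                "developer": "jdoe",        # illustrative
                "data_version": "2026-01",  # illustrative
                "environment": "staging",
            })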

    Autologging — mlflow.sklearn.autolog(), mlflow.pytorch.autolog(), and mlflow.transformers.autolog() automatically capture hyperparameters, metrics, and models from supported frameworks without explicit log calls. Enable at the start of your script, before training. This is the recommended approach for standard workflows.
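
    A minimal autologging sketch with scikit-learn (synthetic data for illustration):

        import mlflow
        from sklearn.datasets import make_classification
        from sklearn.linear_model import LogisticRegression

        mlflow.sklearn.autolog()  # enable before any training code runs

        X, y = make_classification(n_samples=500, random_state=0)
        with mlflow.start_run():
            # Hyperparameters, metrics, and the fitted model are
            # captured automatically; no explicit log calls needed.
            LogisticRegression(max_iter=200).fit(X, y)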

    The MLflow Model Registry

    The Model Registry provides a centralised store for managing the lifecycle of models across development, staging, and production environments. It addresses the operational question: how do we know which model is deployed in production, who approved it, and what training run produced it?

    Model lifecycle stages (a transition sketch follows this list):

    • None → A model version exists in the registry but has not been formally assigned a deployment stage.
    • Staging → The model is being evaluated for production deployment. In most teams, this triggers automated integration tests, comparison against the current production model, and human review.
    • Production → The model is actively serving in production. A new version moves to Production only after review; the previous Production version can be archived automatically as part of the transition (via the client's archive_existing_versions flag).
    • Archived → The model is no longer in active use but its lineage is preserved for audit purposes.
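
    A stage transition through the client API might look like the sketch below (model name and version are illustrative; note that recent MLflow releases deprecate stages in favour of the aliases described next):

        from mlflow.tracking import MlflowClient

        client = MlflowClient()
        client.transition_model_version_stage(
            name="churn-model",              # illustrative registered model name
            version="3",                     # illustrative version
            stage="Production",
            archive_existing_versions=True,  # archive the outgoing Production version
        )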

    Aliases (introduced in MLflow 2.x) provide a more flexible alternative to stages: assign named aliases (e.g., champion, challenger) to specific model versions. This decouples deployment semantics from fixed stage names and supports more complex deployment patterns like A/B testing.
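
    A sketch of the alias workflow (model name, alias, and version are illustrative):

        import mlflow
        from mlflow.tracking import MlflowClient

        client = MlflowClient()
        # Point the 'champion' alias at version 3 of the registered model
        client.set_registered_model_alias("churn-model", "champion", "3")

        # Deployment code loads by alias rather than a hard-coded version
        model = mlflow.pyfunc.load_model("models:/churn-model@champion")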

    Model flavors define how a model can be loaded and served. A model may have multiple flavors — e.g., a scikit-learn model has sklearn (load as an sklearn estimator) and python_function (load as a generic callable) flavors. MLflow's model serving infrastructure uses the python_function flavor by default, making it framework-agnostic.
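
    For instance, a logged scikit-learn model can be loaded through either flavor (run ID placeholder, as in the FAQ below):

        import mlflow

        uri = "runs:/<run_id>/model"  # substitute a real run ID

        # sklearn flavor: the full estimator API (predict_proba, attributes, ...)
        sk_model = mlflow.sklearn.load_model(uri)

        # python_function flavor: a generic predict() interface, the one
        # MLflow's serving infrastructure uses by default
        pyfunc_model = mlflow.pyfunc.load_model(uri)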

    MLflow in Production: Server Setup and Integrations

    Remote tracking server: In production, all team members point to a shared MLflow server rather than logging locally. Deploy with: backend store (PostgreSQL for run metadata), artifact store (S3, GCS, or Azure Blob for models and artifacts), and the tracking server process. Databricks provides a fully managed MLflow service as part of its platform, which is why MLflow adoption is particularly high among Databricks customers.
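
    Client-side, each training script then needs only the tracking URI (server address illustrative):

        import mlflow

        mlflow.set_tracking_uri("http://mlflow.internal.example:5000")  # illustrative
        mlflow.set_experiment("churn-model")
        # All subsequent log calls go to the shared server, and artifacts
        # are stored in the configured S3/GCS/Azure bucket.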

    Integration with training pipelines (an Airflow sketch follows this list):

    • Apache Airflow — Wrap your MLflow training job as an Airflow PythonOperator. Use mlflow.set_tracking_uri() inside the operator. The run ID can be passed between Airflow tasks via XCom for downstream artifact fetching.
    • Kubernetes Jobs — Set MLFLOW_TRACKING_URI as an environment variable in the Job spec. Use MLFLOW_S3_ENDPOINT_URL for custom S3-compatible artifact stores (MinIO on-premises).
    • GitHub Actions CI — Run model training as part of CI/CD. Compare new model metrics against the registered Production model; fail the pipeline if quality degrades below a threshold. Automate Model Registry transitions via the mlflow Python client.
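
    A sketch of the Airflow pattern using the TaskFlow API (the decorator form of PythonOperator); the DAG name, tracking URI, and task bodies are illustrative:

        import pendulum
        from airflow.decorators import dag, task

        TRACKING_URI = "http://mlflow.internal.example:5000"  # illustrative

        @dag(schedule=None, start_date=pendulum.datetime(2026, 1, 1), catchup=False)
        def train_pipeline():

            @task
            def train() -> str:
                import mlflow
                mlflow.set_tracking_uri(TRACKING_URI)
                with mlflow.start_run() as run:
                    mlflow.log_param("learning_rate", 0.01)
                    # ... training code ...
                    return run.info.run_id  # returned values travel via XCom

            @task
            def evaluate(run_id: str) -> None:
                import mlflow
                from mlflow.tracking import MlflowClient
                mlflow.set_tracking_uri(TRACKING_URI)
                # Fetch the model artifact logged by the upstream task
                local_dir = MlflowClient().download_artifacts(run_id, "model")
                # ... evaluation against the fetched model ...

            evaluate(train())

        train_pipeline()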

    Frequently Asked Questions

    What are the four components of MLflow?

    (1) Tracking: log parameters, metrics, and artifacts from training runs into Experiments. (2) Projects: standard format for packaging reproducible ML code (MLproject YAML). (3) Models: standard packaging format with flavors (sklearn, pytorch, transformers) and model signatures. (4) Model Registry: centralised lifecycle management — register versions, transition stages (Staging → Production → Archived), add aliases and tags.

    How do you log a PyTorch model in MLflow?

    mlflow.pytorch.log_model(model, artifact_path='model', registered_model_name='my_model'). Or use mlflow.pytorch.autolog() for automatic metric/parameter/model logging. Load with mlflow.pytorch.load_model('runs:/<run_id>/model').
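
    A fuller sketch (the network here is a trivial placeholder):

        import mlflow
        import torch

        model = torch.nn.Linear(4, 1)  # placeholder for a trained network

        with mlflow.start_run() as run:
            mlflow.pytorch.log_model(model, "model", registered_model_name="my_model")

        # Reload the exact model later by run ID
        loaded = mlflow.pytorch.load_model(f"runs:/{run.info.run_id}/model")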

    What is a model signature and why does it matter?

    A model signature defines expected input/output schema (column names, dtypes, tensor shapes). It documents what the model expects, enables validation at serving time, and auto-generates API request schemas for MLflow's serving infrastructure. Infer with mlflow.models.infer_signature(X_train, predictions).
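
    A sketch of signature inference at logging time (data and model are illustrative):

        import mlflow
        from mlflow.models import infer_signature
        from sklearn.datasets import make_regression
        from sklearn.linear_model import LinearRegression

        X, y = make_regression(n_samples=100, n_features=4, random_state=0)
        model = LinearRegression().fit(X, y)

        # Infer input/output schema from sample inputs and predictions
        signature = infer_signature(X, model.predict(X))

        with mlflow.start_run():
            mlflow.sklearn.log_model(model, "model", signature=signature)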

    How does MLflow compare to Weights & Biases?

    MLflow: open-source, self-hostable, strong model registry and deployment integration, widely used in enterprise/Databricks environments. W&B: commercial SaaS, richer experiment visualisation, better collaboration (dashboards, reports), superior hyperparameter sweeps. W&B is favoured at AI-native companies; MLflow at enterprises. Knowing both is advantageous.

    How do you set up a remote MLflow tracking server?

    Run mlflow server --host 0.0.0.0 --port 5000 --backend-store-uri postgresql://... --default-artifact-root s3://my-bucket/mlflow. Set MLFLOW_TRACKING_URI in client code or call mlflow.set_tracking_uri('http://server:5000'). For AWS, use SageMaker's managed MLflow or a self-hosted EC2 instance with S3 artifact store. Authentication via MLFLOW_TRACKING_USERNAME/PASSWORD or AWS IAM.


    Quick Facts

    Demand level
    High
    Difficulty
    Intermediate
    Time to proficiency
    2–4 weeks

    Key Concepts

    Experiments
    Runs
    Artifacts
    Model Registry
    Flavors
    Signatures
    Autologging
    Aliases

    Roles That Need This