Run an Evaluation¶

In Dyff, we call the process of running input data through an AI/ML system to produce outputs an Evaluation. Now that we’ve create a dataset, the next step is to run (or simulate) an Evaluation on that dataset.

Setup¶

Create an API client as described in the Python client guide:

API Client

import os
from dyff.client import Client

dyffapi = Client(api_key=os.environ["DYFF_API_TOKEN"])
ACCOUNT = "<your account ID>"

DyffLocalPlatform

from dyff.audit.local import DyffLocalPlatform

dyffapi = DyffLocalPlatform(storage_root="/some/dir")
ACCOUNT = "<arbitrary string>"

You’ll also need to know the IDs of a Dataset to use as input and an InferenceService to run on the dataset.

Run an evaluation on the platform¶

The process of running an evaluation is the same whether you’re using a Client connnected to a Dyff deployment, or a DyffLocalPlatform instance running locally.

Note

Some of the evaluation parameters, such as replicas and useSpotPods, have no effect when using a DyffLocalPlatform instance.

from datetime import datetime, timedelta
from dyff.schema.requests import (
    EvaluationCreateRequest,
    EvaluationInferenceSessionRequest,
)

dyffapi = ...
dataset_id: str = ...
service_id: str = ...
evaluation_request = EvaluationCreateRequest(
    account=account,
    dataset=dataset_id,
    inferenceSession=EvaluationInferenceSessionRequest(
        inferenceService=service_id,
        expires=datetime.now() + timedelta(days=1),
        replicas=1,
        useSpotPods=False,
    ),
    replications=2,
    workersPerReplica=2,
)
evaluation = dyffapi.evaluations.create(evaluation_request)
print(evaluation.json(indent=2))

When running on a Dyff deployment, the create() call will return immediately. You can monitor progress using dyffapi.evaluations.get(evaluation.id).status. The .status will be Complete when the evaluation is finished.

Run local data through a remote session¶

You can also run an evaluation on local data using a remote inference session. This capability requires that you provide a Client instance that can communicate with an appropriate remotely-hosted Dyff instance:

from pathlib import Path

from dyff.audit.local import DyffLocalPlatform
from dyff.client import Client

account = "local"
root = Path("/home/me/dyff/my-analysis")

dyffremote = Client(...)
dyfflocal = DyffLocalPlatform(
    storage_root=root / ".dyff-local", remote_client=dyffremote
)

dataset = dyfflocal.datasets.create_arrow_dataset(
    str(root / "arrow_dataset"), account=account, name="test"
)
dyfflocal.datasets.upload_arrow_dataset(dataset, str(root / "arrow_dataset"))

inferencesession_id = ...
evaluation_id = dyfflocal.evaluations.local_evaluation(
    dataset=dataset.id, inferencesession=inferencesession_id
)

Then, you can run a local dataset managed by the DyffLocalPlatform through the remote inference session:

from pathlib import Path

from dyff.audit.local import DyffLocalPlatform
from dyff.client import Client

account = "local"
root = Path("/home/me/dyff/my-analysis")

dyffremote = Client(...)
dyfflocal = DyffLocalPlatform(
    storage_root=root / ".dyff-local", remote_client=dyffremote
)

dataset = dyfflocal.datasets.create_arrow_dataset(
    str(root / "arrow_dataset"), account=account, name="test"
)
dyfflocal.datasets.upload_arrow_dataset(dataset, str(root / "arrow_dataset"))

inferencesession_id = ...
evaluation_id = dyfflocal.evaluations.local_evaluation(
    dataset=dataset.id, inferencesession=inferencesession_id
)