How will my ML model perform in real life?

Each model needs to be measured in the target hardware environment:

Inference time (ms)
Energy per inference (mJ), particularly for battery-powered devices
Power vs. time “P(t)”, peak power, and average power during Inference
Inference accuracy

System design decisions that can impact these metrics include:

Chip clock speed and core voltage
Memory organization, such as: Model and weights in RAM or NVM? Internal or External? Tensor arena stored in Internal or External RAM?
Vendor supplied Runtimes+SDK

The Solution: ModelCat Hardware Farm

A complete environment for accurate model benchmarking and analysis

Standardized interface across all vendor hardware
Connected to a ModelCat Measurement System, which measures inference timing and energy in real time
REAL results, from REAL HARDWARE

Take a Test Drive