How will my ML model perform in real life?

spring

Each model needs to be measured in the target hardware environment:

  • Inference time (ms)
  • Energy per inference (mJ), particularly for battery-powered devices
  • Power vs. time “P(t)”, peak power, and average power during Inference
  • Inference accuracy

System design decisions that can impact these metrics include:

  • Chip clock speed and core voltage
  • Memory organization, such as: Model and weights in RAM or NVM? Internal or External? Tensor arena stored in Internal or External RAM?
  • Vendor supplied Runtimes+SDK

The Solution: ModelCat Hardware Farm

A complete environment for accurate model benchmarking and analysis

  • Standardized interface across all vendor hardware
  • Connected to a ModelCat Measurement System, which measures inference timing and energy in real time
  • REAL results, from REAL HARDWARE
Take a Test Drive