Grigori Fursin

Grigori Fursin

Sep 23, 2021

Grigori Fursin

Grigori Fursin

Sep 23, 2021

OctoML joins the community effort to democratize MLPerf inference benchmarking

OctoML enters the MLPerf inference benchmark with the first two submissions accelerated via Apache TVM and automated with MLCommons' Collective Knowledge framework.

Jason Knight

Jason Knight

Mar 4, 2021

Jason Knight

Jason Knight

Mar 4, 2021

Up to 9x performance improvements with TVM’s new auto-scheduler

Autoscheduling enables higher performance end to end model optimization from TVM, while also enabling users to write custom operators even easier than before.

Jason Knight

Jason Knight

Feb 25, 2021

Jason Knight

Jason Knight

Feb 25, 2021

Compiling classical ML for performance gains (up to 30x) and hardware portability

Today, machine learning engineers and data scientists use popular frameworks such as Scikit-learn, XGBoost, and LightGBM to train and deploy classical ML models such as linear and logistic regression, decision trees and gradient boosting.

Jason Knight

Jason Knight

Jan 15, 2021

Jason Knight

Jason Knight

Jan 15, 2021

In the cloud — Sparsity on GPUs provides 5X speedup

As AI models get larger, the importance of each weight for a typical inferencing decreases...

Sayce Falk

Sayce Falk

Dec 16, 2020

Sayce Falk

Sayce Falk

Dec 16, 2020

On the Apple M1, Beating Apple’s Core ML 4 With 50% Model Performance Improvements

Apple’s release of an Arm-based chip, called the M1, was a seismic shift in the personal computing landscape.

Accelerate Your AI Innovation