Jason Knight

Jason Knight

CPO, Co-Founder

9 Articles

Jason Knight

2

3 authors

Dec 16, 2021

Jason Knight

2

3 authors

Dec 16, 2021

Free pre-accelerated Model Zoo streamlines choice of model and cloud/edge targets

This week at TVMCon, OctoML is launching a model zoo with pre-accelerated, ready-to-download vision and language models. Running extremely fast, sub-millisecond models in production is now easier than ever, whether in the cloud or at the edge.

Jason Knight

Jason Knight

Aug 24, 2021

Jason Knight

Jason Knight

Aug 24, 2021

With Apache TVM, Microsoft Research develops and serves the latest computer vision algorithms on live streams

OctoML engineering collaborated with Microsoft Research on the “Watch For” project, an AI system for analyzing live video streams and identifying specified events within the streams.

Jason Knight

Jason Knight

Mar 4, 2021

Jason Knight

Jason Knight

Mar 4, 2021

Up to 9x performance improvements with TVM’s new auto-scheduler

Autoscheduling enables higher performance end to end model optimization from TVM, while also enabling users to write custom operators even easier than before.

Jason Knight

Jason Knight

Feb 25, 2021

Jason Knight

Jason Knight

Feb 25, 2021

Compiling classical ML for performance gains (up to 30x) and hardware portability

Today, machine learning engineers and data scientists use popular frameworks such as Scikit-learn, XGBoost, and LightGBM to train and deploy classical ML models such as linear and logistic regression, decision trees and gradient boosting.

Jason Knight

Jason Knight

Jan 15, 2021

Jason Knight

Jason Knight

Jan 15, 2021

In the cloud — Sparsity on GPUs provides 5X speedup

As AI models get larger, the importance of each weight for a typical inferencing decreases...

1

Accelerate Performance and Deployment Time