Jared Roesch

Jared Roesch

Jun 21, 2022

Jared Roesch

Jared Roesch

Jun 21, 2022

New AI/ML Tool for Fans of Docker Compose and Kubernetes–OctoML CLI

We’re excited to share with you our first public release of OctoML CLI (v0.4.4) which provides an ML deployment workflow which should feel very familiar to anyone who uses Docker, Docker Compose and Kubernetes.

Sameer Farooqui
André Kang-Moeller, Staff Software Engineer, OctoML

Jun 21, 2022

Sameer Farooqui
André Kang-Moeller, Staff Software Engineer, OctoML

Jun 21, 2022

Fast-track to deploying machine learning models with OctoML CLI and NVIDIA Triton Inference Server

Today, we introduce the OctoML CLI, a Command Line Interface that automates the deploying deep learning models - model containerization and acceleration. One of the key technologies that ties our containerization and acceleration together is NVIDIA Triton Inference Server.

Chris Hoge

Chris Hoge

Jan 13, 2022

Chris Hoge

Chris Hoge

Jan 13, 2022

TVMCon 2021 Wrapup

The Apache TVM Community and OctoML closed out 2021 with the fourth annual Apache TVM and Open Source ML Acceleration Conference. It was the TVM community’s largest event ever, with 700 attendees from 34 nations coming together for a virtual conference...

Jared Roesch

T

Dec 16, 2021

Jared Roesch

T

Dec 16, 2021

Write Python with blazing fast CUDA-level performance

By using TVMScript, TVM's embedded domain specific language (DSL), OctoML engineers are able to demonstrate a 20x speedup over a straightforward PyTorch implementation on CPU, and a 1.3x speedup over handwritten CUDA implementation on GPU for a real-world kernel.

Byungsoo Jeon
Sunghyun Park

Dec 15, 2021

Byungsoo Jeon
Sunghyun Park

Dec 15, 2021

Collage: Automated integration of various deep learning backends results in state of the art model performance

At TVMCon this week, we will be presenting our latest research from Carnegie Mellon University and University of Michigan for generating the fastest possible executable for a given machine learning model by using Collage.

1

...

Accelerate Your AI Innovation