VESSL AI — Control plane for machine learning and computing

In the Get-started section, we’ll walk you through the best practice for using VESSL Run.

VESSL AI provides a unified interface for training and deploying AI models on the cloud. Simply define your GPU resource and pinpoint to your code & dataset. VESSL AI does the orchestration & heavy lifting for you:

  1. Create a GPU-accelerated container with the right Docker Image.
  2. Mount your code and dataset from GitHub, Hugging Face, Amazon S3, and more.
  3. Launches the workload on our fully managed GPU cloud.

One any cloud, at any scale

Instantly scale workloads across multiple clouds.


Streamlined interface

Launch any AI workloads with a unified YAML definition.


End-to-end coverage

A single platform for fine-tuning to deployment.


A centralized compute platform

Optimize GPU usage and save up to 80% in cloud.


What can you do?

  • Run compute-intensive AI workloads remotely within seconds.
  • Fine-tune LLMs with distributed training and auto-failover with zero-to-minimum setup.
  • Scale training and inference workloads horizontally.
  • Deploy an interactive web applicaiton for your model.
  • Serve your AI models as web endpoints.

How to get started

Head over to VESSL AI and sign up for a free account. No docker build or kubectl get.

  1. Create your account at VESSL AI and get $15 in free GPU credits.
  2. Install our Python package — pip install vessl.
  3. Follow our Quickstart guide or try out our example models at VESSL Hub.

How does it work?

VESSL AI abstracts the obscure infrastructure and complex backends inherent to launching AI workloads into a simple YAML file, so you don’t have to mess with AWS, Kubernetes, Docker, or more. Here’s an example that launches a web app for Stable Diffusion.

resources:
  cluster: vessl-gcp-oregon
  preset: gpu-l4-small
image: quay.io/vessl-ai/torch:2.3.1-cuda12.1-r5
import:
  /code/:
    git:
      url: https://github.com/vessl-ai/hub-model
      ref: main
  /model/: hf://huggingface.co/VESSL/SSD-1B
run:
  - command: |-
      pip install -r requirements.txt
      streamlit run ssd_1b_streamlit.py --server.port=80
    workdir: /code/SSD-1B
interactive:
  max_runtime: 24h
  jupyter:
    idle_timeout: 120m
ports:
  - name: streamlit
    type: http
    port: 80

With every YAML file, you are creating a VESSL Run. VESSL Run is an atomic unit of VESSL AI, a single unit of Kubernetes-backed AI workload. You can use our YAML definition as you progress throughout the AI lifecycle from checkpointing models for fine-tuning to exposing ports for inference.

What’s next?

See VESSL AI in action with our examples Runs and pre-configured open-source models.