A CLI tool for managing HuggingFace model deployments across multiple cloud providers.
HF-Cloud can be used in two equivalent ways:

- Standalone CLI install: commands are prefixed with `hf-cloud ...`
- Hugging Face Hub CLI extension install: commands are prefixed with `hf cloud ...`

Preferred method: install and run it as an HF CLI extension (`hf cloud ...`). More details about HF CLI extensions here.

In the examples below, replace `<cli>` with either `hf-cloud` or `hf cloud`.
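Because the two install modes differ only in the command prefix, scripts can abstract the prefix away. A minimal sketch in Python (the `cli` helper below is hypothetical, not part of hf-cloud):

```python
import subprocess

def cli(*args: str, use_extension: bool = True) -> list:
    """Build an hf-cloud argv with either command prefix."""
    prefix = ["hf", "cloud"] if use_extension else ["hf-cloud"]
    return [*prefix, *args]

cmd = cli("sagemaker", "status", "my-gpt2-endpoint")
# cmd == ["hf", "cloud", "sagemaker", "status", "my-gpt2-endpoint"]
# subprocess.run(cmd, check=True)  # uncomment once hf-cloud is installed
```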
This project is packaged as a Python `hf` CLI extension. It can be discovered and installed with the following commands:

```bash
# Search for available extensions
hf extensions search

# Install this extension from GitHub
hf extensions install ehcalabres/hf-cloud

# Run extension commands
hf cloud --help
hf cloud sagemaker ls
```

Alternatively, install the standalone CLI from source:

```bash
# Clone the repository
git clone https://github.com/ehcalabres/hf-cloud.git
cd hf-cloud

# Install base CLI
pip install -e .

# Install with SageMaker support
pip install -e ".[sagemaker]"

# Install with Vertex AI support
pip install -e ".[vertex]"

# Install with all providers
pip install -e ".[all]"

# Install with development dependencies
pip install -e ".[dev]"
```

HF-Cloud can scaffold a reusable SKILL.md for coding agents.
```bash
# Preview generated skill content
<cli> skills preview

# Install skill in the project-level central directory (.agents/skills/hf-cloud)
<cli> skills add

# Install to a custom destination directory
<cli> skills add --dest ./my-skills

# Link into specific assistant directories
<cli> skills add --codex
<cli> skills add --claude --cursor

# Overwrite existing skill files/symlinks
<cli> skills add --force
```

You can configure default settings for your cloud provider. This step is optional: you can also pass these settings as command-line arguments during deployment, or rely on the default configuration from your environment.
```bash
<cli> providers configure sagemaker
```

Deploy a model:

```bash
<cli> sagemaker deploy gpt2 \
  --name my-gpt2-endpoint \
  --instance-type ml.g5.xlarge \
  --region us-east-1
```

Check the deployment status:

```bash
<cli> sagemaker status my-gpt2-endpoint
```

Test inference:

```bash
<cli> sagemaker invoke my-gpt2-endpoint \
  --input "Once upon a time"
```

List deployments across all providers:

```bash
<cli> ls
```

Estimate the minimum viable instance for a model:

```bash
<cli> sagemaker estimate meta-llama/Llama-2-7b-hf
```
The same estimate is available for Vertex AI:

```bash
<cli> vertex estimate meta-llama/Llama-2-7b-hf
```

Delete a deployment:

```bash
<cli> sagemaker delete my-gpt2-endpoint
```

| Command | Description |
|---|---|
| `<cli> [PROVIDER] deploy <model>` | Deploy a model to the specified provider |
| `<cli> [PROVIDER] ls` | List deployments for the specified provider |
| `<cli> [PROVIDER] describe <id>` | Show deployment details |
| `<cli> [PROVIDER] status <id>` | Check deployment status |
| `<cli> [PROVIDER] logs <id>` | View deployment logs |
| `<cli> [PROVIDER] invoke <id>` | Test inference |
| `<cli> [PROVIDER] estimate <model>` | Estimate minimum viable instance for a model |
| `<cli> [PROVIDER] delete <id>` | Delete deployment |
| `<cli> ls` | List all deployments (all providers) |
| `<cli> providers ls` | List available providers |
| `<cli> providers configure <provider>` | Configure provider credentials |
| `<cli> skills preview` | Preview generated SKILL.md for AI assistants |
| `<cli> skills add` | Install SKILL.md and optionally symlink to assistant folders |
Provider support status:

- AWS SageMaker - Fully implemented
- Google Cloud Vertex AI - Fully implemented
- Azure ML - Work in progress
```bash
# Configure (optional)
<cli> providers configure sagemaker

# Deploy
<cli> sagemaker deploy gpt2 \
  --name my-gpt2-endpoint \
  --instance-type ml.g5.xlarge \
  --region us-east-1

# Check status
<cli> sagemaker status my-gpt2-endpoint

# Invoke
<cli> sagemaker invoke my-gpt2-endpoint --input "Hello world"

# Estimate minimum viable instance
<cli> sagemaker estimate meta-llama/Llama-2-7b-hf

# Show all compatible instances
<cli> sagemaker estimate meta-llama/Llama-2-7b-hf --all

# Delete
<cli> sagemaker delete my-gpt2-endpoint
```
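The idea behind the `estimate` commands can be sketched in Python. Note this is an illustrative assumption: both the memory formula and the instance table below are simplified stand-ins, not hf-cloud's actual algorithm or a complete instance catalog.

```python
from typing import Optional

GIB = 1024 ** 3

def estimated_memory_gib(n_params: int, bytes_per_param: int = 2,
                         overhead: float = 1.2) -> float:
    """fp16 weights plus ~20% headroom for activations and KV cache."""
    return n_params * bytes_per_param * overhead / GIB

# Simplified GPU-memory table (GiB) for a few SageMaker instance types.
INSTANCE_GPU_GIB = {
    "ml.g5.xlarge": 24,      # 1x NVIDIA A10G
    "ml.g5.12xlarge": 96,    # 4x NVIDIA A10G
    "ml.p4d.24xlarge": 320,  # 8x NVIDIA A100 40GB
}

def minimum_viable(n_params: int) -> Optional[str]:
    """Smallest listed instance whose GPU memory fits the estimate."""
    need = estimated_memory_gib(n_params)
    for name, mem in sorted(INSTANCE_GPU_GIB.items(), key=lambda kv: kv[1]):
        if mem >= need:
            return name
    return None

print(minimum_viable(7_000_000_000))  # a 7B model fits on ml.g5.xlarge (24 GiB)
```

Under these assumptions a 7B model needs roughly 15.6 GiB, so the smallest single-GPU instance suffices, while a 70B model would only fit the multi-GPU option.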
```bash
# Configure (optional)
<cli> providers configure vertex

# Deploy
<cli> vertex deploy gpt2 \
  --name my-gpt2-endpoint \
  --machine-type n1-standard-4 \
  --location us-central1

# Deploy with GPU
<cli> vertex deploy meta-llama/Llama-2-7b-hf \
  --name my-llama-endpoint \
  --machine-type n1-standard-8 \
  --accelerator-type NVIDIA_TESLA_T4 \
  --accelerator-count 1

# Check status
<cli> vertex status my-gpt2-endpoint

# Invoke
<cli> vertex invoke my-gpt2-endpoint --input "Hello world"

# Estimate minimum viable machine/GPU configuration
<cli> vertex estimate meta-llama/Llama-2-7b-hf

# JSON output for scripting
<cli> vertex estimate meta-llama/Llama-2-7b-hf --json

# Delete
<cli> vertex delete my-gpt2-endpoint
```

- Python 3.9+
You will also need authentication set up for each cloud provider you use, so that hf-cloud can access your account. Current provider-specific requirements:
- AWS SageMaker: AWS credentials configured via AWS CLI or environment variables.
- Azure ML: Azure CLI logged in or service principal set up.
- Google Cloud Vertex AI: Google Cloud SDK authenticated or service account key set up.
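One quick way to see which provider credentials your environment already exposes is to check the standard environment variables each vendor SDK reads. The provider-to-variable mapping below is an illustrative assumption, not an hf-cloud API:

```python
import os

# Standard vendor SDK credential variables (provider labels are illustrative).
PROVIDER_ENV = {
    "sagemaker": ("AWS_ACCESS_KEY_ID", "AWS_PROFILE"),
    "vertex": ("GOOGLE_APPLICATION_CREDENTIALS",),
    "azure": ("AZURE_CLIENT_ID",),
}

def configured_providers(env=None):
    """Return providers for which at least one credential variable is set."""
    env = os.environ if env is None else env
    return [p for p, keys in PROVIDER_ENV.items() if any(k in env for k in keys)]

print(configured_providers())  # e.g. ['sagemaker'] if AWS credentials are exported
```

Note that credentials can also come from files (`~/.aws/credentials`, `gcloud` application-default login), so an empty result here does not necessarily mean authentication is missing.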
```bash
# Install dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Run linting
ruff check src/
```