Hub

API docs

Join the Hugging Face community

and get access to the augmented documentation experience

Collaborate on models, datasets and Spaces

Faster examples with accelerated inference

Switch between documentation themes

to get started

Pricing and Billing

Hugging Face Jobs let you run compute tasks on Hugging Face infrastructure without managing it yourself. Simply define a command, a Docker image, and a hardware flavor among various CPU and GPU options.

Jobs are available to PRO users and Team or Enterprise organizations.

Billing on Jobs is based on hardware usage and is computed by the minute: you get charged for every minute the Jobs runs on the requested hardware.

During a Job’s lifecycle, it is only billed when the Job is Starting or Running. This means that there is no cost during build.

If a running Job starts to fail, it will be automatically suspended and the billing will stop.

Pricing

Jobs are billed per minute based on the hardware used. Below are the available hardware options and their pricing.

CPU

Hardware	CPU	Memory	Hourly Price
CPU Basic	2 vCPU	16 GB	Free
CPU Upgrade	8 vCPU	32 GB	$0.03

GPU

Hardware	CPU	Memory	GPU Memory	Hourly Price
Nvidia T4 - small	4 vCPU	15 GB	16 GB	$0.40
Nvidia T4 - medium	8 vCPU	30 GB	16 GB	$0.60
1x Nvidia L4	8 vCPU	30 GB	24 GB	$0.80
4x Nvidia L4	48 vCPU	186 GB	96 GB	$3.80
1x Nvidia L40S	8 vCPU	62 GB	48 GB	$1.80
4x Nvidia L40S	48 vCPU	382 GB	192 GB	$8.30
8x Nvidia L40S	192 vCPU	1534 GB	384 GB	$23.50
Nvidia A10G - small	4 vCPU	15 GB	24 GB	$1.00
Nvidia A10G - large	12 vCPU	46 GB	24 GB	$1.50
2x Nvidia A10G - large	24 vCPU	92 GB	48 GB	$3.00
4x Nvidia A10G - large	48 vCPU	184 GB	96 GB	$5.00
Nvidia A100 - large	12 vCPU	142 GB	80 GB	$2.50

You can also retrieve available hardware and pricing programmatically via the API at GET /api/jobs/hardware.

Manage billing

Bill to your organization

Billing is done to the user’s namespace by default, but you can bill to your organization instead by specifying the right namespace:

hf jobs run --namespace my-org-name ...

In this case the Job runs under the organization account, and you can see it in your organization Jobs page (organization page > settings > Jobs).

View current compute usage

You can look at your current billing information for Jobs in in your Billing page, under the “Compute Usage” section:

Additional information about billing can be found in the dedicated Hub documentation.

Recommendations

Set timeout limits

Set a timeout when creating the Job to ensure it can’t run beyond a certain duration. A Job run that reaches the timeout duration is automatically stopped, and so is its billing. Here is how to set a timeout with the CLI:

hf jobs run --timeout 3h ...

Note that the default timeout is set to 30 minutes. You must therefore specify a longer timeout if your Job requires more time to run.

Cancel irrelevant Jobs

If a running Job is no longer relevant, you can cancel it prematurely to stop its billing, either via the Job page or the CLI:

hf jobs cancel <job-id>

Update on GitHub

←Quickstart Manage Jobs→