Inferless
True Serverless
Free Tier: 30 USD
Deployment: Inferless CLI
Queues: Supported
Webhooks: Not supported
Serverless GPU Prices
Processor | GPU RAM | RAM | vCPUs | Price |
---|---|---|---|---|
T40 | 2 | 2 | 0.4 | 0.000023/s 0.0828/h |
T40 | 2 | 2 | 0.4 | 0.000026/s 0.0936/h |
T4 (shared) | 1 | 2 | 3 | 0.000092/s 0.3312/h |
T4 (shared) | 2 | 3 | 0.5 | 0.000092/s 0.3312/h |
T4 (shared) | 3 | 10 | 1.5 | 0.000092/s 0.3312/h |
T4 (shared) | 3 | 10 | 1.5 | 0.000092/s 0.3312/h |
A100 | 7 | 30 | 7 | 0.0001/s 0.36/h |
A1000 | 9 | 22 | 2 | 0.000168/s 0.6048/h |
A10 (shared) | 3 | 12 | 3 | 0.00017/s 0.612/h |
A10 (shared) | 12 | 15 | 3 | 0.00017/s 0.612/h |
A10 (shared) | 12 | 15 | 3 | 0.00017/s 0.612/h |
A10 (shared) | 12 | 15 | 3 | 0.00017/s 0.612/h |
T40 | 6 | 20 | 3 | 0.000185/s 0.666/h |
T40 | 6 | 20 | 3 | 0.000185/s 0.666/h |
16 | 20 | 3 | 0.000185/s 0.666/h | |
T40 | 16 | 20 | 3 | 0.000185/s 0.666/h |
A1000 | 10 | 25 | 2 | 0.000188/s 0.6768/h |
T4 (shared) | 1 | 2 | 3 | 0.00019/s 0.684/h |
A1000 | 11 | 28 | 2 | 0.000213/s 0.7668/h |
A1000 | 12 | 31 | 3 | 0.000233/s 0.8388/h |
A100 | 7 | 30 | 7 | 0.000341/s 1.2276/h |
24 | 30 | 7 | 0.000341/s 1.2276/h | |
A100 | 24 | 30 | 7 | 0.000341/s 1.2276/h |
A100 | 24 | 30 | 7 | 0.000341/s 1.2276/h |
A10 (shared) | 2 | 30 | 7 | 0.00036/s 1.296/h |
A100 | 2 | 3 | 0 | 0.00038/s 1.368/h |
A100 | 3 | 3 | 0 | 0.00044/s 1.584/h |
A100 | 3 | 4 | 0 | 0.00048/s 1.728/h |
T4 (shared) | 4 | 5 | 3 | 0.00048/s 1.728/h |
A100 | 3 | 4 | 0 | 0.00053/s 1.908/h |
T4 (shared) | 4 | 5 | 3 | 0.00054/s 1.944/h |
A100 (shared) | 40 | 100 | 10 | 0.000745/s 2.682/h |
A100 (shared) | 40 | 100 | 10 | 0.000745/s 2.682/h |
A100 (shared) | 40 | 100 | 10 | 0.000745/s 2.682/h |
A100 (shared) | 40 | 100 | 10 | 0.000745/s 2.682/h |
A10 (shared) | 6 | 30 | 7 | 0.00088/s 3.168/h |
80 | 200 | 20 | 0.001491/s 5.3676/h | |
A1000 | 80 | 200 | 20 | 0.001491/s 5.3676/h |
A1000 | 80 | 200 | 20 | 0.001491/s 5.3676/h |
A1000 | 80 | 200 | 20 | 0.001491/s 5.3676/h |
A100 (shared) | 8 | 21 | 2 | 0.00159/s 5.724/h |
A100 (shared) | 20 | 52 | 5 | 0.00387/s 13.932/h |
A1000 | 23 | 58 | 5 | 0.00437/s 15.732/h |
Other GPU Providers
Replicate
True Serverless
Free Tier: Unspecified
Modal
True Serverless
Free Tier: 30 USD per month
Runpod
True Serverless
Free Tier: Unknown
Beam.cloud
True Serverless
Free Tier: 10 hours
Baseten.co
Free Tier: 30 USD
Covalent.xyz
Free Tier: Unknown
Amazon Web Services
Free Tier: Unknown
Google Cloud
Free Tier: Unknown
Microsoft Azure
Free Tier: Unknown
Jarvis Labs
Free Tier: Unknown
Paperspace
Free Tier: Unknown
CoreWeave
Free Tier: Unknown
Lambda Labs
Free Tier: Unknown
Fluid Stack
Free Tier: Unknown
Novo Nimbus
Free Tier: Unknown
Latitude.sh
Free Tier: Unknown
Cr8dl.ai
Free Tier: Unknown
Datacrunch
Free Tier: Unknown
Exoscale
Free Tier: Unknown
OVH Cloud
Free Tier: 200 USD
Oblivus Cloud
Free Tier: Unknown
Oracle Cloud
Free Tier: Unknown
Cudo Compute
Free Tier: Unknown
Vultr
Free Tier: 100 USD
Fal.ai
True Serverless
Free Tier: Unknown