All Serving Listings
vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs.
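As a rough illustration of what a serving engine like vLLM exposes, here is a minimal sketch of its offline batch API; the model name is a placeholder, and real deployments would more typically run vLLM's OpenAI-compatible server instead.

```python
from vllm import LLM, SamplingParams

# Minimal offline-inference sketch; "facebook/opt-125m" is just a small
# placeholder checkpoint, not something this listing prescribes.
llm = LLM(model="facebook/opt-125m")
params = SamplingParams(temperature=0.8, max_tokens=32)

outputs = llm.generate(["The capital of France is"], params)
print(outputs[0].outputs[0].text)
```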
Portkey AI Gateway: A blazing-fast AI gateway with integrated guardrails. Routes to 200+ LLMs and 50+ AI guardrails through one fast, friendly API.
Text Embeddings Inference: A blazing-fast inference solution for text embedding models.
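Since Text Embeddings Inference is consumed over HTTP, a sketch like the following shows the typical client side; the host and port are assumptions about the deployment, while `/embed` is TEI's documented embedding route.

```python
import requests

# Query a running TEI server; localhost:8080 is an assumed deployment.
resp = requests.post(
    "http://localhost:8080/embed",
    json={"inputs": "Deep learning is"},
)
resp.raise_for_status()
embedding = resp.json()[0]  # one vector per input string
print(f"{len(embedding)}-dimensional embedding")
```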
OpenLLM: Run any open-source LLM, such as Llama or Mistral, as an OpenAI-compatible API endpoint in the cloud.
Text Generation Inference (TGI): Hugging Face's server for large language model text generation.
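TGI is likewise queried over HTTP. The sketch below assumes a server is already listening on localhost:8080 and uses TGI's documented `/generate` route.

```python
import requests

# Assumes a TGI container is already serving a model on localhost:8080.
resp = requests.post(
    "http://localhost:8080/generate",
    json={
        "inputs": "What is deep learning?",
        "parameters": {"max_new_tokens": 32},
    },
)
resp.raise_for_status()
print(resp.json()["generated_text"])
```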
Xinference: Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference lets you run inference with any open-source language, speech recognition, or multimodal model, whether in the cloud, on-premises, or on your laptop.
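The "single line" in Xinference's pitch is the client's base URL: servers like Xinference, OpenLLM, and vLLM's API server all speak the OpenAI protocol, so the stock openai client can be pointed at them. The port and model name below are placeholder assumptions (9997 is Xinference's usual default).

```python
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:9997/v1",  # the single changed line
    api_key="not-needed-locally",         # local servers ignore the key
)
resp = client.chat.completions.create(
    model="my-deployed-model",  # placeholder: whatever model you launched
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)
```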
LMDeploy: A toolkit for compressing, deploying, and serving LLMs.
OpenVINO™: An open-source toolkit for optimizing and deploying AI inference.
GPTCache: A semantic cache for LLMs, fully integrated with LangChain and llama_index.
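As a sketch of how a semantic cache sits in front of an LLM call, GPTCache's quickstart wraps the legacy 0.x-style OpenAI interface with a drop-in adapter. Note that a bare `cache.init()` gives exact-match caching; semantic matching requires configuring an embedding model and similarity evaluation in `init`.

```python
from gptcache import cache
from gptcache.adapter import openai  # drop-in adapter for the legacy API

cache.init()             # default: exact-match cache; pass embedding +
                         # similarity settings here for semantic matching
cache.set_openai_key()   # reads OPENAI_API_KEY from the environment

answer = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "what is GitHub?"}],
)
print(answer["choices"][0]["message"]["content"])
```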