Deploy Portkey Gateway to Koyeb to Streamline Requests to 200+ LLMs
Learn how to deploy Portkey Gateway, a request and prompt router for LLMs with a unified API, and build an application that can query more than one LLM easily.
Discover how to build, deploy and run applications in production on Koyeb. The fastest way to deploy applications globally.
Learn how to deploy Portkey Gateway, a request and prompt router for LLMs with a unified API, and build an application that can query more than one LLM easily.
Learn how to use OpenAI Whisper to build an app to generate transcription of podcast audio files in real-time.
This guide shows how to use Continue with Ollama, a self-hosted AI solution to run the Mistral Codestral model on Koyeb GPUs
Learn how to set up a vLLM Instance to run inference workloads and host your own OpenAI-compatible API on Koyeb.
Learn how to use Groq, speech-to-text (STT), and text-to-speech (TTS) to build an app to automatically translate between languages in real-time.
This tutorial walks through how to build a multimodal vision chat app powered by LLaVa, Chainlit, and Replicate.
This tutorial walks through how to build a chatbot powered by MistralAI, with FastAPI as the backend and FastUI as the front end.
Learn step-by-step how to set up and utilize AutoGen within Chainlit. You'll discover how to create and interact with AI personas that are tailored to your specific needs, be it scriptwriting for YouTube content or ideating SaaS products.
In this tutorial, we showcase how to deploy a FAQ search service built with Hugging Face's Inference API, pgvector, Koyeb's Managed Postgres. The optimized FAQ Search leverages sentence similarity searching to provide the most relevant results to a user's search terms.
This guide explains how to build a YouTube video summarization using Langchain, Deepgram, and Mistral 7B. Deploy your AI workload on Koyeb to enjoy high-performance microVMs, seamless scaling, and fast global deployments.