Chroma MCP
by chroma-core
Open-source embedding database for LLM apps — persistent and in-memory modes.
Category
Specialized AI tooling servers for model workflows, research, and applied ML tasks.
56 servers in this category
by elevenlabs
Official ElevenLabs MCP server for AI-powered text-to-speech, voice cloning, and audio generation.
by BerriAI
Unified LLM gateway supporting 100+ providers with a single OpenAI-compatible interface.
by 21st-dev
A curated MCP toolkit by 21st.dev for rapid product and workflow automation.
by replicate
Run thousands of open-source AI models (image, video, audio, text) via Replicate's API.
by anthropic
Use Claude models programmatically as tools within your MCP workflow.
by AssemblyAI
Transcribe and analyze audio/video with AssemblyAI's speech intelligence platform.
by awslabs
Query Amazon Bedrock Knowledge Bases using natural language to retrieve relevant information.
by modelcontextprotocol
Query AWS Bedrock Knowledge Bases for RAG-based document retrieval using natural language from your AI assistant.
by aws
Analyze images and videos with AWS Rekognition for object detection, faces, text, and content moderation.
by cerebras
Ultra-fast inference on Cerebras Wafer-Scale Engine chips for LLMs.
by clarifai
Build and deploy visual AI models for image classification, detection, and segmentation via Clarifai.
by community
Generate text embeddings, rerank search results, and run RAG workflows using Cohere's embedding models via MCP.
by cohere-ai
Access Cohere's language models, embeddings, and rerank APIs from any MCP-compatible client.
by coze-dev
Orchestrate Coze's AI bots, workflows, and knowledge bases via the Coze API.
by datarobot
Automate ML model building, deployment, and monitoring with DataRobot's AutoML platform.
by deepgram
Real-time and batch speech recognition powered by Deepgram's Nova-2 models.
by deepl
Translate text and documents into 30+ languages with DeepL's AI-powered translation engine.
by deepseek-ai
Access DeepSeek reasoning and chat models via MCP for cost-effective AI inference.
by langgenius
Build and invoke LLM apps, chatbots, and agents in Dify's open-source LLM platform.
by modelcontextprotocol
Generate images with a consistent brand style from your AI assistant via EverArt's model fine-tuning platform.
by exa-labs
Embedding-based neural search: find the most relevant web content for any query.
by fw-ai
Fastest open-source model inference with function calling on Fireworks AI.
by FlowiseAI
Invoke Flowise AI flows and chatflows with custom inputs via the Prediction API.
by groq-ai
Ultra-fast LLM inference via Groq's LPU hardware, exposed as an MCP tool.
by Helicone
Monitor, cache, and rate-limit LLM API calls with Helicone's observability proxy.
by huggingface
Access and run Hugging Face models and Inference API endpoints via MCP.
by heartex
Manage annotation projects, tasks, and labels in Label Studio for ML training data.
by langfuse
Collaborate on, version, evaluate, and release prompts with Langfuse prompt management.
by langchain-ai
Trace, debug, and evaluate LLM application runs via LangSmith's observability platform.
by run-llama
Build data-aware AI agents and RAG pipelines using the LlamaIndex framework via MCP.
by lmstudio
Run local large language models and expose them via an OpenAI-compatible API with LM Studio.
by mapbox
Access geospatial intelligence via Mapbox APIs: geocoding, POI search, directions, and isochrones.
by mistralai
Connect to Mistral AI models (Mistral Large, Mixtral) for chat, completion, and embeddings.
by mlflow
Manage ML lifecycle with MLflow: log runs, register models, and serve predictions.
by modal
Deploy and run AI inference functions on Modal's serverless GPU cloud infrastructure.
by nvidia
Run NVIDIA NIM microservices for optimized LLM, vision, and speech inference.
by patruff
Run local LLMs via Ollama and expose them as MCP tools for offline AI inference.
by openai
Use OpenAI Agents SDK tools — web search, file search, computer use, and code interpreter — exposed as MCP endpoints.
by openai
Access GPT models, Assistants API, and vector stores via MCP-native interfaces.
by jtrokel
Query OpenStreetMap for geocoding, POIs, routes, and geographic data.
by openweathermap
Fetch current weather, forecasts, and historical climate data via OpenWeather API.
by alexei-led
Prompt optimization server that improves quality and consistency of AI requests.
by roboflow
Train, deploy, and query computer vision models using Roboflow's dataset and inference APIs.
by runwayml
Generate and edit videos with AI using Runway's Gen-3 and other video models.
by scale-ai
Submit and manage AI training data tasks through Scale AI's data annotation platform.
by modelcontextprotocol
Provides structured reasoning helpers for stepwise planning in MCP workflows.
by stability-ai
Generate and edit images using Stable Diffusion models via Stability AI's REST API.
by togethercomputer
Run open-source LLMs (Llama, Mistral, Falcon) at scale via Together AI's cloud API.
by unstructured-io
Extract and parse text from PDFs, Office documents, HTML, and images for LLM pipelines.
by vectara
Query Vectara's enterprise RAG platform — hybrid search, grounded generation, and hallucination detection via MCP.
by vllm-project
High-throughput open-source LLM serving with PagedAttention, exposed via MCP.
by voyageai
Generate high-quality embeddings optimized for retrieval using Voyage AI's specialized embedding models via MCP.
by wandb
Track ML experiments, visualize runs, and query model artifacts from W&B.
by openai
Transcribe audio files and streams with OpenAI Whisper's state-of-the-art speech recognition.
by youcom
AI search and coding assistant API with chat, search, and code interfaces from You.com.
AI & ML Tools MCP servers help teams expose high-value tools to assistants while keeping boundaries explicit and auditable. On EveryMCP, listings in this category are organized so you can compare servers by repository quality, use cases, and deployment fit.