Vinkius

Together AI MCP. Power Multi-Modal AI with Open Source Models

Together AI connects your AI agent to over 100 open-source models, giving you a unified platform for everything from text chat and image creation to audio transcription and model fine-tuning. It powers advanced generative AI applications without requiring you to manage any cloud infrastructure.

Together AI MCP is compatible with Claude Claude
Together AI MCP is compatible with ChatGPT ChatGPT
Together AI MCP is compatible with Cursor Cursor
Together AI MCP is compatible with Gemini Gemini
Together AI MCP is compatible with Windsurf Windsurf
Together AI MCP is compatible with VS Code VS Code
Together AI MCP is compatible with JetBrains JetBrains
Together AI MCP is compatible with Vercel Vercel
See Vinkius in Action

Give Claude and any AI agent real-world access

Generate Text and Chat Responses

Your agent can generate high-quality text responses for conversations using various open-source models.

Create Visual Media

The MCP handles generating realistic images or full videos based on simple text prompts.

Process Audio Files

You can convert spoken words into written transcripts, or turn plain text into natural-sounding speech for voiceovers.

Build Knowledge Retrieval Systems

It generates vector embeddings from documents and reranks results so your agent finds the most relevant information quickly.

Manage Model Training

You can run fine-tuning jobs, upload data files, and manage dedicated endpoints for reliable performance.

Waiting for input…

AI Agent
Together AI

What AI agents can do with Together AI: A Powerful Toolset With 27 Tools

These tools let you manage model lifecycle, generate media, process voice and text data, and run large background jobs all through one connection.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Together AI MCP

Create Audio Speech

This tool generates speech from plain text, creating voiceovers for your content.

Create Audio Transcription

It converts an uploaded audio file into a written transcript using speech-to-text...

Cancel Batch

You can stop any large, running background processing job immediately.

Create Chat Completion

This tool generates model responses by simulating a full back-and-forth chat...

Create Batch

It starts a new, large-scale asynchronous job that runs in the background over time.

Create Endpoint

You can set up a dedicated connection point to ensure your model performance never drops or slows down.

Create Fine Tune

This initiates the process of training an open-source model on your specific, proprietary dataset.

Delete Endpoint

It removes a dedicated connection point you previously set up for performance...

Delete File

This permanently deletes an uploaded file used for training or batch processing.

Delete Fine Tune

You can cancel a fine-tuning job that you started and no longer need.

Create Embeddings

It takes any block of text and converts it into numerical vector embeddings for...

Get Batch

You can check the current status and results of a specific background job.

Get Endpoint

This retrieves all the details about a dedicated model endpoint you created.

Get File

It fetches metadata and information about an uploaded file without needing to...

Get Fine Tune

You get the current status and progress report for a specific fine-tuning job.

Create Image Generation

This tool generates brand new images based on detailed text descriptions or prompts.

List Batches

You see a list of all background jobs that have been created using the system.

List Endpoints

It lists every dedicated model endpoint currently running or configured for your account.

List Files

You get a list of all data files you've uploaded to the system.

List Fine Tune Checkpoints

This lists saved versions, or checkpoints, for a fine-tuning job so you can revert...

List Fine Tunes

It gives you an overview of all the fine-tuning jobs that have been run previously.

List Models

You can see a list of every model available for use through this MCP connection.

Create Rerank

This tool reorders documents based on how relevant they are to the user's specific...

Create Text Completion

It generates extended text content for a simple prompt, ideal for articles or summaries.

Update Endpoint

You can change the status—like scaling up or down—of an existing dedicated model endpoint.

Upload File

It securely uploads a file for use in fine-tuning, evaluation, or batch processing...

Create Video Generation

This tool creates entire videos from text prompts or by animating an existing image.

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Together AI MCP is compatible with Claude

Claude AI

1

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

2

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

3

Start a conversation

Open a new chat. The Together AI integration is available immediately — no restart needed.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

  • Import from OpenAPI, Swagger, or YAML specs
  • Create Agent Skills with progressive disclosure
  • Deploy to edge with MCPFusion framework
  • Built in DLP, auth, and compliance on each call
  • Real time usage dashboard and cost metering
  • Publish to catalog or keep private
Start building

Make Your AI Do More

Start with Together AI, then connect any of our 5,200+ other servers whenever your AI needs more. One click, no limits.

  • Use this MCP plus 5,200+ others, all in one place
  • Add new capabilities to your AI anytime you want
  • Connections are secured and governed automatically
  • Track usage and costs across all your servers
  • Works with Claude, ChatGPT, Cursor, and more
  • New servers added to the catalog weekly
Together AI MCP server cover

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Together AI. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS CLOUD

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on each call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

The headache of piecing together AI features manually.

Today, building a single feature that needs to do three things—like reading an audio file, summarizing it, and then generating promotional art—is a nightmare. You're jumping between the transcription tool, the chat API, and the image generation platform. You copy text from one dashboard into another service, manage keys for multiple providers, and spend hours just stitching the workflow together.

With this MCP, your agent handles the whole sequence inside one connection point. It takes the audio input, runs `create_audio_transcription`, passes that output to generate a summary via chat completion, and finally feeds keywords into `create_image_generation`. You get a fully functional feature without ever leaving your client.

Generating Media with Dedicated Model Operations

The biggest manual step that disappears is the juggling act between different model APIs. You used to have separate documentation and setup steps just for generating an image versus generating a video, forcing you into complex multi-step code blocks.

Now, if your workflow needs visual content, whether it's basic text prompts or full motion video, you call `create_image_generation` or `create_video_generation`. The whole process is contained and controllable from one place.

What Together AI MCP does for your AI

You can connect this MCP to your agent to access the world's fastest inference cloud for open-source models. This connector gives you a complete toolkit for generative AI, handling everything from basic text chat and creating stunning images to processing audio files or training custom model checkpoints. Need to build complex search features? You generate vector embeddings and rerank documents using specialized tools.

Plus, if your application needs constant performance, you can create dedicated endpoints with predictable scaling. Whether you're building an app that talks, draws pictures, or analyzes voice recordings, this MCP keeps all the power running through a single connection point via Vinkius.

Built · Hosted · Managed by Vinkius Together AI MCP - Open Source Model Access
Server ID 019e38fc-f902-730c-94c9-64868c3fd057
Vinkius Inspector
Compliance Grade A+
Score 100/100
Vinkius Inspector Badge — Score 100/100

Frequently asked questions about Together AI MCP

How do I use the Together AI MCP for document search? +

You run this by first calling create_embeddings on your documents to turn them into vectors. Then, when a user asks a question, you use create_rerank to find the most relevant chunks of text from those stored embeddings.

Can I make my AI model better using this MCP? +

Yes. You manage custom training jobs by calling upload_file and then initiating a job with create_fine_tune. This allows you to teach the open-source models your company's specific jargon.

What is the difference between `create_chat_completion` and `create_text_completion`? +

Use create_chat_completion when you need the model to remember context from a conversation history. Use create_text_completion for single, self-contained text generation tasks like writing an article summary.

Does this MCP help with large data uploads? +

It handles massive jobs using the batch tools. You start a job via create_batch, and then you monitor its progress and retrieve results later using get_batch.

How do I ensure my model stays fast for production? +

You use create_endpoint. This tool establishes a dedicated, stable connection point that isolates your usage from general traffic fluctuations, guaranteeing reliable performance.