SenseCore Platform MCP. Control model runs, compute resources, and completions.
Works with every AI agent you already use
…and any MCP-compatible client
Just plug in your AI agents and start using Vinkius.
SenseCore Platform provides 10 tools for managing industrial-grade AI infrastructure via MCP. Use it to run chat completions with SenseTime models, list and inspect available foundation models, monitor compute node availability, and track long-running inference task status directly from your agent.
What your AI agents can do
Chat completions
Sends a message through the SenseCore large language model engine.
Create assistant
Sets up and defines a new, dedicated AI assistant within your project.
Create message
Adds content to an existing conversation thread for historical context or continuation.
Initiate chat completions using specific SenseTime foundation models while maintaining thread history and context.
Define, list, and interact with dedicated AI assistants to structure multi-step or guided conversations.
Retrieve a complete catalog of foundation models, including technical details and versioning information.
Check the health metrics, latency (P99), and overall uptime for specific model services or GPU clusters.
List and check the status of complex, long-running tasks like training runs or large inference batches.
Ask AI about this MCP
Supported MCP Clients
Waiting for input…
SenseCore Platform MCP Server: 11 Tools for AI Infrastructure Control
Manage everything from model discovery to run status checks. Use these 11 tools to orchestrate complex SenseTime AI workflows directly from your agent.
019d847cchat completions
Sends a message through the SenseCore large language model engine.
019d847ccreate assistant
Sets up and defines a new, dedicated AI assistant within your project.
019d847ccreate message
Adds content to an existing conversation thread for historical context or continuation.
019d847ccreate run
Starts executing a defined assistant against a specific message thread, initiating the core workflow.
019d847ccreate thread
Initializes a brand new, empty conversation history that can accept messages and runs.
019d847cget assistant details
Retrieves the full configuration settings for an existing AI assistant by its ID.
019d847cget run status
Checks and returns the current status (running, failed, complete) of a previously started assistant run.
019d847clist assistants
Lists all AI assistants that have been configured for use within your account's project.
019d847clist files
Retrieves a list of files that have been uploaded to the current workspace or thread context.
019d847clist messages
Fetches the complete message history for a given conversation thread ID.
019d847clist models
Gets a list of all available SenseNova foundation models and their versions.
Choose How to Get Started
Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.
Build Your Own
Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.
- Import from OpenAPI, Swagger, or YAML specs
- Create Agent Skills with progressive disclosure
- Deploy to edge with MCPFusion framework
- Built in DLP, auth, and compliance on every call
- Real time usage dashboard and cost metering
- Publish to catalog or keep private
Make Your AI Do More
Start with SenseCore Platform, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.
- Use this MCP plus 4,700+ others, all in one place
- Add new capabilities to your AI anytime you want
- Every connection is secured and compliant automatically
- Track usage and costs across all your servers
- Works with Claude, ChatGPT, Cursor, and more
- New servers added to the catalog every week
What you can do with this MCP connector
SenseCore Platform gives your AI agent the muscle it needs to run serious industrial-grade model workflows through MCP. You're not just connecting to an API; you're getting full control over the entire lifecycle, from defining a new assistant to tracking complex background jobs. Here’s what you can do with these tools.
Orchestrating Model Conversations
When your agent needs to talk, it uses chat_completions to send messages through SenseCore's large language model engine. You pass in a message and the specific SenseTime model you want to use; the engine handles the rest. If you need to maintain context—which you always do—you first initialize a brand new conversation using create_thread.
That gives your agent an empty history it can build on. Then, before running anything complex, you'll use list_messages to pull the entire message history for that thread ID so your agent remembers everything that was said. To add content at any point, even if you just want to inject a document reference or correct a previous turn, you call create_message, appending it directly to that existing conversation context.
When everything is set up—the thread exists, the messages are there, and the model is ready—you kick off the core action with create_run. This tool tells your agent to execute the defined assistant against the current message thread, initiating the full workflow run.
Managing AI Assistant Lifecycles
You don't just want a random chat; you need a structured process. That’s where dedicated assistants come in. You can set up and define a completely new, specialized AI assistant using create_assistant. This defines the rules for multi-step or guided conversations within your project. Once it's defined, if you need to know what kind of assistant is already running around, you run list_assistants to get a full roster of everything configured in your account.
If you want deep technical specs on a specific helper—like how many steps it has or which model it defaults to—you use get_assistant_details and pass in its ID. These tools let you build, manage, and audit the intelligence layer before any messages are even sent.
Discovering Model Specs and Files
If you don't know exactly what models you have access to, you run list_models. This gives you a complete catalog of every available SenseNova foundation model and tracks their specific versions. It’s your source of truth for what engines are running. Furthermore, if your workflow involves data—whether it’s reference documents or user uploads—you can use list_files to pull a list of all files uploaded to the current workspace or thread context, ensuring your agent knows exactly what data it's working with before it processes anything.
Monitoring and Tracking Jobs
Running an LLM isn't always instant. Sometimes you're talking about huge batches of inference or long training runs that take forever. You don't want to stare at a loading bar all day, so you use get_run_status. This checks the current status—whether it's running, if it failed, or if it completed successfully—for any complex assistant run you started earlier.
It keeps you in the loop without having to manually poll for updates.
In short, you get a complete operational view: you can define your workflow with dedicated assistants, pull the specific models and data files needed, start the conversation thread, kick off the job, and then constantly monitor its status until it's done. It’s comprehensive control over everything running on SenseCore.
How SenseCore Platform MCP Works
- 1 Subscribe to this server. Then, log into the SenseCore Console to get your API Key, Secret Key, Organization ID, and Project ID.
- 2 Insert those credentials into your AI client's configuration fields. This authenticates your agent against the SenseTime infrastructure.
- 3 Your agent can now call tools like
list_modelsorchat_completions, executing operations directly against the managed resources.
The bottom line is you plug in your credentials once, and your AI client handles all the complex orchestration calls for model control and resource monitoring after that.
Who Is SenseCore Platform MCP For?
MLOps engineers who are tired of manually checking dashboards for job status or API developers building applications that need enterprise-grade compute management. This is for anyone whose code needs to reliably talk to complex, paid AI infrastructure.
Uses get_run_status and list_assistants to monitor batch inference jobs and ensure models deploy correctly.
Calls create_assistant and chat_completions to build production-ready applications using SenseTime's specific foundation models.
Uses service monitoring tools to check P99 latency and compute node availability before deploying a new model endpoint.
What Changes When You Connect
- You don't have to guess if a job finished. Use
get_run_statusto check the real-time status of any long-running inference task, preventing stale data issues. - Stop rebuilding conversations from scratch. The combination of
create_thread,list_messages, andchat_completionslets you manage full, persistent chat history automatically. - It's not just about chatting—it's about structure. Define specific workflows using
create_assistantbefore running them withcreate_run. This makes the system repeatable. - Model discovery is fast. Instead of guessing model names, call
list_modelsto get a definitive list and technical specs for every SenseNova version available. - Resource checks are built-in. You can monitor compute node availability and track quota usage directly via the platform's tools before your agent even tries to run.
- Your code doesn't fail when context is missing. By using
list_messages, you ensure your agent always has access to the full thread history, keeping conversations coherent.
Real-World Use Cases
Building a Customer Support Bot
A support bot needs memory. The agent first calls create_thread and then uses list_messages to pull the last 10 turns of chat history. It passes this context payload, along with the user's query, to chat_completions. The result is a highly contextual reply that references past details.
Automating Model Integration
An ML team needs to test three new models. Instead of calling endpoints individually, the agent uses list_models to validate all available options first. It then chooses a specific model (e.g., SenseChat-5) and uses get_assistant_details to confirm its configuration before running any tests.
Debugging Failed Batch Jobs
A large data batch job fails in the background. The agent calls create_run and then repeatedly hits get_run_status. This gives an immediate status update, telling the engineer exactly which step failed (e.g., 'Quota Exceeded') so they can fix it without manual console diving.
Creating a Multi-Step Workflow
A user needs to process a document through several steps. The agent first defines the entire flow using create_assistant. Then, when the user is ready, it calls create_run on that assistant template, allowing SenseCore to handle all the internal logic.
The Tradeoffs
Assuming Conversation History
Calling chat_completions directly without first running an agent or retrieving history. The model will respond generically and lose context.
→
Always start by establishing state. First, call create_thread to initialize the conversation container. Then, retrieve past turns using list_messages before calling chat_completions. This guarantees continuity.
Running a Job Without Knowing Status
Executing create_run and then immediately assuming success. The job might be stuck or failed, leading to bad data.
→
After calling create_run, you must poll the system. Use get_run_status until the status is 'complete' or 'failed'. Never trust an immediate response for long tasks.
Confusing Assistants and Threads
Trying to run a task on an arbitrary message ID instead of using the dedicated workflow tools. The system won't know what process to follow.
→
A conversation needs structure. Use create_assistant first to define the 'how.' Then, use that assistant ID with create_run against a create_thread. This is the correct execution path.
When It Fits, When It Doesn't
Use this MCP Server if your application's core functionality depends on managing state across multiple AI interactions or needs deep oversight into compute resources. Specifically, if you need to track job status (get_run_status), manage structured conversations (create_assistant), or monitor GPU usage/latency (via the platform tools), this is necessary.
Don't use it if you just need a simple API call—for example, if you only need to send one isolated prompt without remembering anything. In that case, a basic, single-call REST endpoint might suffice. But because your application needs enterprise reliability (i.e., knowing why a job failed or what the P99 latency was), this orchestration layer is mandatory.
Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by SenseCore Platform. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.
VINKIUS INFRASTRUCTURE
Cloud Hosted
Managed infra
V8 Isolated
Sandboxed per request
Zero-Trust Proxy
No stored credentials
DLP Enforced
Policy on every call
GDPR Compliant
EU data residency
Token Compression
~60% cost reduction
Works with Claude, ChatGPT, Cursor, and more
The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.
This server provides 11 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.
Available Capabilities
Managing complex AI interactions shouldn't require writing three separate API calls just to remember context.
Today, if you build an agent that needs memory, you run into a mess of manual steps. You have to first check the conversation history endpoint (`list_messages`), then format those messages correctly, and finally send them all in a batch payload to the completion endpoint. It's error-prone and clutters your code.
With this MCP server, that whole sequence is encapsulated. You define the thread using `create_thread` and rely on the platform tools to maintain the state. Your agent just needs to call `chat_completions`, trusting that the underlying service handled all the history retrieval for you.
SenseCore Platform MCP Server: Structure your AI logic with `create_assistant`.
Without a defined assistant, every conversation is just a stream of messages. If your workflow requires specific tools or a predictable sequence (e.g., 'Summarize' then 'Translate'), you have to manage that flow manually in Python logic. By creating an assistant first (`create_assistant`), you build the guardrails into the platform itself. You define the process once, and then all subsequent runs are reliable and repeatable.
Common Questions About SenseCore Platform MCP
How do I check if a long-running inference job failed using get_run_status? +
You monitor the run status by repeatedly calling get_run_status with the run ID. If the status returns 'failed,' you'll also get error details explaining what caused the failure, letting you fix it fast.
Do I need to use create_assistant before chat_completions? +
No, not always. chat_completions works for simple chats. But if your conversation needs specific tools or a defined process (like data extraction), you should define it first using create_assistant.
What is the difference between create_thread and list_messages? +
create_thread makes an empty container for history. You call list_messages later to pull the actual content that has been added to that thread ID.
How do I get model specs without listing all models first? +
You use list_models to get a list of available names. Then, you can retrieve detailed specifications for any specific model name using the platform's dedicated inspection tools.
Before I can use `list_assistants`, what credentials do I need for my AI client? +
You must provide your API Key, Secret Key, and Organization ID. These are necessary authentication tokens that prove your agent has access to the SenseCore infrastructure.
If I hit rate limits while using `chat_completions`, how should my agent handle it? +
Your client must implement exponential backoff and retry logic. The API will return a specific status code (e.g., 429) when you exceed the defined request quota.
How do I manage files that are attached to a conversation using `list_files`? +
list_files retrieves all file IDs associated with your current project or thread context. You'll need the specific file ID when calling other tools, like sending messages.
I used `list_models` and saw 20 models; how do I know which one is best for my task? +
list_models only gives you names. For specific technical requirements—like parameter support or latency guarantees—you must call get_assistant_details using the model's ID.
Can I automatically list all available models in my SenseCore project? +
Yes! Use the list_models tool. Your agent will retrieve a complete list of all SenseTime foundation models and specialized variants currently active in your account.
How do I check the health status of my deployed model services? +
Use the get_service_health tool with the specific Service ID. The agent will return real-time metrics on availability, throughput, and average latency.
Can I monitor GPU resource utilization via the AI agent? +
Yes! The get_resource_usage tool retrieves granular metrics on compute node utilization and remaining quota for your specific project environment.
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
More in this category
HubSpot
Grow better with an integrated CRM that unifies marketing, sales, service, and operations in one powerful platform.
Cloudflare Stream
Manage video infrastructure via Cloudflare Stream—list videos, manage live inputs, and handle uploads directly from any AI agent.
AT&T IoT
IoT Control Center -- Manage SIM devices, activation, data pools, shared plans, and connectivity diagnostics via AT&T IoT API.
You might also like
Tower
Lightweight project management and team collaboration platform — manage tasks, projects, and discussions via AI.
Apex & Division Stats by TRN
Deep gaming analytics — player profiles, per-legend breakdowns, match history, and global leaderboards for Apex Legends and Division 2 via AI.
Wbiztool
Manage your WhatsApp Business account with bulk messaging, contact management, and campaign analytics for marketing teams.