# T-Test Statistics Engine MCP

> T-Test Statistics Engine provides mathematically guaranteed t-tests for your AI client. Stop relying on language models to calculate p-values; this MCP runs exact Student's, Welch's, and Paired t-tests locally using a robust statistical engine. Get precise, deterministic results every time you need to test data significance.

## Overview
- **Category:** developer-tools
- **Price:** Free
- **Tags:** statistics, data-science, mathematics, hypothesis-testing, deterministic-math, p-value

## Description

When you’re working with real data—like A/B testing conversion rates or medical readings—you can't afford for your AI agent to guess the math. Language models are great at talking about statistics, but they fail spectacularly when it comes to calculation.

This MCP solves that problem by bringing deterministic computation into your workflow. Instead of asking your agent to calculate a p-value and hoping for the best, you route the data through this engine. It handles all complex math—including Student's t-tests, Welch's t-tests, and Paired t-tests—using a reliable local statistical library. Your AI client extracts the raw numbers and sends them here; we guarantee the mathematically correct t-score, degrees of freedom, and p-value back to you.

This means your analysis is based on solid computation, not educated guesswork. You'll know exactly whether or not to reject the null hypothesis at alpha=0.05 without needing a second pair of eyes. Connecting this MCP via Vinkius gives all your compatible AI clients access to statistical rigor, making your data-driven decisions trustworthy.

## Tools

### calculate_t_test
Runs precise t-tests (independent, paired, one-sample) on data to calculate statistical significance without guessing.

## Prompt Examples

**Prompt:** 
```
Run an independent t-test to see if the conversion rates for Variant A and Variant B are significantly different.
```

**Response:** 
```
The t-score is 2.45 and the p-value is 0.018. Since p < 0.05, there is a statistically significant difference between the two variants.
```

**Prompt:** 
```
Do a paired t-test on these pre-treatment and post-treatment blood pressure readings.
```

**Response:** 
```
The paired t-test gives a p-value of 0.002. We reject the null hypothesis — the treatment had a statistically significant effect on blood pressure.
```

**Prompt:** 
```
Perform a one-sample t-test to check if this batch's mean weight differs from the target of 500g.
```

**Response:** 
```
The calculated p-value is 0.34. We fail to reject the null hypothesis — the batch weight is not significantly different from the 500g target.
```

## Capabilities

### Determine Statistical Significance
The tool calculates the p-value and t-score to tell you if observed differences between datasets are statistically meaningful.

### Compare Independent Datasets
You can run a Student's t-test to see if two separate groups, like conversion rates for Variant A and Variant B, differ significantly.

### Analyze Paired Measurements
The engine processes paired data, such as blood pressure readings taken before and after a treatment, to find meaningful changes.

### Validate Against a Target Mean
Check if a single dataset's average deviates from a known benchmark or target value using a one-sample t-test.

## Use Cases

### Comparing two marketing variants
A product analyst wants to know if the new checkout flow (Variant B) truly increases conversions compared to the old one (Variant A). They use calculate_t_test on both datasets, receiving a clear p-value. Since p < 0.05, they confirm that Variant B is statistically better and proceed with the rollout.

### Validating clinical trial results
A biostatistician has blood pressure readings taken before and after a new medication. Running a paired t-test shows a strong, significant drop in average readings, allowing them to confidently conclude the treatment was effective.

### Checking batch quality control
A manufacturing engineer needs to verify if an entire run of product weights is consistent with the 500g standard. A one-sample t-test runs against the target, confirming that the average weight is not significantly different from spec.

## Benefits

- Eliminates math hallucination. You get deterministic, CPU-guaranteed p-values instead of relying on an LLM's best guess for statistical significance.
- Supports the full suite of necessary tests: run independent comparisons (like comparing two ad campaign groups), paired measurements (pre/post data), and one-sample checks against a target mean.
- Keeps your data private. The complex math runs locally, meaning sensitive company or research data never leaves your environment when using this MCP.
- Automates interpretation. After calculating the metrics, the tool automatically tells your agent whether to reject the null hypothesis at the standard alpha=0.05 level.
- Direct integration for deep workflows. Connects directly through Vinkius, letting any compatible AI client use statistical rigor in natural conversation or code execution.

## How It Works

The bottom line is that you get accurate statistical results without having to write complex Python code or worry about LLM math errors.

1. Your AI client identifies the data points and the type of test needed (e.g., paired, independent).
2. It sends the raw dataset to this MCP for deterministic calculation.
3. You receive a clean output containing the precise t-score, degrees of freedom, and statistically guaranteed p-value.

## Frequently Asked Questions

**Does T-Test Statistics Engine MCP handle A/B testing?**
Yes, you use calculate_t_test for this. You simply feed in the conversion data from Variant A and Variant B as two separate groups to determine if their performance difference is statistically significant.

**Can I run a paired t-test with T-Test Statistics Engine MCP?**
Yes, calculate_t_test supports paired tests. This is crucial for measuring change over time, like comparing pre- and post-intervention measurements on the same subject.

**Is this better than using a standard Python library?**
It's designed to be easier for your agent to use. While it uses robust engines under the hood, you interact with reliable tools that guarantee calculation without needing to manage complex code dependencies.

**What kind of data does calculate_t_test accept?**
It accepts numerical datasets—any numbers representing measurements (e.g., rates, counts, scores). It's designed for continuous measurement metrics.