๐Ÿงช Vibe Test Results

Multi-Model AI Performance Analysis for OllamaPy

2
Models Tested
5
Iterations Each
2
Skills Tested
4
Total Tests
Generated: 2025-09-01 12:00:00 UTC

๐Ÿ† Model Performance Comparison

Tested 2 models with 5 iterations each

#1

Gemma 3 12B

Full-featured 12B parameter model for comprehensive performance

Success Rate 94.0%
Average Response Time 3.20s
Consistency Score 76/100
Performance Category Moderate
#2

Gemma 3 4B

Compact 4B parameter model optimized for speed

Success Rate 84.0%
Average Response Time 1.80s
Consistency Score 88/100
Performance Category Fast

๐Ÿ“Š Skill Performance by Model

Compare how different models perform on each skill

calculate

Perform mathematical calculations

getWeather

Get weather information for a location