EdgeMatrix Benchmark Dashboard

Comprehensive AI model performance analysis across hardware platforms

Average Performance Gain: 74.2% (across all models)

Average Cost Savings: 38.1% (hardware costs)

Average Power Efficiency: 39.6% (power saving)

Models Tested: 34 configurations

Peak Throughput: 21,015.55 tokens/sec

Showing 160 results
| Model | Category | Hardware | Size | Precision | Without EdgeMatrix (tokens/sec) | With EdgeMatrix (tokens/sec) | Improvement | Cost Savings | Power Efficiency |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Shakti-1B | Shakti Family | GPU: A100 (80GB) | 1.88GB | FP16 | 16,250 | 21,015.55 | +29.3% | 22.7% | 22.7% |
| Qwen2-VL-2B | Qwen Family | GPU: A100 (80GB) | 4.419GB | FP16 | 12,690.3 | 19,963.4 | +57.3% | 36.4% | 36.4% |
| DeepSeek-R1-Distill-Qwen-1.5B | DeepSeek Family | GPU: A100 (80GB) | 3.55GB | FP16 | 12,931.42 | 18,104.78 | +40.0% | 28.6% | 28.6% |
| Qwen2.5-VL-3B | Qwen Family | GPU: A100 (80GB) | 7.51GB | FP16 | 8,234.11 | 13,903.65 | +68.8% | 40.8% | 40.8% |
| DeepSeek-R1-Distill-Qwen-1.5B | DeepSeek Family | GPU: L40s (48GB) | 3.55GB | FP16 | 7,998 | 13,040.77 | +63.0% | 38.7% | 38.7% |
| Shakti-1B | Shakti Family | GPU: L40s (48GB) | 1.88GB | FP16 | 7,570 | 12,064.4 | +59.4% | 37.3% | 37.3% |
| DeepSeek-R1-Distill-Qwen-1.5B | DeepSeek Family | GPU: A100 (80GB) | 1.12GB | Q4 | 9,004.67 | 11,412.41 | +26.7% | 21.1% | 21.1% |
| Qwen2-VL-2B | Qwen Family | GPU: L40s (48GB) | 4.419GB | FP16 | 6,509.44 | 10,659.19 | +63.8% | 38.9% | 38.9% |
| Shakti-4B | Shakti Family | GPU: A100 (80GB) | 7.42GB | FP16 | 7,564 | 10,099.3 | +33.5% | 25.1% | 25.1% |
| Gemma-2-2B-IT | Gemma Family | GPU: A100 (80GB) | 5.14GB | FP16 | 7,256.9 | 9,954.13 | +37.2% | 27.1% | 27.1% |
| Gemma-2-2B-IT | Gemma Family | GPU: A100 (80GB) | 1.71GB | Q4 | 8,904.67 | 9,929.41 | +11.5% | 10.3% | 10.3% |
| InternVL2.5-4B | InternVL Family | GPU: A100 (80GB) | 7.42GB | FP16 | 4,576.06 | 9,542.87 | +108.5% | 52.0% | 52.0% |
| Gemma-3-4B-IT | Gemma Family | GPU: A100 (80GB) | 8.6GB | FP16 | 5,798.23 | 9,462.91 | +63.2% | 38.7% | 38.7% |
| Qwen2.5-VL-3B | Qwen Family | GPU: L40s (48GB) | 7.51GB | FP16 | 1,738.95 | 9,188.68 | +428.4% | 81.1% | 81.1% |
| DeepSeek-R1-Distill-Qwen-1.5B | DeepSeek Family | GPU: L40s (48GB) | 1.12GB | Q4 | 7,011.01 | 8,527.61 | +21.6% | 17.8% | 17.8% |
| Llama-3.1-8B | Llama Family | GPU: H100 (80GB) | 16GB | FP16 | 6,473.07 | 7,956.74 | +22.9% | 18.6% | 18.6% |
| Shakti-2.5B | Shakti Family | GPU: A100 (80GB) | 6.43GB | FP16 | 6,120.03 | 7,854.42 | +66.7% | 40.0% | 40.0% |
| Phi-4-mini-Instruct | Phi Family | GPU: A100 (80GB) | 7.67GB | FP16 | 5,285.01 | 6,870.52 | +30.0% | 23.1% | 23.1% |
| Llama-3.2-3B | Llama Family | GPU: A100 (80GB) | 2.02GB | Q4 | 4,014.23 | 6,852.23 | +70.7% | 41.4% | 41.4% |
| Phi-4-mini-reasoning | Phi Family | GPU: A100 (80GB) | 7.67GB | FP16 | 3,105.49 | 6,607.25 | +112.7% | 53.0% | 53.0% |
| Llama-3.2-3B | Llama Family | GPU: A100 (80GB) | 6.6GB | FP16 | 3,581.4 | 6,362.89 | +77.7% | 43.7% | 43.7% |
| Gemma-3-4B-IT | Gemma Family | GPU: A100 (80GB) | 8.6GB | FP16 | 4,998.02 | 6,063.09 | +21.3% | 17.6% | 17.5% |
| Shakti-4B | Shakti Family | GPU: L40s (48GB) | 7.42GB | FP16 | 2,339 | 5,789 | +147.5% | 59.6% | 59.6% |
| Gemma-2-2B-IT | Gemma Family | GPU: L40s (48GB) | 1.71GB | Q4 | 4,501 | 5,699.2 | +26.6% | 21.0% | 21.0% |
| Shakti-2.5B | Shakti Family | GPU: L40s (48GB) | 1.5GB | FP16 | 3,122.92 | 5,612.24 | +66.7% | 40.0% | 52.5% |
| InternVL2.5-4B | InternVL Family | GPU: L40s (48GB) | 7.42GB | FP16 | 2,951.75 | 5,569.34 | +88.7% | 47.0% | 47.0% |
| Llama-3.2-3B | Llama Family | GPU: L40s (48GB) | 2.02GB | Q4 | 3,163.2 | 5,183.45 | +63.9% | 39.0% | 41.3% |
| InternVL2-2B | InternVL Family | GPU: A100 (80GB) | 4.41GB | FP16 | 2,074 | 4,970.63 | +139.7% | 58.3% | 58.3% |
| Llama-3.1-8B | Llama Family | GPU: A100 (80GB) | 16GB | FP16 | 3,796.37 | 4,875.86 | +28.4% | 22.1% | 22.1% |
| Gemma-2-2B-IT | Gemma Family | GPU: L40s (48GB) | 5.14GB | FP16 | 2,862.5 | 4,698.01 | +64.1% | 39.1% | 39.1% |
| InternVL2-2B | InternVL Family | GPU: L40s (48GB) | 4.41GB | FP16 | 1,959 | 4,684.38 | +139.1% | 58.2% | 58.2% |
| Llama-Guard-3-8B | Llama Family | GPU: A100 (80GB) | 16.07GB | FP16 | 3,528.9 | 4,517 | +28.0% | 21.9% | 21.9% |
| Llama-3.2-3B | Llama Family | GPU: L40s (48GB) | 6.6GB | FP16 | 1,676.3 | 4,467.32 | +166.4% | 62.5% | 52.5% |
| DeepSeek-R1-Distill-Llama-8B | DeepSeek Family | GPU: A100 (80GB) | 16.06GB | FP16 | 3,424.03 | 4,417.09 | +29.0% | 22.5% | 22.5% |
| Janus-Pro-7B | DeepSeek Family | GPU: A100 (80GB) | 14.84GB | FP16 | 2,654 | 4,235 | +59.6% | 37.3% | 37.3% |
| Gemma-2-9B-IT | Gemma Family | GPU: A100 (80GB) | 18.48GB | FP16 | 3,269.69 | 4,184.52 | +28.0% | 21.9% | 21.9% |
| Phi-4-mini-Instruct | Phi Family | GPU: A100 (80GB) | 2.49GB | Q4 | 3,121.23 | 4,117.11 | +31.9% | 24.2% | 24.2% |
| Phi-4-mini-reasoning | Phi Family | GPU: A100 (80GB) | 2.49GB | Q4 | 3,281.02 | 4,109.32 | +25.2% | 20.2% | 20.1% |
| LLaVA-OneVision-Qwen2-7B | LLaVA Family | GPU: A100 (80GB) | 16.06GB | FP16 | 1,850 | 4,101.7 | +121.7% | 54.9% | 54.9% |
| Gemma-3-4B-IT | Gemma Family | GPU: A100 (80GB) | 2.49GB | Q4 | 3,039.12 | 4,007.55 | +31.9% | 24.2% | 24.2% |
| Qwen3-4B | Qwen Family | GPU: A100 (80GB) | 2.2GB | Q4 | 3,068.12 | 3,917.55 | +27.7% | 21.7% | 21.7% |
| Gemma-3-4B-IT | Gemma Family | GPU: L40s (48GB) | 8.6GB | FP16 | 2,220.35 | 3,903 | +75.8% | 43.1% | 43.1% |
| DeepSeek-R1-Distill-Llama-8B | DeepSeek Family | GPU: A100 (80GB) | 4.92GB | Q4 | 3,098.77 | 3,884 | +25.3% | 20.2% | 20.2% |
| Qwen3-4B | Qwen Family | GPU: A100 (80GB) | 8.1GB | FP16 | 3,105.59 | 3,837.05 | +23.5% | 19.1% | 19.1% |
| Llama-3.1-8B | Llama Family | GPU: A100 (80GB) | 4.92GB | Q4 | 3,081.02 | 3,822.92 | +24.1% | 19.4% | 19.4% |
| Gemma-3-12B-IT | Gemma Family | GPU: A100 (80GB) | 24.32GB | FP16 | 3,061.5 | 3,801.01 | +24.1% | 19.5% | 19.5% |
| Phi-4-multimodal | Phi Family | GPU: A100 (80GB) | 11.12GB | FP16 | 1,816 | 3,769.45 | +107.6% | 51.8% | 51.8% |
| Qwen3-8B | Qwen Family | GPU: A100 (80GB) | 5.03GB | Q4 | 3,208.87 | 3,709.09 | +15.6% | 13.5% | 13.5% |
| Phi-4-multimodal | Phi Family | GPU: L40s (48GB) | 11.12GB | FP16 | 1,564.63 | 3,641 | +132.7% | 57.0% | 57.0% |
| Llama-Guard-3-8B | Llama Family | GPU: A100 (80GB) | 4.92GB | Q4 | 2,967.97 | 3,618.06 | +21.9% | 18.0% | 18.0% |
| Qwen3-4B | Qwen Family | GPU: L40s (48GB) | 2.2GB | Q4 | 2,991.43 | 3,587.3 | +19.9% | 16.6% | 16.6% |
| Phi-4-mini-Instruct | Phi Family | GPU: L40s (48GB) | 7.67GB | FP16 | 2,116.71 | 3,493 | +65.0% | 39.4% | 39.4% |
| Phi-4-mini-reasoning | Phi Family | GPU: L40s (48GB) | 7.67GB | FP16 | 2,013.6 | 3,434.8 | +70.6% | 41.4% | 41.4% |
| Llama-3.2-1B | Llama Family | Device: Tesla T4 (16GB) | 808MB | FP16 | 2,793.42 | 3,256.68 | +16.6% | 14.2% | 14.3% |
| Gemma-2-9B-IT | Gemma Family | GPU: A100 (80GB) | 5.76GB | Q4 | 2,360.33 | 3,209 | +36.0% | 26.4% | 26.4% |
| Qwen3-8B | Qwen Family | GPU: A100 (80GB) | 16.5GB | FP16 | 2,845.2 | 3,159.02 | +11.0% | 9.1% | 9.9% |
| Qwen3-8B | Qwen Family | GPU: L40s (48GB) | 5.03GB | Q4 | 2,829.54 | 3,118.23 | +10.2% | 9.3% | 9.3% |
| Qwen3-4B | Qwen Family | GPU: L40s (48GB) | 8.1GB | FP16 | 2,696.34 | 3,110.34 | +15.4% | 13.3% | 13.3% |
| Llama-3.1-8B | Llama Family | GPU: L40s (48GB) | 4.92GB | Q4 | 2,639.54 | 3,089.13 | +17.0% | 14.6% | 14.6% |
| Gemma-3-4B-IT | Gemma Family | GPU: L40s (48GB) | 8.6GB | FP16 | 1,749.45 | 2,974.85 | +70.0% | 41.2% | 41.2% |
| Gemma-3-12B-IT | Gemma Family | GPU: A100 (80GB) | 7.3GB | Q4 | 2,241.31 | 2,904.86 | +29.6% | 22.8% | 22.8% |
| Qwen3-8B | Qwen Family | GPU: L40s (48GB) | 16.5GB | FP16 | 2,428.23 | 2,874.12 | +18.4% | 15.5% | 15.5% |
| Llama-3.1-8B | Llama Family | GPU: L40s (48GB) | 16GB | FP16 | 1,578.77 | 2,748.51 | +74.1% | 42.5% | 42.6% |
| Janus-Pro-7B | DeepSeek Family | GPU: L40s (48GB) | 14.84GB | FP16 | 1,380 | 2,746.76 | +99.2% | 49.8% | 49.8% |
| LLaVA-OneVision-Qwen2-7B | LLaVA Family | GPU: L40s (48GB) | 16.06GB | FP16 | 1,154.6 | 2,704.22 | +134.2% | 57.3% | 57.3% |
| Llama-Guard-3-8B | Llama Family | GPU: L40s (48GB) | 16.07GB | FP16 | 1,528.9 | 2,575 | +68.4% | 40.6% | 40.6% |
| InternVL2-8B | InternVL Family | GPU: A100 (80GB) | 16.16GB | FP16 | 1,553.84 | 2,423.82 | +56.0% | 35.9% | 35.9% |
| DeepSeek-R1-Distill-Llama-8B | DeepSeek Family | GPU: L40s (48GB) | 16.06GB | FP16 | 1,552.96 | 2,281.91 | +46.9% | 31.9% | 31.9% |
| Phi-4-mini-Instruct | Phi Family | GPU: L40s (48GB) | 2.49GB | Q4 | 1,752.2 | 2,238.41 | +27.7% | 21.7% | 21.7% |
| Phi-4-mini-reasoning | Phi Family | GPU: L40s (48GB) | 2.49GB | Q4 | 1,760.3 | 2,149.98 | +22.1% | 18.1% | 18.1% |
| Gemma-3-4B-IT | Gemma Family | GPU: L40s (48GB) | 2.49GB | Q4 | 1,682 | 1,890.36 | +12.4% | 11.0% | 11.0% |
| Gemma-2-9B-IT | Gemma Family | GPU: L40s (48GB) | 18.48GB | FP16 | 1,138.09 | 1,723.46 | +51.4% | 34.0% | 34.0% |
| InternVL2-8B | InternVL Family | GPU: L40s (48GB) | 16.16GB | FP16 | 1,020.33 | 1,700.65 | +66.7% | 40.0% | 40.0% |
| DeepSeek-R1-Distill-Llama-8B | DeepSeek Family | GPU: L40s (48GB) | 4.92GB | Q4 | 1,298.2 | 1,569.88 | +20.9% | 17.3% | 17.3% |
| Llama-3.2-3B | Llama Family | GPU: T4 (16GB) | 6.6GB | FP16 | 1,132.43 | 1,518.19 | +34.0% | 25.4% | 25.4% |
| Gemma-2-9B-IT | Gemma Family | GPU: L40s (48GB) | 5.76GB | Q4 | 1,156.5 | 1,428.07 | +23.5% | 19.0% | 19.0% |
| Gemma-3-12B-IT | Gemma Family | GPU: L40s (48GB) | 24.32GB | FP16 | 941.33 | 1,412.09 | +50.0% | 33.3% | 33.3% |
| Gemma-3-12B-IT | Gemma Family | GPU: L40s (48GB) | 7.3GB | Q4 | 831.9 | 1,049.86 | +26.2% | 20.8% | 20.8% |
| Llama-Guard-3-8B | Llama Family | GPU: L40s (48GB) | 4.92GB | Q4 | 769.15 | 978.74 | +27.3% | 21.4% | 21.4% |
| Llama-3.1-8B | Llama Family | Device: Tesla T4 (16GB) | 16GB | INT4 | 380.59 | 502.43 | +32.0% | 24.3% | 24.3% |
| Shakti-250M | Shakti Family | Device: MacBook Pro M3 (36GB) | 148MB | Q4 | 295 | 385 | +30.5% | 23.4% | N/A |
| Shakti-100M | Shakti Family | Device: MacBook Pro M3 (36GB) | 126MB | Q4 | 280 | 365 | +30.4% | 23.3% | N/A |
| Shakti-500M | Shakti Family | Device: MacBook Pro M3 (36GB) | 303MB | Q4 | 215 | 281.43 | +30.9% | 23.6% | N/A |
| SmolLM2-135M | SmolLM Family | Device: MacBook Pro M3 (36GB) | 105MB | Q4 | 175 | 227.21 | +29.8% | 23.0% | N/A |
| SmolLM2-360M | SmolLM Family | Device: MacBook Pro M3 (36GB) | 271MB | Q4 | 140 | 182.81 | +30.6% | 23.4% | N/A |
| Qwen2.5-500M | Qwen Family | Device: MacBook Pro M3 (36GB) | 398MB | Q4 | 135 | 173.82 | +28.8% | 22.3% | N/A |
| Shakti-100M | Shakti Family | Device: iPhone 14 (6GB) | 126MB | Q4 | 120 | 153.7 | +28.1% | 21.9% | N/A |
| Qwen3-0.6B | Qwen Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 456.11MB | Q4 | 54.03 | 152.7 | +182.5% | 64.6% | 64.6% |
| Qwen3-1.7B | Qwen Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 1.28GB | Q4 | 49.22 | 135.6 | +175.4% | 63.7% | 63.7% |
| Shakti-2.5B | Shakti Family | Device: MacBook Pro M3 (36GB) | 1.5GB | Q4 | 95 | 128 | +34.7% | 25.8% | N/A |
| Llama-3.2-3B | Llama Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 2.02GB | Q4 | 46.4 | 124.76 | +168.8% | 62.8% | 62.8% |
| Qwen3-0.6B | Qwen Family | CPU: AMD EPYC 9554 (32 cores, 117GB) | 456.11MB | Q4 | 45.11 | 115.6 | +156.3% | 61.0% | 61.0% |
| Qwen3-4B | Qwen Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 2.2GB | Q4 | 38.04 | 97.45 | +156.2% | 61.0% | 61.0% |
| Qwen3-1.7B | Qwen Family | CPU: AMD EPYC 9554 (32 cores, 117GB) | 1.28GB | Q4 | 36.44 | 94.34 | +158.9% | 61.4% | 61.4% |
| Shakti-250M | Shakti Family | Device: iPhone 14 (6GB) | 148MB | Q4 | 65 | 88.11 | +35.5% | 26.2% | N/A |
| Llama-3.3-70B | Llama Family | GPU: A100 (80GB) | 42.5GB | Q4 | 48.87 | 84.24 | +72.3% | 41.8% | 42.0% |
| Llama-3.2-3B | Llama Family | CPU: AMD EPYC 9554 (32 cores, 117GB) | 2.02GB | Q4 | 32.9 | 82.34 | +150.4% | 60.1% | 60.0% |
| Qwen3-0.6B | Qwen Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 456.11MB | Q4 | 37.9 | 82.2 | +117.0% | 53.9% | 53.9% |
| DeepSeek-R1-Distill-Qwen-1.5B | DeepSeek Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 1.12GB | Q4 | 42.01 | 81.9 | +95.0% | 48.7% | N/A |
| Qwen3-8B | Qwen Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 5.03GB | Q4 | 28.64 | 79.42 | +177.3% | 64.0% | 63.9% |
| Gemma-2-2B-IT | Gemma Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 1.71GB | Q4 | 41.31 | 75.28 | +82.2% | 45.1% | 45.1% |
| Qwen3-1.7B | Qwen Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 1.28GB | Q4 | 32.77 | 74.23 | +126.5% | 55.9% | 55.9% |
| Llama-3.2-3B | Llama Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 2.02GB | Q4 | 28.39 | 64.23 | +126.3% | 55.8% | 55.8% |
| Shakti-500M | Shakti Family | Device: iPhone 14 (6GB) | 303MB | Q4 | 45 | 62.4 | +38.7% | 27.9% | N/A |
| Shakti-100M | Shakti Family | Device: Raspberry Pi 5 (8GB) | 126MB | Q4 | 45 | 60.74 | +35.0% | 25.9% | N/A |
| Qwen3-0.6B | Qwen Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 456.11MB | Q4 | 29.34 | 56.72 | +93.4% | 48.3% | 48.3% |
| Qwen3-4B | Qwen Family | CPU: AMD EPYC 9554 (32 cores, 117GB) | 2.2GB | Q4 | 22.6 | 53.99 | +138.8% | 58.1% | 58.1% |
| DeepSeek-R1-Distill-Qwen-1.5B | DeepSeek Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 1.12GB | Q4 | 30.78 | 53.74 | +74.6% | 42.7% | 42.7% |
| Llama-3.1-8B | Llama Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 4.92GB | Q4 | 22.67 | 52.34 | +130.9% | 56.7% | 56.7% |
| DeepSeek-R1-Distill-Qwen-1.5B | DeepSeek Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 1.12GB | Q4 | 29.47 | 52.25 | +95.0% | 43.6% | 43.6% |
| Shakti-250M | Shakti Family | Device: Raspberry Pi 5 (8GB) | 148MB | Q4 | 35 | 48.911 | +39.8% | 28.4% | N/A |
| Qwen3-1.7B | Qwen Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 1.28GB | Q4 | 22.87 | 47.32 | +106.9% | 51.7% | N/A |
| Phi-4-mini-reasoning | Phi Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 2.49GB | Q4 | 25 | 46.42 | +85.7% | 46.1% | 46.1% |
| Phi-4-mini-Instruct | Phi Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 2.49GB | Q4 | 26.6 | 45.69 | +71.8% | 41.8% | 41.8% |
| Qwen3-4B | Qwen Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 2.2GB | Q4 | 19.7 | 44.01 | +123.4% | 55.2% | 55.2% |
| Gemma-2-2B-IT | Gemma Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 1.71GB | Q4 | 22.17 | 43.02 | +94.0% | 48.5% | 48.5% |
| Gemma-3-4B-IT | Gemma Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 2.49GB | Q4 | 21.99 | 40.83 | +85.7% | 46.1% | 46.1% |
| Qwen3-8B | Qwen Family | CPU: AMD EPYC 9554 (32 cores, 117GB) | 5.03GB | Q4 | 16.02 | 39.98 | +149.6% | 59.9% | 59.9% |
| Shakti-100M | Shakti Family | CPU: Intel Xeon Silver 4110 (197GB) | 126MB | Q4 | 21.35 | 36.64 | +71.6% | 41.7% | 41.7% |
| Phi-4-mini-reasoning | Phi Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 2.49GB | Q4 | 17.17 | 34.74 | +102.3% | 50.6% | 50.6% |
| Llama-3.1-8B | Llama Family | CPU: AMD EPYC 9554 (32 cores, 117GB) | 4.92GB | Q4 | 19.43 | 34.42 | +77.1% | 43.6% | 43.5% |
| Llama-3.2-3B | Llama Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 2.02GB | Q4 | 16.6 | 33.98 | +104.7% | 51.1% | 51.1% |
| Phi-4-mini-Instruct | Phi Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 2.49GB | Q4 | 16.33 | 33.98 | +108.1% | 51.9% | 51.9% |
| Llama-3.3-70B | Llama Family | GPU: L40s (48GB) | 42.5GB | Q4 | 19.78 | 33.48 | +69.3% | 40.9% | 41.9% |
| SmolLM2-135M | SmolLM Family | Device: Raspberry Pi 5 (8GB) | 105MB | Q4 | 25 | 32.355 | +29.4% | 22.7% | N/A |
| Gemma-2-2B-IT | Gemma Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 1.71GB | Q4 | 14.67 | 31.8 | +116.8% | 53.9% | 53.9% |
| Gemma-3-4B-IT | Gemma Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 2.49GB | Q4 | 17.07 | 30.33 | +77.7% | 43.7% | 43.7% |
| Shakti-500M | Shakti Family | Device: Raspberry Pi 5 (8GB) | 303MB | Q4 | 22 | 29.54 | +34.3% | 25.5% | N/A |
| Qwen3-4B | Qwen Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 2.2GB | Q4 | 13.7 | 29.44 | +114.9% | 53.2% | 53.5% |
| Qwen3-8B | Qwen Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 5.03GB | Q4 | 12.65 | 29.11 | +130.1% | 56.5% | 59.9% |
| SmolLM2-360M | SmolLM Family | Device: Raspberry Pi 5 (8GB) | 271MB | Q4 | 22 | 28.99 | +31.8% | 24.1% | N/A |
| DeepSeek-R1-Distill-Llama-8B | DeepSeek Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 4.92GB | Q4 | 15.4 | 28.09 | +82.4% | 45.2% | 45.2% |
| Shakti-2.5B | Shakti Family | Device: iPhone 14 (6GB) | 1.5GB | Q4 | 18 | 27.32 | +51.8% | 34.1% | N/A |
| Llama-Guard-3-8B | Llama Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 4.92GB | Q4 | 13.01 | 27.28 | +109.7% | 52.3% | 52.3% |
| Llama-3.1-8B | Llama Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 4.92GB | Q4 | 17.8 | 26.56 | +49.2% | 33.0% | 33.0% |
| Shakti-250M | Shakti Family | CPU: Intel Xeon Silver 4110 (197GB) | 148MB | Q4 | 14.75 | 25.67 | +74.0% | 42.5% | 42.5% |
| Llama-3.2-1B | Llama Family | CPU: Intel Xeon Silver 4110 (197GB) | 808MB | Q4 | 10.35 | 24.78 | +139.4% | 58.2% | 58.2% |
| Phi-4-mini-Instruct | Phi Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 2.49GB | Q4 | 11 | 23.94 | +117.6% | 54.0% | 54.0% |
| Gemma-2-9B-IT | Gemma Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 5.76GB | Q4 | 11.09 | 23.09 | +108.2% | 52.0% | 52.0% |
| Gemma-3-4B-IT | Gemma Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 2.49GB | Q4 | 11.63 | 22.48 | +93.3% | 48.3% | 43.7% |
| Phi-4-mini-reasoning | Phi Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 2.49GB | Q4 | 8.55 | 18.8 | +119.9% | 54.5% | 54.5% |
| Gemma-3-12B-IT | Gemma Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 7.3GB | Q4 | 10.6 | 18.71 | +76.5% | 43.3% | 43.3% |
| Qwen2.5-500M | Qwen Family | Device: Raspberry Pi 5 (8GB) | 398MB | Q4 | 14 | 18.24 | +30.3% | 23.2% | N/A |
| Llama-3.1-8B | Llama Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 4.92GB | Q4 | 13.4 | 17.03 | +27.1% | 21.3% | 21.3% |
| Llama-Guard-3-8B | Llama Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 4.92GB | Q4 | 7.98 | 16.78 | +110.2% | 52.4% | 52.4% |
| DeepSeek-R1-Distill-Llama-8B | DeepSeek Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 4.92GB | Q4 | 8.79 | 15.99 | +81.9% | 45.0% | 45.0% |
| Qwen3-8B | Qwen Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 5.03GB | Q4 | 8.34 | 15.34 | +83.9% | 45.5% | 45.6% |
| Shakti-500M | Shakti Family | CPU: Intel Xeon Silver 4110 (197GB) | 303MB | Q4 | 4.56 | 14.26 | +212.7% | 68.0% | 68.0% |
| Gemma-2-9B-IT | Gemma Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 5.76GB | Q4 | 7.92 | 13.07 | +65.0% | 39.4% | 39.4% |
| Llama-Guard-3-8B | Llama Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 4.92GB | Q4 | 6.96 | 12.76 | +90.7% | 49.5% | 45.5% |
| Gemma-3-12B-IT | Gemma Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 7.3GB | Q4 | 6.75 | 10.9 | +61.5% | 38.1% | 38.1% |
| Gemma-2-9B-IT | Gemma Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 5.76GB | Q4 | 5.11 | 10.08 | +97.3% | 49.3% | 49.3% |
| DeepSeek-R1-Distill-Llama-8B | DeepSeek Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 4.92GB | Q4 | 4.88 | 9.58 | +96.3% | 49.1% | 49.1% |
| Shakti-2.5B | Shakti Family | CPU: Intel Xeon Silver 4110 (197GB) | 1.5GB | Q4 | 5.12 | 9.35 | +82.6% | 45.2% | 45.2% |
| Gemma-3-12B-IT | Gemma Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 7.3GB | Q4 | 4.52 | 7.89 | +74.5% | 42.7% | 42.7% |
| Llama-3.3-70B | Llama Family | CPU: AMD EPYC 9554 (60 cores, 201GB) | 42.5GB | Q4 | 2.7 | 7.65 | +183.3% | 64.7% | 64.7% |
| Llama-3.3-70B | Llama Family | CPU: AMD EPYC 9554 (32 cores, 117GB) | 42.5GB | Q4 | 2.01 | 5.6 | +178.6% | 64.1% | 64.1% |
| Shakti-2.5B | Shakti Family | Device: Raspberry Pi 5 (8GB) | 1.5GB | Q4 | 3.2 | 4.45 | +39.1% | 28.1% | N/A |
| Llama-3.3-70B | Llama Family | CPU: Intel Core i7-14700K (28 cores, 94GB) | 42.5GB | Q4 | 2.1 | 4.34 | +106.7% | 51.6% | 51.6% |
| Llama-3.3-70B | Llama Family | CPU: AMD EPYC 9554 (16 cores, 105GB) | 42.5GB | Q4 | 1.43 | 2.01 | +40.6% | 28.9% | 28.9% |
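
The dashboard does not state how its percentage columns are derived. In the first row and most others, Improvement matches the relative change between the two throughput figures, and Cost Savings matches the share of hardware no longer needed to serve the same token volume. A few rows do not fit this pattern, so the Python sketch below is an assumption about the formulas, not a documented definition; the function names `improvement_pct` and `implied_savings_pct` are hypothetical.

```python
def improvement_pct(without_tps: float, with_tps: float) -> float:
    """Relative throughput gain in percent (assumed definition of Improvement)."""
    return (with_tps - without_tps) / without_tps * 100.0


def implied_savings_pct(without_tps: float, with_tps: float) -> float:
    """Percent of hardware no longer needed for the same token volume
    (assumed definition of Cost Savings: 1 - without/with)."""
    return (1.0 - without_tps / with_tps) * 100.0


# Throughput values from the first row: Shakti-1B, A100 (80GB), FP16.
without_tps, with_tps = 16250.0, 21015.55

print(f"Improvement:  +{improvement_pct(without_tps, with_tps):.1f}%")    # +29.3%
print(f"Cost savings: {implied_savings_pct(without_tps, with_tps):.1f}%")  # 22.7%
```

Under this reading, cost savings can never reach 100% however large the speedup, which is why a +428.4% improvement corresponds to 81.1% savings.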