Local runner
Recommended for you
Choose the quantized setup
Recent runs
Tracked execution
Secondary tools
Build, export, and learn
Top contributors
Community evidence should stay cumulative, exportable, and easy to inspect.
Recommendations
Which setup should I run?
Decision input
Describe the deployment shape and let InferGrade rank the current evidence.
Decision table
Use the table first, then open the rows and families that need a closer read.
Leading candidates
Shortlist cards stay secondary to the table and plot.
Browse raw evidence
Keep one foot in the broader catalog while the decision slice stays active.
Best under VRAM
Practical local choices for the current envelope.
Pareto frontier
Trade-offs worth opening directly in compare.
Historical Results
Recent benchmark evidence
| Model | Backend | Use Case | TTFT | Tok/s | Hardware | Capability | Verification |
|---|
Compare
Choose between families, variants, and quants
Preset views
Start from a useful model-choice stance, then refine the exact variants or drop to raw runs.
Raw result drill-down
Result Detail
One run, fully explained
Family Explorer
Branches, quants, and nearby slices
Build
Create a short decision-suite run
Start with a local-friendly decision suite. Expand only when you need deeper reference evidence.
Generated Config
Stored server-side so local and cloud execution stay consistent.
No run config generated yet.
Run Status
Active and recent runs
Recent runs
Pick a run to inspect its current stage and progress.
Live timeline
Use the timeline to understand what changed and why.
Stored Run Configs
Reusable benchmark programs
My Runs