Your data is hierarchical.Your model should be too.

Deliver precise visual search for highly specific queries. hyper3-clip uses hyperbolic geometry to understand complex hierarchies and rank the exact matches standard models miss.

Where exact matches get buried.

Each slice tests whether hyper3-clip ranks exact visual matches higher when hierarchy, variants, or compositional details matter.

Catalog-Scale Variant Search

Retrieve the exact product variant based on fine-grained visual details.

Visual Search Query

grey velvet tufted sofa, brass legs, mid-century

Target Product
Rivet Tufted Sofa

Rivet Tufted Sofa

Mid-Century Modern, Grey Velvet, Brass Legs

hyper3-CLIP
#1
OpenAI-CLIP
#20
hyper3-CLIPRank: #1
Rivet Freemont Sofa

Rivet Freemont Sofa

Rank #1Sofa

Target
Rivet Uptown Sofa

Rivet Uptown Sofa

Rank #2Sofa

Rivet Freemont Sofa

Rivet Freemont Sofa

Rank #3Sofa

Rivet Freemont Sofa

Rivet Freemont Sofa

Rank #4Sofa

Rivet Uptown Sofa

Rivet Uptown Sofa

Rank #5Sofa

OpenAI-CLIPRank: #20
Rivet Uptown Sofa

Rivet Uptown Sofa

Rank #1Sofa

Rivet Alden Sofa

Rivet Alden Sofa

Rank #2Sofa

Rivet Uptown Sofa

Rivet Uptown Sofa

Rank #3Sofa

Rivet Bradford Sofa

Rivet Bradford Sofa

Rank #4Sofa

Rivet Accent Chair

Rivet Accent Chair

Rank #5Chair

Sibling

Why Geometry Matters

Standard models cram your data into a flat (Euclidean) space. This creates a fundamental mathematical mismatch:
  • Hierarchical data grows exponentially.
  • Euclidean space grows only polynomially.
  • The Mismatch: Flat models force exponential hierarchies into polynomial dimensions, crowding siblings together and causing retrieval failures like category bleed.
Hyperbolic space naturally expands exponentially, giving every variant room to breathe.

Flat Euclidean Space

Leaf nodes crowd tightly near the bottom. Sibling details overlap and cause retrieval confusion.

Overcrowding

Hyperbolic Space

Separation

Industry Benchmarks

We combine specialized vision-language models with an open-source inspection workbench, then test them against concrete retrieval failure modes by industry.

Industry / DatasetBenchmarkhyper3-CLIPOpenAI-CLIPReadout
Ecommerce Catalog Retrieval – Amazon Berkeley Objects
Retail catalogs500 product images, 20 product typesProduct-type mAP0.5820.552+3.05 pts
Retail catalogs50 parsed catalog departmentsDepartment mAP0.2640.212+5.20 pts
Retail catalogsParent category retrieves diverse childrenChild coverage@500.7800.655+12.50 pts
Fashion Product Search – DeepFashion In-Shop
Apparel retailSame-item product image retrievalmAP0.4070.240+16.7 pts
Apparel retailSame-item first-result recoveryRecall@10.5950.375+22.0 pts
Apparel retailSpecific typed product searchHit@100.5720.550+2.2 pts
General Visual Hierarchy – COCO Objects
Object search5,000 COCO val images, 80 categoriesCategory mAP0.5540.532+2.22 pts
Object search12 object supercategoriesSupercategory mAP0.5360.516+2.08 pts
Object searchCoverage of child types under broad labelsChild coverage@1000.8870.951CLIP +6.40 pts

hyper3-CLIPMultimodal Model

A specialized vision-language model trained with hierarchy-aware geometry to reduce category and sibling bleed in dense visual catalogs.

HyperViewInspection Workbench

An open-source viewer for mapping embedding projections, tracing nearest neighbors, and identifying the failure modes behind bad retrieval results.

Our Research

Understand how our frontier specialized models are trained from scratch, and explore our latest preprints and technical deep-dives.

arXiv:2604.09690v1 [cs.CV] 12 Apr 2026
arXiv
Are We Recognizing the Jaguar or Its Background? A Diagnostic Framework for Jaguar Re-ID
M. Mahmood & A. Rueda-Toicen
Abstract—Standard deep vision embeddings are heavily biased by background shortcuts. We introduce a diagnostic framework that evaluates background biases in wildlife monitoring datasets, showcasing substantial accuracy drops in non-aligned environments...
Proceedings of Biodiversity Computer Vision (BCV) 2026
The geometry mistake — Euclidean vs hyperbolic embedding spaces
Blog

The Geometry Mistake Behind Modern Embedding Models

Why mainstream ML's reliance on Euclidean manifolds is a mistake, and how hyperbolic spaces can efficiently encode hierarchical data with fewer dimensions.

read post →

Request a hyper3-clip eval.

Send a representative retrieval slice. We compare hyper3-clip with your current baseline and return metrics, ranked examples, and a pilot recommendation.

01

Send an eval set

Send us a small sample of your highly specific queries. Provide a handful of images, expected matches, and your current baseline.

02

We run the evaluation

We benchmark hyper3-clip against your baseline and inspect the missed exact matches.

03

Get the readout

Receive metrics, ranked examples, and a clear recommendation on whether the model is a fit.

Request Eval