Clip | Hicham Assoudi

Gradio UI with image and text search tabs returning ranked car damage images from Oracle 26ai, with a dataframe showing claim ID, label, cosine distance, and description

CLIP in Oracle 26ai: A Gradio UI for Qualitative Encoder Validation

SQL distance metrics confirm that CLIP’s shared vector space is geometrically organized; a Gradio UI shows whether the ranked results are also meaningful to a human reviewer. This post builds that prototype and uses it to surface CLIP’s real constraints on a car damage dataset: class confusion in the dent category, complete accuracy inversion on French-language glass shatter queries, and a search engine that returns results for every input regardless of domain fit.

CLIP ViT-B/32 dual-encoder architecture with image and text encoders producing 512-dim vectors in a shared space, loaded into Oracle 26ai via DBMS_VECTOR.LOAD_ONNX_MODEL

Building a Multimodal Visual Similarity Pipeline Inside Oracle 26ai with CLIP ONNX Models

Oracle 26ai can act as a compact environment for multimodal visual similarity experiments. This post shows how to use real HuggingFace image/text data, Oracle’s pre-built CLIP ViT-B/32 ONNX models, VECTOR columns, and SQL-based similarity search to build and validate an image similarity pipeline inside the database.
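
To make the ranking idea concrete, here is a minimal pure-Python sketch of what a cosine-distance similarity search computes. It is not the post's SQL: in the database the same ordering would come from Oracle's `VECTOR_DISTANCE` function with the `COSINE` metric. The function names, the claim IDs, and the 3-dimensional toy vectors (standing in for 512-dimensional CLIP embeddings) are all made up for illustration.

```python
import math


def cosine_distance(a, b):
    """Cosine distance = 1 - cosine similarity (0.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def rank_by_cosine(query, rows, top_k=3):
    """Return the top_k (row_id, distance) pairs, nearest first."""
    scored = [(row_id, cosine_distance(query, vec)) for row_id, vec in rows]
    scored.sort(key=lambda pair: pair[1])
    return scored[:top_k]


# Hypothetical rows: toy 3-dim vectors, not real CLIP embeddings.
rows = [
    ("claim_a", [1.0, 0.0, 0.0]),
    ("claim_b", [0.0, 1.0, 0.0]),
    ("claim_c", [1.0, 1.0, 0.0]),
]
query = [1.0, 0.0, 0.0]

ranked = rank_by_cosine(query, rows, top_k=2)
for row_id, dist in ranked:
    print(f"{row_id}: {dist:.4f}")
```

Note that a top-k ranking like this always returns k rows, however distant the best match is, so a small distance is evidence of similarity while a returned row on its own is not.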

Path from HuggingFace DamageCarDataset through Python import to an Oracle 26ai table with BLOB, CLOB, and empty VECTOR column, ready for CLIP embedding

HuggingFace Datasets in Oracle 26ai: Jump-Starting CLIP Vector Search Experiments

Before experimenting with CLIP-based image and text similarity in Oracle 26ai, you need data that is real enough to produce meaningful results. Oracle’s documentation examples are toy-scale, and production claims data isn’t suitable for a local proof of concept. A public HuggingFace dataset fills the gap. This post shows how to import tahaman/DamageCarDataset into Oracle 26ai and set up the table structure that the entire CLIP experiment series runs on.