01
Public registry
The ontology, examples, parent ids, and model release metadata stay in one open-source surface instead of being buried in private infrastructure.
Cold Industrial Tech
Open Tool Embeddings is building a public, community-evolving embedding set for open-source tools, with GitHub as the collaboration surface and Hugging Face as the model distribution surface.
Distribution
The repo explains the system, carries the registry, and accepts new tool contributions. The OpenToolEmbeddings organization is where staged checkpoints become public downloads once the release bundle is pushed upstream.
What This Project Is
Instead of mapping each query directly to a closed tool list, the project builds a public embedding space over tool metadata and examples. That space can support routing, retrieval, clustering, and hierarchical reasoning across open-source tooling.
01
The ontology, examples, parent ids, and model release metadata stay in one open-source surface instead of being buried in private infrastructure.
02
Tools are not only flat labels. The project treats parent ids as first-class training structure for hierarchical retrieval and broader tool families.
03
The dataset is meant to evolve with open-source tooling, not freeze around a one-time benchmark snapshot or a vendor-specific schema.
Workflow
Import or add tool manifests and examples in the registry.
Build the snapshot with explicit parent_id values.
Sync release metadata, then publish the checkpoint bundle to Hugging Face.