Dataset Builder

Generate labeled datasets by searching for nearest neighbors in HSV and/or MobileNet embedding space.

Exact filename of the anchor image to search around.
Optional. Leave blank to search across all sites.
Embedding space used for nearest-neighbor search.
Label to assign to exported dataset rows.
Only used when class label is 'other'.
Maximum number of rows returned after merging, sorting, and deduplication.
Minimum desired row count. A warning is shown if fewer rows are found.
Optional maximum embedding distance (raw vector distance). Lower values are stricter.
When model=Both, fetch this many candidates per model before merging.
Optional window of +/- this many days around the anchor timestamp.
Keep the best match per time bucket of this many minutes. 0 disables deduplication.
Number of random sample images shown in the grid preview.
Download the CSV now, or store the dataset rows in the database first.
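
The options above describe a filter, merge, sort, dedupe, and cap pipeline. The following is a minimal Python sketch of how those steps could fit together, not the tool's actual implementation. The names `Candidate` and `build_rows` are hypothetical, and the sketch assumes that merging keeps the lowest distance per filename and that distances from the two models can be compared directly, which may not hold for raw HSV and MobileNet distances.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional

EPOCH = datetime(1970, 1, 1)  # fixed reference point for time buckets


@dataclass(frozen=True)
class Candidate:
    """Hypothetical search hit; field names are illustrative only."""
    filename: str
    timestamp: datetime
    distance: float  # raw embedding distance, lower is closer
    model: str       # e.g. "hsv" or "mobilenet"


def build_rows(
    candidates: List[Candidate],
    anchor_time: datetime,
    max_rows: int,
    max_distance: Optional[float] = None,
    day_window: Optional[int] = None,
    bucket_minutes: int = 0,
) -> List[Candidate]:
    kept = list(candidates)

    # Optional distance threshold: lower is stricter.
    if max_distance is not None:
        kept = [c for c in kept if c.distance <= max_distance]

    # Optional +/- day window around the anchor timestamp.
    if day_window is not None:
        window = timedelta(days=day_window)
        kept = [c for c in kept if abs(c.timestamp - anchor_time) <= window]

    # Merge (assumption): one row per filename, keeping the lowest distance.
    best = {}
    for c in kept:
        if c.filename not in best or c.distance < best[c.filename].distance:
            best[c.filename] = c

    # Sort so the closest matches come first.
    merged = sorted(best.values(), key=lambda c: c.distance)

    # Time-bucket dedupe: keep the best match per bucket; 0 disables it.
    if bucket_minutes > 0:
        bucket = timedelta(minutes=bucket_minutes)
        seen = set()
        deduped = []
        for c in merged:
            key = (c.timestamp - EPOCH) // bucket  # integer bucket index
            if key not in seen:
                seen.add(key)
                deduped.append(c)
        merged = deduped

    # Cap the row count last, after merge, sorting, and dedupe.
    return merged[:max_rows]


anchor = datetime(2024, 1, 1, 12, 0)
hits = [
    Candidate("a.jpg", datetime(2024, 1, 1, 12, 0), 0.1, "hsv"),
    Candidate("a.jpg", datetime(2024, 1, 1, 12, 0), 0.3, "mobilenet"),
    Candidate("b.jpg", datetime(2024, 1, 1, 12, 5), 0.2, "hsv"),
    Candidate("c.jpg", datetime(2024, 1, 1, 12, 30), 0.4, "mobilenet"),
    Candidate("d.jpg", datetime(2024, 1, 1, 12, 40), 0.9, "hsv"),
    Candidate("e.jpg", datetime(2024, 1, 10, 12, 0), 0.05, "hsv"),
]
rows = build_rows(hits, anchor, max_rows=10, max_distance=0.5,
                  day_window=2, bucket_minutes=10)
print([r.filename for r in rows])
```

In this sketch the row cap is applied last, matching the description of the maximum row count. Comparing the returned row count against the minimum desired row count would then decide whether to show the warning.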

Summary

Run a search to see summary info.