Locality-sensitive hashing

In computer science, locality-sensitive hashing (LSH) is a fuzzy hashing technique that hashes similar input items into the same "buckets" with high probability. The number of buckets is much smaller than the universe of possible input items. Since similar items end up in the same buckets, this technique can be used for data clustering and nearest neighbor search. It differs from conventional hashing techniques in that hash collisions are maximized, not minimized. Alternatively, the technique can be seen as a way to reduce the dimensionality of high-dimensional data; high-dimensional input items can be reduced to low-dimensional versions while preserving relative distances between items.

Hashing-based approximate nearest-neighbor search algorithms generally use one of two main categories of hashing methods: either data-independent methods, such as locality-sensitive hashing (LSH); or data-dependent methods, such as locality-preserving hashing (LPH).

Locality-preserving hashing was initially devised as a way to facilitate data pipelining in implementations of massively parallel algorithms that use randomized routing and universal hashing to reduce memory contention and network congestion.

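A finite family ${\mathcal {F}}$ of functions $h\colon M\to S$ is defined to be an LSH family for a metric space ${\mathcal {M}}=(M,d)$, a threshold $r>0$, an approximation factor $c>1$, and probabilities $p_{1}>p_{2}$ if it satisfies the following condition. For any two points $a,b\in M$ and a hash function $h$ chosen uniformly at random from ${\mathcal {F}}$: if $d(a,b)\leq r$, then $h(a)=h(b)$ (that is, $a$ and $b$ collide) with probability at least $p_{1}$; if $d(a,b)\geq cr$, then $h(a)=h(b)$ with probability at most $p_{2}$.

Such a family ${\mathcal {F}}$ is called $(r,cr,p_{1},p_{2})$-sensitive.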
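As a concrete illustration, one well-known family that fits this definition is bit sampling over binary vectors of length $n$ under Hamming distance: each function returns a single coordinate, so two vectors at distance $t$ collide with probability $1-t/n$, and the family is $(r,cr,1-r/n,1-cr/n)$-sensitive. The sketch below is a minimal check of that claim, not a reference implementation; the vectors, the choice $r=2$, $c=2$, and all function names are assumptions made for the example.

```python
def hamming(a, b):
    """Number of coordinates in which two equal-length vectors differ."""
    return sum(x != y for x, y in zip(a, b))

def bit_sampling_family(dim):
    """The whole family: one hash function per coordinate, h_i(v) = v[i]."""
    return [lambda v, i=i: v[i] for i in range(dim)]

def collision_probability(family, a, b):
    """Exact Pr[h(a) == h(b)] for h drawn uniformly at random from the family."""
    return sum(h(a) == h(b) for h in family) / len(family)

# Illustrative data (assumed): 8-bit vectors, threshold r = 2, factor c = 2.
n, r, c_factor = 8, 2, 2
p1 = 1 - r / n             # lower bound on collision probability for near pairs
p2 = 1 - c_factor * r / n  # upper bound on collision probability for far pairs

a = (0, 1, 1, 0, 1, 0, 0, 1)
b = (0, 1, 1, 0, 1, 0, 1, 1)  # near a: Hamming distance 1 <= r
c = (1, 0, 0, 1, 1, 0, 1, 0)  # far from a: Hamming distance 6 >= c_factor * r

family = bit_sampling_family(n)
near_prob = collision_probability(family, a, b)
far_prob = collision_probability(family, a, c)

print(hamming(a, b), near_prob, ">=", p1)
print(hamming(a, c), far_prob, "<=", p2)
```

Because the family is small, the collision probability is computed exactly by enumerating every function rather than by sampling.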

Alternatively, it is possible to define an LSH family on a universe of items $U$ endowed with a similarity function $\phi \colon U\times U\to [0,1]$. In this setting, an LSH scheme is a family of hash functions $H$ coupled with a probability distribution $D$ over $H$ such that a function $h\in H$ chosen according to $D$ satisfies $\Pr[h(a)=h(b)]=\phi (a,b)$ for each $a,b\in U$.
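A standard example of this similarity-based form is MinHash, where $U$ is a collection of non-empty sets, $\phi$ is the Jaccard similarity, and each hash function maps a set to its smallest element under a random ordering of the underlying elements. The following sketch verifies $\Pr[h(a)=h(b)]=\phi (a,b)$ exactly on a tiny universe by enumerating every ordering; the sets, the universe, and the helper names are assumptions chosen for the example.

```python
from fractions import Fraction
from itertools import permutations

def jaccard(a, b):
    """Jaccard similarity |a & b| / |a | b| as an exact fraction."""
    return Fraction(len(a & b), len(a | b))

def minhash(rank, s):
    """Element of s that comes first under the ordering described by rank."""
    return min(s, key=rank.__getitem__)

# Illustrative data (assumed): a five-element universe and two overlapping sets.
universe = ["a", "b", "c", "d", "e"]
A = {"a", "b", "c"}
B = {"b", "c", "d"}

collisions = 0
total = 0
# Each permutation of the universe defines one hash function; the
# distribution D is uniform over all of them.
for perm in permutations(universe):
    rank = {item: pos for pos, item in enumerate(perm)}
    total += 1
    collisions += minhash(rank, A) == minhash(rank, B)

collision_prob = Fraction(collisions, total)
print(collision_prob, jaccard(A, B))
```

Enumerating all orderings is only feasible for toy universes; practical implementations approximate the random ordering with ordinary hash functions and estimate the similarity from a sample.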

Given a $(d_{1},d_{2},p_{1},p_{2})$ -sensitive family ${\mathcal {F}}$ , we can construct new families ${\mathcal {G}}$ by either the AND-construction or OR-construction of ${\mathcal {F}}$ .
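In the usual formulation, the AND-construction draws $k$ independent functions from ${\mathcal {F}}$ and declares a collision only when all of them agree, giving a $(d_{1},d_{2},p_{1}^{k},p_{2}^{k})$-sensitive family; the OR-construction declares a collision when at least one of them agrees, giving a $(d_{1},d_{2},1-(1-p_{1})^{k},1-(1-p_{2})^{k})$-sensitive family. The sketch below shows both collision rules and the resulting probabilities when an AND of $k$ functions is followed by an OR over $L$ such groups. The values of `p1`, `p2`, `k`, and `L`, and the function names, are assumptions for illustration only.

```python
def and_collide(hashes, a, b):
    """AND-construction: a and b collide only if every base function agrees."""
    return all(h(a) == h(b) for h in hashes)

def or_collide(hashes, a, b):
    """OR-construction: a and b collide if at least one base function agrees."""
    return any(h(a) == h(b) for h in hashes)

def and_probability(p, k):
    """Collision probability after AND-ing k independent base functions."""
    return p ** k

def or_probability(p, k):
    """Collision probability after OR-ing k independent base functions."""
    return 1 - (1 - p) ** k

# Illustrative parameters (assumed): base probabilities and group sizes.
p1, p2 = 0.8, 0.4
k, L = 4, 4

# AND within each group of k functions, then OR across L groups.
amplified_p1 = or_probability(and_probability(p1, k), L)
amplified_p2 = or_probability(and_probability(p2, k), L)

print(f"near pairs: {p1} -> {amplified_p1:.4f}")
print(f"far pairs:  {p2} -> {amplified_p2:.4f}")

# Tiny demonstration of the collision rules using coordinate projections.
projections = [lambda v, i=i: v[i] for i in range(3)]
x, y = (1, 0, 1), (1, 1, 1)
print(and_collide(projections, x, y), or_collide(projections, x, y))
```

With these assumed parameters the combination pushes the near-pair probability up and the far-pair probability down, which is the reason the two constructions are typically composed.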
