written 7.0 years ago by | • modified 5.6 years ago |
Hyperlink-Induced Topic Search (HITS) (also known as Hubs and authorities) is a link analysis algorithm that rates Web pages, developed by Jon Kleinberg. It determines two values for a page: its authority, which estimates the value of the content of the page, and its hub value, which estimates the value of its links to other pages.
Steps involved in HITS algorithm:
Starting from the user supplied query, HITS assembles initial set S of pages:
The initial set of pages is called root set. These pages are then expanded to a larger root set T by adding any pages that are linked to or from any page in the initial set S.
HITS then associates with each page p a hub weight h(p) and an authority weight a(p), all initialized to 1.
HITS then iteratively update the hubs and authority weights of each page. Let p → q denote “ page p has an hyperlink to page q”. HITS updates the hubs and authority as follows:
$a(p)=∑_{p →q} h(q)$
$h(p)= ∑_{q→p} a(q)$