Algorithms / Techniques for a Ranked Website (PageRank aside)

I am looking for algorithms / methods that can represent the importance of a single web page. Leaving aside the PageRank, are there other methods of such a rating based on the content, structure and hyperlinks to each other?

I'm not just talking about connecting from www.foo.com to www.bar.com, like PageRank, but also from www.foo.com/bar to www.foo.com/baz, etc. (next to the fact that PageRank has adapted to these needs)

How I β€œdefine” importance: I think of importance in this context as β€œhow relevant this side is to the user, and also how important it is to the rest of the site”.
For instance. The Christmas lottery is announced on the start page, and only one link leading to this site is more important for both the user and the site. The fingerprint that has a link from each site (since it is mostly somewhere in the footer) does not matter, although it has many links to it. The fingerprint is also not important for the site as a β€œunit”, since it does not provide real value for the page intended for viewing (= providing information, selling products, general maintenance, etc.).

+4
source share
2 answers

There is also SALSA , which is more stable than HITS [so it suffers less from spam].

Since you are also interested in the context of the pages, you can look at the work of Havelivala on the topic, page sensitivity

+1
source

Another well-known algorithm is Hubs and Authorities (HITS) . Basically, you classify your page as a hub (page with a lot of outgoing links) and authorities (page with a lot of inbound links).

But you must really determine what you mean by importance. What is really important? PageRank defines it with respect to incoming links. These are PageRank definitions.

If you determine the importance of photography because you like photography. Then you can come up with an important metric, such as the number of photos per page. Another indicator may be the number of inbound links from a photo site (e.g. flickr.com , 500px , ...)

Using your definition of importance, you can use β€œ1- (the number of inbound links divided by the number of pages on the site). This gives you a number from 0 to 1. 0 means it doesn't matter, but 1 means value.

Using this indicator, your fingerprint that appears on all pages of the site has a value of 0. Your Christmas sale page, which has only one link, has a value of almost 1

+1
source

All Articles