{"id":37471,"date":"2022-02-14T09:00:00","date_gmt":"2022-02-14T07:00:00","guid":{"rendered":"https:\/\/forklog.com\/en\/?p=37471"},"modified":"2025-08-29T17:27:12","modified_gmt":"2025-08-29T14:27:12","slug":"recommender-systems-what-they-are-and-how-they-work","status":"publish","type":"post","link":"https:\/\/forklog.com\/en\/recommender-systems-what-they-are-and-how-they-work\/","title":{"rendered":"Recommender systems: what they are and how they work"},"content":{"rendered":"<div class=\"wp-block-text-wrappers-cards single_card\">\n<h2 class=\"card_label\"><strong>What are recommender systems?<\/strong><\/h2>\n<p>Recommender systems are algorithms that select relevant goods and services based on user data.<\/p>\n<p>The technology is a subfield of machine learning.<\/p>\n<\/div>\n<div class=\"wp-block-text-wrappers-cards single_card\">\n<h2 class=\"card_label\"><strong>When did recommender systems emerge?<\/strong><\/h2>\n<p>Recommender systems are relatively new. In 1990 the technology was <a href=\"https:\/\/www.diva-portal.org\/smash\/get\/diva2:931533\/FULLTEXT01.pdf\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">first mentioned<\/a> by the Swedish scholar Jussi Karlgren, who described it as \u201ca digital bookshelf\u201d. The work laid the foundation for his later research.<\/p>\n<p>In the 2000s recommendation algorithms began to permeate e-commerce. One of the pioneers was Amazon.<\/p>\n<p>In 2006 Netflix, then a DVD-by-post subscription company, launched a <a href=\"https:\/\/ru.wikipedia.org\/wiki\/Netflix_Prize\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">competition for the best recommendation algorithm<\/a> with a $1m prize. To claim it, independent developers had to improve the accuracy of the recommendation engine by 10%. In 2009 the prize went to the team BellKor\u2019s Pragmatic Chaos.<\/p>\n<p>In the 2010s recommender systems arrived on social media. Today most popular platforms have abandoned the chronological feed in favour of an algorithmic one.<\/p>\n<\/div>\n<div class=\"wp-block-text-wrappers-cards single_card\">\n<h2 class=\"card_label\"><strong>How do recommender systems work?<\/strong><\/h2>\n<p>Two principal approaches are used today in recommender systems: collaborative filtering and content-based models.<\/p>\n<p>The core idea of collaborative filtering is to generate recommendations using data from other users with similar interests. Filtering can be <strong>user-based<\/strong> or <strong>item-based<\/strong>.<\/p>\n<p>The main task of a <strong>user-based<\/strong> algorithm is to find users with the most similar tastes based on the products they have consumed and the ratings they have given. Suppose Anna and Vadim bought juice, a bun and yoghurt. It is also known that Maksim often buys juice and buns. He should therefore be recommended yoghurt.<\/p>\n<p><strong>Item-based<\/strong> recommendations tackle the problem from the other side: find similar items and see how they were rated before. Let us try to infer whether Maksim likes yoghurt. We know he likes juice and buns. As a food product, yoghurt has similar characteristics. We can therefore assume he will like it.<\/p>\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"625\" src=\"https:\/\/forklog.com\/wp-content\/uploads\/recommendation_sys_strats-1024x625.png\" alt=\"What are recommender systems?\" class=\"wp-image-164957\" srcset=\"https:\/\/forklog.com\/wp-content\/uploads\/recommendation_sys_strats-1024x625.png 1024w, https:\/\/forklog.com\/wp-content\/uploads\/recommendation_sys_strats-300x183.png 300w, https:\/\/forklog.com\/wp-content\/uploads\/recommendation_sys_strats-768x469.png 768w, https:\/\/forklog.com\/wp-content\/uploads\/recommendation_sys_strats.png 1200w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption>The logic of determining similarity in user-based (left) and item-based (right) filtering.<\/figcaption><\/figure>\n<p>The aim of collaborative filtering is to find a user who has rated a particular item and to compute the correlation coefficient between the vectors of their ratings across all items in the database. A common method is <a href=\"https:\/\/ru.wikipedia.org\/wiki\/%D0%9C%D0%B5%D1%82%D0%BE%D0%B4_k-%D0%B1%D0%BB%D0%B8%D0%B6%D0%B0%D0%B9%D1%88%D0%B8%D1%85_%D1%81%D0%BE%D1%81%D0%B5%D0%B4%D0%B5%D0%B9\">k-nearest neighbours<\/a>.<\/p>\n<p>At the centre of a <strong>content-based<\/strong> model is the item itself. User ratings are not required for the algorithm to work. What matters are properties that characterise the item: author, genre, country of origin, manufacturer, and so on. Not all attributes are relevant to the consumer, so it is best to focus on the key ones.<\/p>\n<p>Content-based models have grown popular in recent years. They do not require lengthy training; developers can start recommending items right away.<\/p>\n<p>There are drawbacks. Many users have noticed that after searching for a particular product on Google they are \u201cchased\u201d by ads offering to buy it in some online shop. To reduce complaints about irrelevant ads, developers complement such algorithms with knowledge-based models. These also do not rely on ratings, using only user and item profiles.<\/p>\n<\/div>\n<div class=\"wp-block-text-wrappers-cards single_card\">\n<h2 class=\"card_label\"><strong>How do recommender systems collect data?<\/strong><\/h2>\n<p>Data for recommendation algorithms can be collected explicitly or implicitly.<\/p>\n<p>Explicit methods include asking users to rate items on a scale, rank them from best to worst, compare two similar products, or compile a favourites list. The key point is that the user understands their data are used by algorithms and consents to processing.<\/p>\n<p>With implicit methods, site visitors are not always aware that their actions may feed recommender systems. This includes cookies, Google or Facebook ad trackers, detailed analysis of interactions with videos, and so on.<\/p>\n<p>Governments in many countries typically require sites to notify visitors about such data collection. Users, however, cannot always opt out.<\/p>\n<\/div>\n<div class=\"wp-block-text-wrappers-cards single_card\">\n<h2 class=\"card_label\"><strong>Where are recommender systems used?<\/strong><\/h2>\n<p>As noted, recommender systems are widely used in e-commerce. Online shops use them to suggest relevant products in the \u201cYou might also like\u201d section or to offer complementary items directly in the basket. If a product is out of stock, algorithms can surface alternatives.<\/p>\n<p>Email newsletters also often contain personalised recommendations.<\/p>\n<p>Retailers such as Amazon, Ozon and Wildberries use these algorithms.<\/p>\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"520\" src=\"https:\/\/forklog.com\/wp-content\/uploads\/amazon-recommendations-1-1024x520.png\" alt=\"What are recommender systems?\" class=\"wp-image-164973\" srcset=\"https:\/\/forklog.com\/wp-content\/uploads\/amazon-recommendations-1-1024x520.png 1024w, https:\/\/forklog.com\/wp-content\/uploads\/amazon-recommendations-1-300x152.png 300w, https:\/\/forklog.com\/wp-content\/uploads\/amazon-recommendations-1-768x390.png 768w, https:\/\/forklog.com\/wp-content\/uploads\/amazon-recommendations-1.png 1348w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption>Recommendations on an Amazon product page. Data: Amazon.<\/figcaption><\/figure>\n<p>Major streaming services also use recommender systems, including Netflix, Spotify, Apple Music, Yandex Music, YouTube, Megogo and others.<\/p>\n<p>Recommendation algorithms are widespread on social networks. Facebook, Twitter, Instagram, VKontakte and others have for years shown users content assembled by algorithms. Only a few allow switching to a chronological feed.<\/p>\n<\/div>\n<div class=\"wp-block-text-wrappers-cards single_card\">\n<h2 class=\"card_label\"><strong>What problems do recommender systems have?<\/strong><\/h2>\n<p>Recommender systems have several limitations. One is the cold-start problem\u2014when the algorithm lacks sufficient data to operate well. This is typical for a new or unpopular item rated by few users, or for an atypical consumer whose preferences diverge markedly from the average.<\/p>\n<p>In such cases ratings are adjusted artificially. For example, instead of a simple mean, a smoothed average is used. With few reviews, an item\u2019s rating will tend towards a \u201csafe average\u201d; once enough real ratings accumulate, the artificial smoothing is switched off.<\/p>\n<p>Another issue is bias. Poorly tuned algorithms, built-in stereotypes and user behaviour can all distort rankings.<\/p>\n<p>In 2021 Facebook\u2019s advertising algorithms <a href=\"https:\/\/www.reuters.com\/article\/us-facebook-advertising-idUSKBN2BW1LG\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">disproportionately showed<\/a> different job ads to men and women. Twitter\u2019s auto-cropping tool for the home feed tended to focus on young, slim women.<\/p>\n<p>In both cases developers quickly fixed the errors, but that is not always possible. Google is frequently criticised over how its recommendation algorithms work.<\/p>\n<p>For example, results for the queries \u201cathletes\u201d and \u201cfemale athletes\u201d differ markedly. For men the algorithms surface articles about professional achievements. For women the system returns various rankings of \u201cattractiveness\u201d and \u201csexiness\u201d.<\/p>\n<figure class=\"wp-block-gallery has-nested-images columns-default is-cropped wp-block-gallery-1 is-layout-flex wp-block-gallery-is-layout-flex\">\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"583\" data-id=\"164970\" src=\"https:\/\/forklog.com\/wp-content\/uploads\/google-search-male-athletes-1-1024x583.png\" alt=\"What are recommender systems?\" class=\"wp-image-164970\" srcset=\"https:\/\/forklog.com\/wp-content\/uploads\/google-search-male-athletes-1-1024x583.png 1024w, https:\/\/forklog.com\/wp-content\/uploads\/google-search-male-athletes-1-300x171.png 300w, https:\/\/forklog.com\/wp-content\/uploads\/google-search-male-athletes-1-768x438.png 768w, https:\/\/forklog.com\/wp-content\/uploads\/google-search-male-athletes-1.png 1348w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"584\" data-id=\"164969\" src=\"https:\/\/forklog.com\/wp-content\/uploads\/google-search-female-athletes-1024x584.png\" alt=\"What are recommender systems?\" class=\"wp-image-164969\" srcset=\"https:\/\/forklog.com\/wp-content\/uploads\/google-search-female-athletes-1024x584.png 1024w, https:\/\/forklog.com\/wp-content\/uploads\/google-search-female-athletes-300x171.png 300w, https:\/\/forklog.com\/wp-content\/uploads\/google-search-female-athletes-768x438.png 768w, https:\/\/forklog.com\/wp-content\/uploads\/google-search-female-athletes.png 1347w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure><figcaption class=\"blocks-gallery-caption\">Google search results for \u201cathletes\u201d and \u201cfemale athletes\u201d. Data: Google.<\/figcaption><\/figure>\n<p>Search results can be influenced not only by users but also by bots. In 2018 Reddit users staged deliberate <a href=\"https:\/\/www.vice.com\/en\/article\/ev8zbz\/trumps-face-is-the-first-google-image-search-result-for-idiot\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">manipulations<\/a> of Google\u2019s algorithms so that a photo of former US president Donald Trump appeared for the query \u201cidiot\u201d.<\/p>\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"551\" src=\"https:\/\/forklog.com\/wp-content\/uploads\/google-search-trump-1024x551.png\" alt=\"What are recommender systems?\" class=\"wp-image-165006\" srcset=\"https:\/\/forklog.com\/wp-content\/uploads\/google-search-trump-1024x551.png 1024w, https:\/\/forklog.com\/wp-content\/uploads\/google-search-trump-300x161.png 300w, https:\/\/forklog.com\/wp-content\/uploads\/google-search-trump-768x413.png 768w, https:\/\/forklog.com\/wp-content\/uploads\/google-search-trump-1536x826.png 1536w, https:\/\/forklog.com\/wp-content\/uploads\/google-search-trump-2048x1101.png 2048w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption>Donald Trump appearing in results for the query \u201cidiot\u201d. Data: Google.<\/figcaption><\/figure>\n<p>During a congressional hearing on the incident, the company\u2019s chief executive Sundar Pichai <a href=\"https:\/\/www.bbc.com\/news\/technology-46521887\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">said<\/a> that employees do not intervene in ranking. According to him, algorithms do this on their own, scanning millions of search strings and ranking them by more than 200 parameters.<\/p>\n<p>Developers of recommender systems can also exploit algorithmic bias. In October 2021 a former Facebook employee published documents showing the deliberate use of \u201charmful\u201d tools on the platform. She said top managers knew the algorithms were intolerant towards vulnerable groups. But the company was slow to fix the errors because such content drove stronger user engagement and boosted advertising revenue.<\/p>\n<\/div>\n<p>Subscribe to ForkLog news on Telegram: <a href=\"https:\/\/t.me\/forklogAI\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">ForkLog AI<\/a> \u2014 all the news from the world of AI!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A guide to recommender systems: how they work, where they are used, and their strengths and weaknesses.<\/p>\n","protected":false},"author":1,"featured_media":37472,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"select":"1","news_style_id":"1","cryptorium_level":"2","_short_excerpt_text":"","creation_source":"","_metatest_mainpost_news_update":false,"footnotes":""},"categories":[2113],"tags":[2130,438],"class_list":["post-37471","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-cryptorium","tag-101-artificial-intelligence","tag-artificial-intelligence"],"aioseo_notices":[],"amp_enabled":true,"views":"80","promo_type":"1","layout_type":"1","short_excerpt":"","is_update":"","_links":{"self":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts\/37471","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/comments?post=37471"}],"version-history":[{"count":1,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts\/37471\/revisions"}],"predecessor-version":[{"id":37473,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts\/37471\/revisions\/37473"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/media\/37472"}],"wp:attachment":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/media?parent=37471"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/categories?post=37471"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/tags?post=37471"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}