Jump to content

Algorithmic curation

From Wikipedia, the free encyclopedia
A feed of posts curated for a user on the Mastodon social network

Algorithm curation is the selection of online media by technologies such as recommender systems and personalized search. Curation entails the selective sharing of online content and recommendations based on inferred interests.[1] Curation algorithms implement different filter approaches, such as collaborative filtering and content-based filtering. Examples include search engine and social media products such as the Twitter feed, Facebook's News Feed, and Google Personalized Search.[2]

History

[edit]

Early algorithmic curation

[edit]

Online platforms use newsfeed algorithms to determine what content to present to each user.[3][4] The volume of content published on social media platforms created a need for automated filtering, as manual review of all available content by users is not feasible.[3][4] These systems function as a form of gatekeeper, shaping which new material users are exposed to and influencing knowledge, attention, and political exposure.[4]

Information overload

[edit]

Early ranking algorithms addressed information overload by surfacing the most recent or most popular posts.[3] Later systems shifted toward ranking content based on predicted engagement, aiming to increase the time users spend on a platform.[3] Research has found that these engagement-oriented systems can increase the spread of misinformation and contribute to political polarization as a side effect of optimising for user interaction.[4]

How algorithm changes users' feeds over time

[edit]

Algorithmic curation has been found to increase source diversity in some respects while simultaneously reducing the number of external links presented to users, which limits exposure to off-platform content.[4] Research using agent-based modelling has examined how user behaviour, information quality, and algorithmic design interact with one another over time.[3][4]

Emergence of AI

[edit]

Platforms increasingly shifted from rule-based ranking systems toward machine-learning and AI-driven approaches, which allow feeds to be personalised at a larger scale and with greater responsiveness to user behaviour.[3][4] For example, X (formerly Twitter) moved away from a chronological feed toward an AI-powered ranking system that personalises content for each user.[4] These systems are capable of making ranking decisions across volumes of content and user interactions that would not be practical to handle manually.[4]

Approach

[edit]

Filter types

[edit]

Collaborative filtering

[edit]

Collaborative filtering (CF) methods create recommendations based on a person's usage patterns.[5] CF predicts a person's preference for an item by matching their interests with those of users who have similar interests.[5] This process allows for the sharing of ratings between users with similar profiles.[5] CF is based on patterns of human behaviour rather than machine analysis of content itself.[5] Users of CF systems rate items they have interacted with, and these ratings form a profile of interests.[5] The CF system then matches that user with others who have similar profiles, and uses their ratings to generate recommendations.[5] Collaborative filtering can be applied across various content types including text, images, music, and financial products, and can account for complex attributes such as taste and quality that are difficult to represent explicitly.[6]

Content-based filtering

[edit]

Content-based filtering (CBF) builds a user profile to represent the types of items a user has engaged with, based on keywords and attributes used to describe those items.[6][7] Recommendations are generated by presenting items similar to those the user has previously engaged with or is currently viewing.[7] The CBF method creates a profile for each item based on discrete attributes and features, and then constructs a content-based user profile using a weighted vector of those features derived from items the user has rated, purchased, or interacted with.[6][7] The weights represent the relative importance of each feature, and can be computed using techniques such as Bayesian classifiers, cluster analysis, decision trees, and artificial neural networks, with the goal of estimating the probability that a user will engage with a suggested item.[6] One application of content-based filtering is Pandora Radio, where users provide an artist, genre, or composer to generate a station, and the system surfaces music with similar attributes.[6]

Technology

[edit]

Recommender system

[edit]

Recommender systems rank and suggest content to users based on a combination of implicit and explicit input.[8] Implicit signals include time spent viewing or engaging with a specific item.[8] Explicit signals include actions such as liking posts, saving store pages, reading news articles, or sharing content.[8]

[edit]

Personalized search aims to retrieve results most relevant to the user by incorporating contextual factors beyond the explicit query, such as past queries, browsing history, and inferred interests.[9] Social media platforms such as X (formerly Twitter) and Bluesky generate recommendations based on similar users and the content those users interact with.[10] Personalized search may also allow users to explicitly filter results by blocking content containing certain phrases or hashtags.[11] For first-time users without prior history, personalized search may draw on content-based filtering to establish an initial context.[6] Similar processes are used by search engines and retail platforms to tailor results and product recommendations to individual users.

AI contribution

[edit]

Artificial intelligence contributes to algorithmic curation through machine-learning models capable of processing large volumes of data.[12] Techniques such as deep learning and reinforcement learning allow curation algorithms to model user preferences with greater granularity alongside established filtering approaches.[12] This enables platforms to adjust content rankings rapidly in response to user behaviour.[12] In social media and streaming contexts, AI-driven systems arrange feeds according to predicted relevance, with the outputs shaped by patterns present in the training data.[13]

Social media and potential impact

[edit]

Echo chambers

[edit]

Social media algorithms, such as those used by X (formerly Twitter), recommend content that the system predicts a user will engage with positively. Content from accounts with differing perspectives is less likely to be surfaced, which may reduce source and topic diversity and contribute to the formation of echo chambers.[4] For example, Facebook's news feed is designed to surface content aligned with users' prior engagement, which may reinforce existing views.[14] This dynamic may contribute to filter bubbles, in which users are seldom exposed to content outside their existing interests. Users may further narrow their feeds by actively blocking certain content or accounts.[4]

Over-representation

[edit]

A pattern observed across social media platforms is the concentration of algorithmic visibility among a small subset of users. Content from the most active users, those with the largest followings, or those generating the most engagement tends to be surfaced more frequently, meaning a small number of accounts can account for a disproportionate share of what appears in other users' feeds.[4]

See also

[edit]

References

[edit]
  1. ^ Khan, Sadia; Bhatt, Ibrar (2018), "Curation", The International Encyclopedia of Media Literacy, John Wiley & Sons, Ltd, pp. 1–9, doi:10.1002/9781118978238.ieml0047, ISBN 978-1-118-97823-8, retrieved 2025-11-21
  2. ^ Berman, Ron; Katona, Zsolt (Sep 2016). "The Impact of Curation Algorithms on Social Network Content Quality and Structure". Working Papers.
  3. ^ a b c d e f Gausen, Anna; Luk, Wayne; Guo, Ce (2022-12-28). "Using Agent-Based Modelling to Evaluate the Impact of Algorithmic Curation on Social Media". J. Data and Information Quality. 15 (1): 2:1–2:24. doi:10.1145/3546915. ISSN 1936-1955.
  4. ^ a b c d e f g h i j k l Bandy, Jack; Diakopoulos, Nicholas (2021-04-22). "More Accounts, Fewer Links: How Algorithmic Curation Impacts Media Exposure in Twitter Timelines". Proc. ACM Hum.-Comput. Interact. 5 (CSCW1): 78:1–78:28. doi:10.1145/3449152.
  5. ^ a b c d e f Herlocker, Jonathan (2000). "Explaining Collaborative Filtering Recommendations". Proceedings of the 2000 ACM conference on Computer supported cooperative work. pp. 241–250. doi:10.1145/358916.358995. ISBN 1-58113-222-0.
  6. ^ a b c d e f "Online Recommender Systems – How Does a Website Know What I Want? |". Retrieved 2025-11-21.
  7. ^ a b c Wang, Donghui; Liang, Yanchun; Xu, Dong; Feng, Xiaoyue; Guan, Renchu (2018-10-01). "A content-based recommender system for computer science publications". Knowledge-Based Systems. 157: 1–9. doi:10.1016/j.knosys.2018.05.001. ISSN 0950-7051.
  8. ^ a b c Roy, Deepjyoti; Dutta, Mala (2022-05-03). "A systematic review and research perspective on recommender systems". Journal of Big Data. 9 (1): 59. doi:10.1186/s40537-022-00592-5. ISSN 2196-1115.
  9. ^ Dou, Zhicheng; Song, Ruihua; Wen, Ji-Rong (2007-05-08). "A large-scale evaluation and analysis of personalized search strategies". Proceedings of the 16th international conference on World Wide Web. Banff Alberta Canada: ACM. pp. 581–590. doi:10.1145/1242572.1242651. ISBN 978-1-59593-654-7.
  10. ^ Liu, Yuhan; Song, Emmy; Zhang, Owen Xingjian; Merriman, Jewel; Zhang, Lei; Monroy-Hernández, Andrés (2025-10-16). "Understanding Decentralized Social Feed Curation on Mastodon". Proc. ACM Hum.-Comput. Interact. 9 (7): CSCW507:1–CSCW507:25. doi:10.1145/3757688.
  11. ^ Quelle, Dorian; Bovet, Alexandre (2025-02-26). "Bluesky: Network topology, polarization, and algorithmic curation". PLOS ONE. 20 (2) e0318034. arXiv:2405.17571. Bibcode:2025PLoSO..2018034Q. doi:10.1371/journal.pone.0318034. ISSN 1932-6203. PMID 40009593.
  12. ^ a b c Lazer, David; Swire-Thompson, Briony; Wilson, Christo (2024-09-01). "A Normative Framework for Assessing the Information Curation Algorithms of the Internet". Perspectives on Psychological Science. 19 (5): 749–757. doi:10.1177/17456916231186779. ISSN 1745-6916. PMID 38010888.
  13. ^ Villermet, Quentin; Poiroux, Jérémie; Moussallam, Manuel; Louail, Thomas; Roth, Camille (2021-09-13). "Follow the guides: Disentangling human and algorithmic curation in online music consumption". Fifteenth ACM Conference on Recommender Systems. RecSys '21. New York, NY, USA: Association for Computing Machinery. pp. 380–389. arXiv:2109.03915. doi:10.1145/3460231.3474269. ISBN 978-1-4503-8458-2.
  14. ^ Papa, Venetia; Photiadis, Thomas (2021-12-15). "Algorithmic Curation and Users' Civic Attitudes: A Study on Facebook News Feed Results". Information. 12 (12): 522. doi:10.3390/info12120522. hdl:20.500.14279/32861. ISSN 2078-2489.