Recent from talks
Nothing was collected or created yet.
Social graph
View on Wikipedia

A social graph is a graph that represents social relations between entities. It is a model or representation of a social network. The social graph has been referred to as "the global mapping of everybody and how they're related".[1]
The term was used as early as 1964, albeit in the context of isoglosses.[2] Leo Apostel uses the term in the context here in 1978.[3] The concept was originally called sociogram.
The term was popularized at the Facebook F8 conference on May 24, 2007, when it was used to explain how the newly introduced Facebook Platform would take advantage of the relationships between individuals to offer a richer online experience.[4] The definition has been expanded to refer to a social graph of all Internet users.
Since explaining the concept of the social graph, Mark Zuckerberg, one of the founders of Facebook, has often touted Facebook's goal of offering the website's social graph to other websites so that a user's relationships can be put to use on websites outside Facebook's control.[5]
Facebook's social graph
[edit]As of 2010[update], Facebook's social graph is the largest social network dataset in the world,[6] and it contains the largest number of defined relationships between the largest number of people among all websites because it is the most widely used social networking service in the world.[7]
Impact
[edit]Facebook's social graph played a crucial role in the rapid growth of the company by increasing the engagement of its users, optimizing what each user sees in their feed and enabling an extremely efficient advertising policy. With their social graph, Facebook created a huge network of their platform's users which enabled them to grow exponentially.[8]
One of the stars features of Facebook is its feed – what each user sees in their app. Facebook's feed is mainly distributed using its Social Graph. Instead of displaying random publication from random users, the graph allows the app to display personalized content based on each user's previous interactions. This individualized approach enhances the experience that the app offers which increase users' engagement towards the social media application. Likes, shares and comments also play a key role in the social graph's layout, by reinforcing interactions and visibility between two users who enjoy the same classes of entertainment.[9]
Analysis
[edit]Facebook's social graph has been analyzed by multiple papers. In 2011, a study[10] confirmed the six degrees of separation phenomenon on the scale of the graph.
Data storing
[edit]Social graphs are typically stored using graph databases, which utilize graph query languages to manage and query relationships efficiently.
For the storing of its social graph, Facebook relies on TAO (The Associations and Objects), a custom-built, distributed system optimized for fast read operations at a massive scale.[11]
Issues
[edit]Several issues have come forward regarding the existing implementation of the social graph owned by Facebook. For example, currently, a social networking service is unaware of the relationships forged between individuals on a different service. This creates an online experience that is not seamless, and instead provides for a fragmented experience due to the lack of an openly available graph between services. In addition, existing services define relationships differently.
Concern has also focused on the fact that Facebook's social graph is owned by the company and is not shared with other services, giving it a major advantage over other services and preventing its users from taking their graph with them to other services when they wish to do so, such as when a user is dissatisfied with Facebook.
Google has attempted to offer a solution to this problem by creating the Social Graph API, released in January 2008,[12] which allows websites to draw publicly available information about a person to form a portable identity of the individual, in order to represent a user's online identity.[13] This did not, however, experience Google's desired uptake and was thus retired in 2012.[14]
Facebook introduced its own Graph API at the 2010 f8 conference. Both companies monetise collected data sets through direct marketing and social commerce.[15] In December 2016, Microsoft acquired LinkedIn for $26.2 billion.[16]
Lastly, massive use of Social Graph raised ethical questions and confidentiality problems. The Cambridge Analytica scandal in 2018[17] displayed to the open world how other apps had used data of the social graph to do political profiling, which sparked global outrage. Moreover, extreme personalization algorithms caused another problematic effect – the creation of filter bubble and echo chambers, reinforcing user's existing beliefs which influenced public debates.[18] These concerns led to the adoption of stricter regulations on data protection, like the California Consumer Privacy Act, forcing Facebook to change its way of using data.[19]
Twitter's social graph
[edit]As of 2012, Twitter is the most popular micro-blogging service in the world. Unlike classical social networks (e.g., Facebook), the relation between Twitter users is unidirectional, which makes information propagation in Twitter much closer to how information propagates in real life.
In 2012, Twitter's social graph consisted of 537 million Twitter accounts connected by 23.95 billion links.[20]
Open Graph
[edit]Facebook's Graph API allows websites to draw information about more objects than simply people, including photos, events, and pages, and their relationships between each other. This expands the social graph concept to more than just relationships between individuals and instead applies it to virtual non-human objects between individuals, as well.[21]
Other uses of Social Graph
[edit]The concept of the social graph can be extended to other uses than online social networks. It finds uses in multiple fields where interconnected relationships can be found. For companies, the idea of an Enterprise social graph has been explored.[22]
In sports, most commonly in team's sports, interactions between players and teams can be studied to enhance performances, such as the amount of passes between two specific players in football, proximity and distance between players in basketball.[23] Those interactions can be modeled through a social graph and can lead to strategy optimization. In statistical studies, social graphs can map the spread of diseases in a society.
See also
[edit]References
[edit]- ^ "Facebook: One Social Graph to Rule Them All?". CBS News. Retrieved July 11, 2010.
- ^ "Graphic Representation of Social Isoglosses". Anthropological Linguistics. Volume 6. Issue 4. Horne, Kibbey M. February 1964.
- ^ Apostel, Leo (1978). "The Elementary Theory of Collective Action" (PDF). Philosophica, (Volume 21). Archived from the original (PDF) on November 10, 2021.
- ^ "Facebook Unveils Platform for Developers of Social Applications". Facebook. Retrieved July 11, 2010.
- ^ "Facebook's Zuckerberg uncorks the social graph". ZDNet. Retrieved July 11, 2010.
- ^ "One Graph To Rule Them All?". A VC. Retrieved July 11, 2010.
- ^ "Facebook: No. 1 Globally". BusinessWeek. Retrieved July 11, 2010.
- ^ Fattal, Alex (2012). "Facebook: Corporate Hackers, a Billion Users, and the Geo-politics of the "Social Graph"". Anthropological Quarterly. 85 (3): 927–955. doi:10.1353/anq.2012.0051. ISSN 1534-1518.
- ^ Wilson, Christo; Boe, Bryce; Sala, Alessandra; Puttaswamy, Krishna P.N.; Zhao, Ben Y. (2009-04-01). "User interactions in social networks and their implications". Proceedings of the 4th ACM European conference on Computer systems. EuroSys '09. New York, NY, USA: Association for Computing Machinery. pp. 205–218. doi:10.1145/1519065.1519089. ISBN 978-1-60558-482-9.
- ^ Ugander, Johan; Karrer, Brian; Backstrom, Lars; Marlow, Cameron (2011-11-18), The Anatomy of the Facebook Social Graph, arXiv:1111.4503
- ^ "TAO: The power of the graph". Engineering at Meta. 2013-06-25. Retrieved 2025-03-31.
- ^ "Google Launches Social Graph API". InformationWeek. Retrieved July 11, 2010.
- ^ "Is Google's social graph API a Creeping Privacy Violation?". ReadWriteWeb. Retrieved July 11, 2010.
- ^ "Renewing old resolutions for the new year". Official Google Blog. Retrieved August 14, 2012.
- ^ Barok, Dušan (2011-05-22). "Privatising Privacy: Trojan Horse in Free Open Source Distributed Social Platforms". Retrieved 2011-10-15.
- ^ Lunden, Ingrid (2016-12-08). "Microsoft officially closes its $26.2B acquisition of LinkedIn". TechCrunch. Retrieved 2017-11-24.
- ^ Hinds, Joanne; Williams, Emma J.; Joinson, Adam N. (2020-11-01). ""It wouldn't happen to me": Privacy concerns and perspectives following the Cambridge Analytica scandal". International Journal of Human-Computer Studies. 143 102498. doi:10.1016/j.ijhcs.2020.102498. hdl:1983/f6b54d4c-8afc-4aa2-a859-1699501e5d2f. ISSN 1071-5819.
- ^ Wolfowicz, Michael; Weisburd, David; Hasisi, Badi (2023-03-01). "Examining the interactive effects of the filter bubble and the echo chamber on radicalization". Journal of Experimental Criminology. 19 (1): 119–141. doi:10.1007/s11292-021-09471-0. ISSN 1572-8315.
- ^ Brinckerhoff, Rosie (2018). "Social Network or Social Nightmare: How California Courts Can Prevent Facebook's Frightening Foray into Facial Recognition Technology from Haunting Consumer Privacy Rights Forever". Federal Communications Law Journal. 70: 105.
- ^ Gabielkov, Maksym; Legout, Arnaud (2012-12-10). "The Complete Picture of the Twitter Social Graph".
{{cite journal}}: Cite journal requires|journal=(help) - ^ "Graph API". Facebook. Retrieved July 11, 2010.
- ^ Woods, Dan. "Building The Enterprise Social Graph". Forbes. Retrieved 2025-09-09.
- ^ Clemente, Filipe Manuel; Martins, Fernando Manuel Lourenço; Mendes, Rui Sousa (2016). "Social Network Analysis Applied to Team Sports Analysis". SpringerBriefs in Applied Sciences and Technology. doi:10.1007/978-3-319-25855-3. ISBN 978-3-319-25854-6. ISSN 2191-530X.
External links
[edit]Social graph
View on GrokipediaDefinition and Conceptual Foundations
Core Definition
A social graph is a mathematical model derived from graph theory that depicts social networks as consisting of nodes representing entities—such as individuals, organizations, or groups—and edges representing the relationships or interactions between those entities, such as friendships, follows, or collaborations.[4] This structure captures the topology of connections within a population, enabling quantitative analysis of properties like centrality, clustering, and path lengths between nodes.[8] The concept formalizes real-world social structures by abstracting interpersonal ties into a directed or undirected graph, where edge weights may quantify interaction strength or frequency, as seen in datasets from online platforms tracking user behaviors like messaging or endorsements.[9] In computational terms, social graphs facilitate algorithms for tasks such as community detection or influence propagation, grounded in the premise that social influence correlates with network position rather than isolated attributes.[10] The term "social graph" gained prominence in 2007 when Facebook CEO Mark Zuckerberg described it as the underlying network of user connections powering platform features and third-party applications, emphasizing its role in distributing content through interpersonal links.[11] This usage highlighted the graph's scalability to billions of nodes, though empirical studies confirm that real social graphs exhibit small-world properties, with average path lengths around 4-6 in large-scale networks like early Facebook data.[12]Historical Origins and Evolution
![Sociogram representing social network analysis][float-right]The application of graph theory to social relationships originated in early 20th-century sociology, building on mathematical foundations laid by Leonhard Euler's 1736 solution to the Seven Bridges of Königsberg problem, which formalized the study of networks as nodes and edges.[13] Sociologist Georg Simmel's 1908 analysis of dyads and triads provided conceptual precursors by examining how social structures emerge from interpersonal ties, influencing later network thinking.[14] Jacob L. Moreno advanced this in 1934 with sociometry, introducing sociograms—visual diagrams mapping individuals as nodes and their relations as directed edges based on empirical choices, such as preferences in group settings.[15] These tools quantified social dynamics, revealing isolates, cliques, and centrality, and were applied in clinical and educational contexts to diagnose group structures.[16] By the mid-20th century, anthropologists like Clyde Kluckhohn and sociologists like Mark Granovetter extended these methods, incorporating concepts like weak ties in 1973 to explain information diffusion and opportunity structures.[14] The specific term "social graph" appeared in academic contexts by the late 1970s but gained prominence in computing through Facebook's 2007 F8 conference, where CEO Mark Zuckerberg described it as a universal map of human connections stored digitally for scalable querying and personalization.[17] This marked a shift from manual, small-scale sociograms to vast, algorithmically processed databases; by 2012, Facebook's graph encompassed over 1 billion users and trillions of edges, enabling features like friend recommendations via metrics such as common neighbors.[7] Evolution continued with decentralized protocols and semantic extensions, but the core digital social graph retained graph-theoretic principles for modeling persistent relations amid transient interactions.[13]
Technical Foundations
Graph Theory Basics
In graph theory, a graph is formally defined as a pair consisting of a set of vertices, also known as nodes, and a set of edges, which represent connections between pairs of vertices.[18] Vertices typically model discrete entities, such as individuals in a social network, while edges capture pairwise relationships, like friendships or communications.[19] This structure abstracts relational data without regard to geometric embedding, focusing solely on incidence relations between elements.[20] Graphs are classified as undirected or directed based on edge symmetry. In an undirected graph, edges form unordered pairs , implying bidirectional relations, as in mutual acquaintances where the connection lacks inherent direction.[21] Directed graphs, or digraphs, use ordered pairs , suitable for asymmetric ties like follower relationships in social platforms, where .[22] Simple graphs prohibit self-loops (edges from a vertex to itself) and multiple edges between the same pair, though multigraphs and weighted variants extend these for richer modeling, assigning numerical values to edges to quantify interaction strength.[23] Fundamental properties include the degree of a vertex, defined as the number of edges incident to it—in undirected graphs, this counts neighbors directly; in directed graphs, in-degree and out-degree distinguish incoming and outgoing ties.[8] A path is a sequence of distinct vertices connected by consecutive edges, enabling measures of reachability; a graph is connected if a path exists between every pair of vertices, otherwise comprising disconnected components.[24] Cycles, closed paths returning to the starting vertex, underpin analyses of redundancy and structure, while adjacency—whether vertices share an edge—forms the basis for matrix representations like the adjacency matrix, where entry if an edge exists from to , facilitating computational traversal and analysis.[25] These elements provide the foundational toolkit for modeling social graphs, where vertices represent users and edges denote interactions.Modeling Relationships and Properties
In social graph modeling, entities such as individuals, organizations, or content items are represented as nodes (or vertices), while the connections between them—such as friendships, follows, collaborations, or endorsements—are modeled as edges (or links).[8][1] This structure draws from graph theory, where a graph consists of a vertex set and an edge set , enabling the quantification of relational patterns like connectivity and influence.[22] Edges in social graphs can be undirected, indicating symmetric relationships where the connection is mutual and bidirectional, as in traditional friendships where if A knows B, then B knows A.[26][22] In contrast, directed edges (or arcs) capture asymmetric ties, such as one-way follows on platforms like Twitter, where the direction from source to target matters and reciprocity is not assumed.[27][28] Directed graphs are particularly suited to modeling influence flows or citations, whereas undirected graphs simplify analysis of cohesive groups but may overlook directional asymmetries in real-world interactions.[22] Properties and attributes enhance the expressiveness of these models by attaching metadata to nodes and edges. Node properties might include demographic details like age, location, or role, allowing for segmentation in analyses such as community detection.[29] Edge properties can specify attributes like relationship strength (via weights, e.g., frequency of interaction), timestamps of formation, or types (e.g., familial versus professional), which support weighted graph algorithms for measuring tie robustness.[30][4] In labeled property graph models, both nodes and edges carry labels for categorization, facilitating queries over multifaceted relationships, though this increases storage complexity compared to simple graphs.[4] Advanced modeling accommodates complexity beyond basic graphs, such as multigraphs that permit multiple edges between the same node pair to represent diverse relation types (e.g., colleague and friend simultaneously).[31] Hypergraphs extend this by allowing edges to connect multiple nodes, capturing group interactions like joint authorship or shared events that pairwise edges cannot fully represent.[2] These extensions preserve causal insights into network dynamics, such as how edge weights correlate with persistence, but require careful validation against empirical data to avoid overparameterization.[1]Key Implementations in Centralized Platforms
Facebook's Social Graph
Facebook's social graph constitutes a directed graph structure modeling its users as nodes and interpersonal connections—primarily friendships, but extending to follows, family ties, and other associations—as edges, enabling the platform's core functionality of surfacing relevant content and recommendations. Mark Zuckerberg introduced the term publicly on May 24, 2007, at the inaugural f8 developer conference, framing the social graph as the underlying network of human relationships that developers could access via the newly launched Facebook Platform to build interconnected applications.[11] [32] This conceptualization positioned the graph not merely as data storage but as a foundational layer for interoperability, allowing apps to query and incorporate users' social contexts without rebuilding relational mappings from scratch.[33] Early implementations relied on a MySQL-based relational database augmented by memcache for caching frequent reads, treating the graph as a "lookaside" system where edge data was fetched on demand during PHP queries.[34] As user growth accelerated—reaching 50 million active users by 2008—the architecture proved inadequate for the graph's dynamism, prompting shifts toward specialized graph stores. In 2013, Facebook introduced TAO (The Associations and Objects), a distributed datastore tailored for social graph workloads, which separates storage into persistent MySQL shards for objects (nodes like users or pages) and associations (typed edges with metadata such as timestamps or visibility settings).[35] [34] TAO employs a multi-tier caching strategy—leader-follower replicas in memcache for hot data, backed by durable storage—to achieve sub-millisecond latencies on reads while ensuring atomic writes via leader election and versioning, thus accommodating the graph's high-velocity updates from billions of daily interactions.[34] The graph's edges are typed and directed, supporting operations like traversal for friend-of-friend suggestions or aggregation for News Feed ranking, with APIs exposing subsets via the Graph API for external access under user permissions.[36] This structure scaled to handle workloads exceeding 10 billion queries per second by the early 2020s, leveraging sharding by node ID and geographic distribution to manage partition tolerance.[37] Evolving from unidirectional friendships to multifaceted associations—including likes, shares, and event RSVPs—the social graph has underpinned revenue-generating features like social advertising, launched November 6, 2007, which targets users via inferred interests derived from edge traversals.[38] Despite its efficacy in personalization, the centralized control has drawn scrutiny for enabling unchecked data aggregation, though empirical analyses confirm its causal role in user retention through network effects rather than mere convenience.[35]Twitter's Follow Graph
Twitter's follow graph is a directed graph in which nodes represent users and edges denote unidirectional "follow" relationships, with an edge from user A to user B indicating that A follows B and thereby receives B's posts in their timeline.[39] This model prioritizes asymmetric information flow, enabling one-way content consumption without requiring mutual approval, which distinguishes it from bidirectional friendship graphs on platforms like Facebook.[40] The graph's structure exhibits power-law degree distributions, with a small number of high-degree nodes (celebrities or influencers) attracting disproportionate followers, while most users have few outgoing edges.[41] To manage the graph's scale—historically encompassing hundreds of millions of nodes and billions of edges by the early 2010s—Twitter developed FlockDB, a distributed, fault-tolerant graph database optimized for storing and querying adjacency lists rather than full traversals.[42] Introduced on May 3, 2010, FlockDB supports efficient operations like counting followers or checking mutual follows but avoids complex path-finding to maintain performance at high volumes, such as billions of edges.[42] It integrates with MySQL for storage and uses a web service interface for reads and writes, facilitating fan-out mechanisms where a user's tweet is pushed to followers' timelines in real-time.[43] The follow graph underpins key features, including timeline generation via fan-out writes and personalized recommendations through the "Who to Follow" (WTF) service, which leverages graph-based machine learning to suggest connections.[44] For instance, WTF employs collaborative filtering over the graph's structure, analyzing paths and similarities in follow patterns to predict relevant follows, with models trained on historical data to rank candidates by predicted engagement.[39] Later enhancements, such as RealGraph introduced around 2019, refine these predictions by embedding user-tweet interactions into denser representations for real-time scoring.[45] Despite its utility, the graph's directed nature contributes to low reciprocity—typically 22-30% of follows are mutual—reflecting its role more as an interest or information network than a purely social one.[40][41]Implementations in Other Platforms
LinkedIn employs a distributed graph database named LIquid to model professional relationships as a social graph, handling tens of terabytes of data and supporting up to half a million queries per second for features like connection recommendations and network analysis.[46] This implementation emphasizes directed edges representing endorsements, follows, and collaborations, differing from consumer platforms by prioritizing economic and career-oriented ties over casual friendships.[47] Google+ utilized a directed social graph structured around "circles," allowing users to categorize connections into asymmetric groups for selective sharing, which facilitated ego-centric network analysis in datasets comprising millions of edges from public circle exports.[48] Launched in 2011, this system aimed to integrate social data across Google's ecosystem but faced challenges in user adoption, leading to its discontinuation in 2019; empirical studies of its graph revealed denser clusters among celebrities and IT professionals compared to broader populations.[49] Other platforms, such as Instagram, integrate social graph elements inherited from Meta's infrastructure to infer relationships via mutual follows and interactions, powering feed algorithms that prioritize content from strong ties, though increasingly augmented by interest-based signals.[50] In contrast, TikTok largely eschews a traditional connection-focused social graph in favor of an interest graph, recommending videos based on user engagement patterns rather than explicit follower links, which enabled rapid scaling to over 1 billion users by 2021 without relying on imported social networks.[51][52]Extensions and Advanced Protocols
Open Graph Protocol
The Open Graph Protocol (OGP), introduced by Facebook on April 21, 2010, is a framework of standardized meta tags embedded in HTML documents to describe the properties of web pages, enabling them to function as rich objects within social networks.[53] It allows platforms to generate preview cards with titles, descriptions, images, and other media when links are shared, thereby integrating external web content into the social graph by associating it with user interactions such as likes, shares, and comments.[53] This protocol extends the social graph beyond platform-specific data by mapping web resources to graph entities, facilitating richer connections between users, content, and external sites.[54] Technically, OGP employs namespace-prefixed meta elements in the<head> section of HTML, such as og:title for the page's title, og:image for a representative image (recommended at least 200x200 pixels), og:description for a brief summary, and og:type to specify object types like "website," "article," or "video.other" from a predefined set.[53] Additional properties support advanced features, including audio (og:audio), video (og:video), and determiners for locale (og:locale), with the protocol drawing inspiration from established standards like Dublin Core, RDFa, and Microformats to ensure semantic interoperability.[53] When a link is shared, social platforms parse these tags via web crawlers to construct interactive previews, which users can then engage with, effectively incorporating third-party content into the graph's relational structure without requiring direct API integration.[55]
In the context of social graphs, OGP's primary impact has been to democratize content representation across networks, with adoption extending to platforms like Twitter (now X), LinkedIn, and WhatsApp, though implementations vary—Twitter favors its own cards protocol alongside OGP for compatibility.[56] By 2010, Facebook's rollout coincided with the "Like" button's launch, enabling over 1 million websites to integrate within months, amplifying graph density through viral sharing mechanics.[57] However, reliance on self-declared metadata introduces risks of manipulation, as sites can alter tags without verification, potentially disseminating misleading previews that propagate through the graph.[57] Despite these vulnerabilities, OGP remains a foundational extension for scalable, web-wide social connectivity, powering billions of daily shares while underscoring the tension between openness and control in graph architectures.[53]
