Open-source software
View on Wikipedia

Open-source software (OSS) is computer software that is released under a license in which the copyright holder grants users the rights to use, study, change, and distribute the software and its source code to anyone and for any purpose.[1][2] Open-source software may be developed in a collaborative, public manner. Open-source software is a prominent example of open collaboration, meaning any capable user is able to participate online in development, making the number of possible contributors indefinite. The ability to examine the code facilitates public trust in the software.[3]
Open-source software development can bring in diverse perspectives beyond those of a single company. A 2024 estimate of the value of open-source software to firms is $8.8 trillion, as firms would need to spend 3.5 times the amount they currently do without the use of open source software.[4]
Open-source code can be used for studying and allows capable end users to adapt software to their personal needs in a similar way user scripts and custom style sheets allow for web sites, and eventually publish the modification as a fork for users with similar preferences, and directly submit possible improvements as pull requests.
Definitions
[edit]
The Open Source Initiative's (OSI) definition is recognized by several governments internationally[5] as the standard or de facto definition. OSI uses The Open Source Definition to determine whether it considers a software license open source. The definition was based on the Debian Free Software Guidelines, written and adapted primarily by Bruce Perens.[6][7][8] Perens did not base his writing on the "four freedoms" from the Free Software Foundation (FSF), which were only widely available later.[9]
Under Perens' definition, open source is a broad software license that makes source code available to the general public with relaxed or non-existent restrictions on the use and modification of the code. It is an explicit "feature" of open source that it puts very few restrictions on the use or distribution by any organization or user, in order to enable the rapid evolution of the software.[10]
According to Feller et al. (2005), the terms "free software" and "open-source software" should be applied to any "software products distributed under terms that allow users" to use, modify, and redistribute the software "in any manner they see fit, without requiring that they pay the author(s) of the software a royalty or fee for engaging in the listed activities."[11]
Despite initially accepting it,[12] Richard Stallman of the FSF now flatly opposes the term "Open Source" being applied to what they refer to as "free software". Although he agrees that the two terms describe "almost the same category of software", Stallman considers equating the terms incorrect and misleading.[13] Stallman also opposes the professed pragmatism of the Open Source Initiative, as he fears that the free software ideals of freedom and community are threatened by compromising on the FSF's idealistic standards for software freedom.[14] The FSF considers free software to be a subset of open-source software, and Richard Stallman explained that DRM software, for example, can be developed as open source, despite that it does not give its users freedom (it restricts them), and thus does not qualify as free software.[13]
Open-source software development
[edit]Development model
[edit]In his 1997 essay The Cathedral and the Bazaar, open-source influential contributor Eric S. Raymond suggests a model for developing OSS known as the bazaar model. Raymond likens the development of software by traditional methodologies to building a cathedral, with careful isolated work by individuals or small groups. He suggests that all software should be developed using the bazaar style, with differing agendas and approaches.[15]
In the traditional model of development, which he called the cathedral model, development takes place in a centralized way. Roles are clearly defined. Roles include people dedicated to designing (the architects), people responsible for managing the project, and people responsible for implementation. Traditional software engineering follows the cathedral model.[15]
The bazaar model, however, is different. In this model, roles are not clearly defined.[15] Some proposed characteristics of software developed using the bazaar model should exhibit the following patterns:[16]
- Users should be treated as co-developers: The users are treated like co-developers and so they should have access to the source code of the software. Furthermore, users are encouraged to submit additions to the software, code fixes for the software, bug reports, documentation, etc. Having more co-developers increases the rate at which the software evolves. Linus's law states that given enough eyeballs all bugs are shallow. This means that if many users view the source code, they will eventually find all bugs and suggest how to fix them. Some users have advanced programming skills, and furthermore, each user's machine provides an additional testing environment. This new testing environment offers the ability to find and fix a new bug.[16]
- Early releases: The first version of the software should be released as early as possible so as to increase one's chances of finding co-developers early.[16]
- Frequent integration: Code changes should be integrated (merged into a shared code base) as often as possible so as to avoid the overhead of fixing a large number of bugs at the end of the project life cycle.[16][17] Some open-source projects have nightly builds where integration is done automatically.[16]
- Several versions: There should be at least two versions of the software. There should be a buggier version with more features and a more stable version with fewer features. The buggy version (also called the development version) is for users who want the immediate use of the latest features and are willing to accept the risk of using code that is not yet thoroughly tested.[16] The users can then act as co-developers, reporting bugs and providing bug fixes.[16][18]
- High modularization: The general structure of the software should be modular allowing for parallel development on independent components.[16]
- Dynamic decision-making structure: There is a need for a decision-making structure, whether formal or informal, that makes strategic decisions depending on changing user requirements and other factors. This is in comparison with extreme programming.[16]
The process of Open source development begins with a requirements elicitation where developers consider if they should add new features or if a bug needs to be fixed in their project. This is established by communicating with the OSS community through avenues such as bug reporting and tracking or mailing lists and project pages. Next, OSS developers select or are assigned to a task and identify a solution. Because there are often many different possible routes for solutions in OSS, the best solution must be chosen with careful consideration and sometimes even peer feedback. The developer then begins to develop and commit the code. The code is then tested and reviewed by peers. Developers can edit and evolve their code through feedback from continuous integration. Once the leadership and community are satisfied with the whole project, it can be partially released and user instruction can be documented. If the project is ready to be released, it is frozen, with only serious bug fixes or security repairs occurring. Finally, the project is fully released and only changed through minor bug fixes.[18]
Advantages
[edit]Open source implementation of a standard can increase adoption of that standard.[19] This creates developer loyalty as developers feel empowered and have a sense of ownership of the end product.[20]
Moreover, lower costs of marketing and logistical services are needed for OSS.[21] OSS can be a tool to promote a company's image, including its commercial products.[22] The OSS development approach has helped produce reliable, high quality software quickly and inexpensively.[21]
Open source development offers the potential to quicken innovation and create of social value. In France for instance, a policy that incentivized government to favor free open-source software increased to nearly 600,000 OSS contributions per year, generating social value by increasing the quantity and quality of open-source software. This policy also led to an estimated increase of up to 18% of tech startups and a 14% increase in the number of people employed in the IT sector.[23]
OSS can be highly reliable when it has thousands of independent programmers testing and fixing bugs of the software.[16] Open source is not dependent on the company or author that originally created it. Even if the company fails, the code continues to exist and be developed by its users.[24]
OSS is flexible because modular systems allow programmers to build custom interfaces, or add new abilities to it and it is innovative since open-source programs are the product of collaboration among a large number of different programmers.[16] The mix of divergent perspectives, corporate objectives, and personal goals speeds up innovation.[25]
Moreover, free software can be developed in accordance with purely technical requirements. It does not require thinking about commercial pressure that often degrades the quality of the software. Commercial pressures make traditional software developers pay more attention to customers' requirements than to security requirements, since such features are somewhat invisible to the customer.[26]
Development tools
[edit]In open-source software development, tools are used to support the development of the product and the development process itself.[18]
Version control systems such as Centralized Version control system (CVCS) and the distributed version control system (DVCS) are examples of tools, often open source, that help manage the source code files and the changes to those files for a software project in order to foster collaboration. CVCS are centralized with a central repository while DVCS are decentralized and have a local repository for every user. Concurrent Versions System (CVS) and later Subversion (SVN) are examples of CVCS, whereas Git is a DVCS and the most widely used version control software.[27] The repositories are hosted and published on source-code-hosting facilities such as GitHub.[28]
Open-source projects use utilities such as issue trackers to organize open-source software development. Commonly used bug trackers include Bugzilla and Redmine.[18]
Tools such as mailing lists and IRC provide means of coordination and discussion of bugs among developers. Project web pages, wiki pages, roadmap lists and newsgroups allow for the distribution of project information that focuses on end users.[18]
Opportunities for participation
[edit]Contributing
[edit]The basic roles OSS participants can fall into multiple categories, beginning with leadership at the center of the project who have control over its execution. Next are the core contributors with a great deal of experience and authority in the project who may guide the other contributors. Non-core contributors have less experience and authority, but regularly contribute and are vital to the project's development. New contributors are the least experienced but with mentorship and guidance can become regular contributors.[29]
Some possible ways of contributing to open-source software include such roles as programming, maintaining, user interface design and testing, web design, bug triage, accessibility design and testing, UX design, code testing, and security review and testing. However, there are several ways of contributing to OSS projects even without coding skills. For example, some less technical ways of participating are documentation writing and editing, translation, project management, event organization and coordination, marketing, release management, community management, and public relations and outreach.[29]
Funding is another way that individuals and organizations choose to contribute to open source projects. Groups like Open Collective provide a means for individuals to contribute monthly to supporting their favorite projects.[30] Organizations like the Sovereign Tech Fund is able to contribute to millions to supporting the tools the German Government uses.[31] The National Science Foundation established a Pathways to Enable Open-Source Ecosystems (POSE) program to support open source innovation.[32]
Industry participation
[edit]The adoption of open-source software by industry is increasing over time.[33] OSS is popular in several industries such as telecommunications, aerospace, healthcare, and media & entertainment due to the benefits it provides.[34] Adoption of OSS is more likely in larger organizations and is dependent on the company's IT usage, operating efficiencies, and the productivity of employees.[33]
Industries are likely to use OSS due to back-office functionality, sales support, research and development, software features, quick deployment, portability across platforms and avoidance of commercial license management. Additionally, lower cost for hardware and ownership are also important benefits.[33]
Prominent organizations
[edit]Organizations that contribute to the development and expansions of free and open-source software movements exist all over the world. These organizations are dedicated to goals such as teaching and spreading technology. As listed by a former vice president of the Open Source Initiative, some American organizations include the Free Software Foundation, Software Freedom Conservancy, the Open Source Initiative and Software in the Public Interest. Within Europe some notable organizations are Free Software Foundation Europe, open-source projects EU (OSP) and OpenForum Europe (OFE). One Australian organization is Linux Australia while Asia has Open source Asia and FOSSAsia. Free and open source software for Africa (FOSSFA) and OpenAfrica are African organizations and Central and South Asia has such organizations as FLISOL and GRUP de usuarios de software libre Peru. Outside of these, many more organizations dedicated to the advancement of open-source software exist.[29]
Legal and economic issues
[edit]Licensing
[edit]FOSS products are generally licensed under two types of licenses: permissive licensing and copyleft licensing. Both of these types of licenses are different than proprietary licensing in that they can allow more users access to the software and allow for the creation of derivative works as specified by the terms of the specific license, as each license has its own rules. Permissive licenses allow recipients of the software to implement the author's copyright rights without having to use the same license for distribution. Examples of this type of license include the BSD, MIT, and Apache licenses. Copyleft licenses are different in that they require recipients to use the same license for at least some parts of the distribution of their works. Strong copyleft licenses require all derivative works to use the same license while weak copyleft licenses require the use of the same license only under certain conditions. Examples of this type of license include the GNU family of licenses, and the MPL and EPL licenses. The similarities between these two categories of licensing include that they provide a broad grant of copyright rights, require that recipients preserve copyright notices, and that a copy of the license is provided to recipients with the code.[35]
One important legal precedent for open-source software was created in 2008, when the Jacobson v Katzer case enforced terms of the Artistic license, including attribution and identification of modifications. The ruling of this case cemented enforcement under copyright law when the conditions of the license were not followed. Because of the similarity of the Artistic license to other open-source software licenses, the ruling created a precedent that applied widely.[35]
Examples of free-software license / open-source licenses include Apache licenses, BSD licenses, GNU General Public Licenses, GNU Lesser General Public License, MIT License, Eclipse Public License and Mozilla Public License.[35]
Legal issues
[edit]Several gray areas exist within software regulation that have great impact on open-source software, such as if software is a good or service, what can be considered a modification, governance through contract vs license, ownership and right of use. While there have been developments on these issues, they often lead to even more questions. The existence of these uncertainties in regulation has a negative impact on industries involved in technologies as a whole.[35]
Within the legal history of software as a whole, there was much debate on whether to protect it as intellectual property under patent law, copyright law or establishing a unique regulation. Ultimately, copyright law became the standard with computer programs being considered a form of literary work, with some tweaks of unique regulation.[35]
Software is generally considered source code and object code, with both being protectable, though there is legal variety in this definition. Some jurisdictions attempt to expand or reduce this conceptualization for their own purposes. For example, The European Court of Justice defines a computer program as not including the functionality of a program, the programing language, or the format of data files. By limiting protections of the different aspects of software, the law favors an open-source approach to software use. The US especially has an open approach to software, with most open-source licenses originating there. However, this has increased the focus on patent rights within these licenses, which has seen backlash from the OSS community, who prefer other forms of IP protection.[35]
Another issue includes technological protection measures (TPM) and digital rights management (DRM) techniques which were internationally legally recognized and protected in the 1996 World Intellectual Property Organization (WIPO) Treaty. Open source software proponents disliked these technologies as they constrained end-users potentially beyond copyright law. Europe responded to such complaints by putting TPM under legal controls, representing a victory for OSS supporters.[35]
Economic/business implications
[edit]
In open-source communities, instead of owning the software produced, the producer owns the development of the evolving software. In this way, the future of the software is open, making ownership or intellectual property difficult within OSS. Licensing and branding can prevent others from stealing it, preserving its status as a public good. Open source software can be considered a public good as it is available to everyone and does not decrease in value for others when downloaded by one person. Open source software is unique in that it becomes more valuable as it is used and contributed to, instead of diminishing the resource. This is explained by concepts such as investment in reputation and network effects.[36]
The economic model of open-source software can be explained as developers contribute work to projects, creating public benefits. Developers choose projects based on the perceived benefits or costs, such as improved reputation or value of the project. The motivations of developers can come from many different places and reasons, but the important takeaway is that money is not the only or even most important incentivization.[36]
Because economic theory mainly focuses on the consumption of scarce resources, the OSS dynamic can be hard to understand. In OSS, producers become consumers by reaping the rewards of contributing to a project. For example, a developer becomes well regarded by their peers for a successful contribution to an OSS project. The social benefits and interactions of OSS are difficult to account for in economic models as well. Furthermore, the innovation of technology creates constantly changing value discussions and outlooks, making economic model unable to predict social behavior.[36]
Although OSS is theoretically challenging in economic models, it is explainable as a sustainable social activity that requires resources. These resources include time, money, technology and contributions. Many developers have used technology funded by organizations such as universities and governments, though these same organizations benefit from the work done by OSS. As OSS grows, hybrid systems containing OSS and proprietary systems are becoming more common.[36]
Throughout the mid 2000s, more and more tech companies have begun to use OSS. For example, Dell's move of selling computers with Linux already installed. Microsoft itself has launched a Linux-based operating system despite previous animosity with the OSS movement. Despite these developments, these companies tend to only use OSS for certain purposes, leading to worries that OSS is being taken advantage of by corporations and not given anything in return.[24]
Government uses
[edit]Many governments are interested in implementing and promoting open-source software due to the many benefits provided: for example, the UK government issued a policy promoting open source and open standards in 2004, restating the policy in 2009: "the Government will actively and fairly consider open source solutions alongside proprietary ones".[37] However, an issue to be considered is cybersecurity. While accidental vulnerabilities are possible, so are attacks by outside agents. Because of these fears, governmental interest in contributing to the governance of software has become more prominent. However, these are the broad strokes of the issue, with each country having their own specific politicized interactions with open-source software and their goals for its implementation. For example, the United States has focused on national security in regard to open-source software implementation due to the perceived threat of the increase of open-source software activity in countries like China and Russia, with the Department of Defense considering multiple criteria for using OSS. These criteria include whether it comes from and is maintained by trusted sources, whether it will continue to be maintained, if there are dependencies on sub-components in the software, component security and integrity, and foreign governmental influence.[38]
Another issue for governments in regard to open source is their investments in technologies such as operating systems, semiconductors, cloud, and artificial intelligence. These technologies all have implications for global cooperation, again opening up security issues and political consequences. Many countries have to balance technological innovation with technological dependence in these partnerships. For example, after China's open-source dependent company Huawei was prevented from using Google's Android system in 2019, they began to create their own alternative operating system: Harmony OS.[38]
Germany recently[when?] established a Sovereign Tech Fund, to help support the governance and maintenance of the software that they use.
Open software movement
[edit]History
[edit]In the early days of computing, such as the 1950s and into the 1960s, programmers and developers shared software to learn from each other and evolve the field of computing. For example, Unix included the operating system source code for users. Eventually, the commercialization of software in the years 1970–1980 began to prevent this practice. However, academics still often developed software collaboratively.[39]
In response, the open-source movement was born out of the work of skilled programmer enthusiasts, widely referred to as hackers or hacker culture.[40] One of these enthusiasts, Richard Stallman, was a driving force behind the free software movement, which would later allow for the open-source movement. In 1984, he resigned from MIT to create a free operating system, GNU, after the programmer culture in his lab was stifled by proprietary software preventing source code from being shared and improved upon. GNU was UNIX compatible, meaning that the programmer enthusiasts would still be familiar with how it worked. However, it quickly became apparent that there was some confusion with the label Stallman had chosen of free software, which he described as free as in free speech, not free beer, referring to the meaning of free as freedom rather than price. He later expanded this concept of freedom to the four essential freedoms. Through GNU, open-source norms of incorporating others' source code, community bug fixes and suggestions of code for new features appeared. In 1985, Stallman founded the Free Software Foundation (FSF) to promote changes in software and to help write GNU. In order to prevent his work from being used in proprietary software, Stallman created the concept of copyleft, which allowed the use of his work by anyone, but under specific terms. To do this, he created the GNU General Public License (GNU GPL) in 1989, which was updated in 1991.[17] In 1991, GNU was combined with the Linux kernel written by Linus Torvalds, as a kernel was missing in GNU.[41] The operating system is now usually referred to as Linux.[17] Throughout this whole period, there were many other free software projects and licenses around at the time, all with different ideas of what the concept of free software was and should be, as well as the morality of proprietary software, such as Berkeley Software Distribution, TeX, and the X Window System.[42]
As free software developed, the Free Software Foundation began to look how to bring free software ideas and perceived benefits to the commercial software industry. It was concluded that FSF's social activism was not appealing to companies and they needed a way to rebrand the free software movement to emphasize the business potential of sharing and collaborating on software source code.[42] The term open source was suggested by Christine Peterson in 1998 at a meeting of supporters of free software. Many in the group felt the name free software was confusing to newcomers and holding back industry interest and they readily accepted the new designation of open source, creating the Open Source Initiative (OSI) and the OSI definition of what open source software is.[17] The Open Source Initiative's (OSI) definition is now recognized by several governments internationally as the standard or de facto definition.[41] The definition was based on the Debian Free Software Guidelines, written and adapted primarily by Bruce Perens.[43] The OSI definition differed from the free software definition in that it allows the inclusion of proprietary software and allows more liberties in its licensing. Some, such as Stallman, agree more with the original concept of free software as a result because it takes a strong moral stance against proprietary software, though there is much overlap between the two movements in terms of the operation of the software.[17]
While the Open Source Initiative sought to encourage the use of the new term and evangelize the principles it adhered to, commercial software vendors found themselves increasingly threatened by the concept of freely distributed software and universal access to an application's source code, with an executive of Microsoft calling open source an intellectual property destroyer in 2001. However, while free and open-source software (FOSS) has historically played a role outside of mainstream private software development, companies as large as Microsoft have begun to develop official open source presences on the Internet. IBM, Oracle, and State Farm are just a few of the companies with a serious public stake in today's competitive open source market, marking a significant shift in the corporate philosophy concerning the development of FOSS.[44]
Future
[edit]The future of the open source software community, and the free software community by extension, has become successful if not confused about what it stands for. For example, Android and Ubuntu are examples milestones of success in the open source software rise to prominence from the sidelines of technological innovation as it existed in the early 2000s. However, some in the community consider them failures in their representation of OSS due to issues such as the downplaying of the OSS center of Android by Google and its partners, the use of an Apache license that allowed forking and resulted in a loss of opportunities for collaboration within Android, the prioritization of convenience over freedom in Ubuntu, and features within Ubuntu that track users for marketing purposes.[24]
The use of OSS has become more common in business with 78% of companies reporting that they run all or part of their operations on FOSS. The popularity of OSS has risen to the point that Microsoft, a once detractor of OSS, has included its use in their systems. However, this success has raised concerns that will determine the future of OSS as the community must answer questions such as what OSS is, what should it be, and what should be done to protect it, if it even needs protecting. All in all, while the free and open source revolution has slowed to a perceived equilibrium in the market place, that does not mean it is over as many theoretical discussions must take place to determine its future.[24]
Comparisons with other software licensing/development models
[edit]Closed source / proprietary software
[edit]Open source software differs from proprietary software in that it is publicly available, the license requires no fees, modifications and distributions are allowed under license specifications. All of this works to prevent a monopoly on any OSS product, which is a goal of proprietary software. Proprietary software limits their customers' choices to either committing to using that software, upgrading it or switching to other software, forcing customers to have their software preferences impacted by their monetary cost. The ideal case scenario for the proprietary software vendor would be a lock-in, where the customer does not or cannot switch software due to these costs and continues to buy products from that vendor.[45]
Within proprietary software, bug fixes can only be provided by the vendor, moving platforms requires another purchase and the existence of the product relies on the vendor, who can discontinue it at any point.[40] Additionally, proprietary software does not provide its source code and cannot be altered by users. For businesses, this can pose a security risk and source of frustration, as they cannot specialize the product to their needs, and there may be hidden threats or information leaks within the software that they cannot access or change.[17]
Free software
[edit]Under OSI's definition, open source is a broad software license that makes source code available to the general public with relaxed or non-existent restrictions on the use and modification of the code. It is an explicit feature of open source that it puts very few restrictions on the use or distribution by any organization or user, in order to enable the rapid evolution of the software.[46]
Richard Stallman, leader of the Free software movement and member of the free software foundation opposes the term open source being applied to what they refer to as free software. Although he agrees that the two terms describe almost the same category of software, Stallman considers equating the terms incorrect and misleading.[13] He believes that the main difference is that by choosing one term over the other lets others know about what one's goals are: development (open source) or a social stance (free software).[47] Nevertheless, there is significant overlap between open source software and free software.[13] Stallman also opposes the professed pragmatism of the Open Source Initiative, as he fears that the free software ideals of freedom and community are threatened by compromising on the FSF's idealistic standards for software freedom.[47] The FSF considers free software to be a subset of open-source software, and Richard Stallman explained that DRM software, for example, can be developed as open source, despite how it restricts its users, and thus does not qualify as free software.[13]
The FSF said that the term open source fosters an ambiguity of a different kind such that it confuses the mere availability of the source with the freedom to use, modify, and redistribute it.[13] On the other hand, the term free software was criticized for the ambiguity of the word free, which was seen as discouraging for business adoption, and for the historical ambiguous usage of the term.[47]
Developers have used the alternative terms Free and Open Source Software (FOSS), or Free/Libre and Open Source Software (FLOSS), consequently, to describe open-source software that is also free software.[29]
Source-available software
[edit]Software can be distributed with source code, which is a code that is readable. Software is source available when this source code is available to be seen. However to be source available or FOSS, the source code does not need to be accessible to all, just the users of that software. While all FOSS software is source available because this is a requirement made by the Open Source Definition, not all source available software is FOSS. For example, if the software does not meet other aspects of the Open Source Definition such as permitted modification or redistribution, even if the source code is available, the software is not FOSS.[48]
Open-sourcing
[edit]A recent trend within software companies is open-sourcing, or transitioning their previous proprietary software into open-source software through releasing it under an open-source license.[49][50] Examples of companies who have done this are Google, Microsoft and Apple.[49] Additionally, open-sourcing can refer to programming open-source software or installing open-source software.[50] Open-sourcing can be beneficial in multiple ways, such as attracting more external contributors who bring new perspectives and problem solving capabilities. The downsides of open-sourcing include the work that has to be done to maintaining the new community, such as making the base code easily understandable, setting up communication channels for new developers and creating documentation to allow new developers to easily join. However, a review of several open-sourced projects found that although a newly open-sourced project attracts many newcomers, a great amount are likely to soon leave the project and their forks are also likely to not be impactful.[49]
Other
[edit]Other concepts that may share some similarities to open source are shareware, public domain software, freeware, and software viewers/readers that are freely available but do not provide source code. However, these differ from open source software in access to source code, licensing, copyright and fees.[17]
Society and culture
[edit]Demographics
[edit]Despite being able to collaborate internationally, open source software contributors were found to mostly be located in large clusters such as Silicon Valley that largely collaborate within themselves. Possible reasons for this phenomenon may be that the OSS contributor demographic largely works in software, meaning that the OSS geographic location is closely related to that dispersion and collaborations could be encouraged through work and social networks.[51] Code acceptance can be impacted by status within these social network clusters, creating unfair predispositions in code acceptance based on location.[52] Barriers to international collaboration also include linguistic or cultural differences.[53] Furthermore, each country has been shown to have a higher acceptance rate for code from contributors within their country except India, indicating a bias for culturally similar collaborators.[53]
In 2021, the countries with the highest open source software contributions included the United States, China, Germany, India, and the UK, in that order.[51] The countries with the highest OSS developers per capita from a study in 2021 include, in order, Iceland, Switzerland, Norway, Sweden, and Finland, while in 2008 the countries with top amount of estimated contributors in SourceForge were the United States, Germany, United Kingdom, Canada and France.[51][53] Though there have been several studies done on the distribution and contributions of OSS developers, this is still an open field that can be measured in several different ways. For instance, Information and communication technology participation, population, wealth and proportion of access to the internet have been shown to be correlated with OSS contributions.[53]
Although gender diversity has been found to enhance team productivity, women still face biases while contributing to open source software projects when their gender is identifiable.[54] In 2002, only 1.5% of international open-source software developers were women, while women made up 28% of tech industry roles, demonstrating their low representation in the software field.[55] Despite OSS contributions having no prerequisites, this gender bias may continue to exist due to the common belief of contributors that gender should not matter, and the quality of code should be the only consideration for code acceptance, preventing the community from addressing the systemic disparities in female representation.[40] However, a more recent figure of female OSS participation internationally calculated across 2005 to 2021 is 9.8%, with most being recent contributors, indicating that female participation may be growing.[56]
Motivations
[edit]There are many motivations for contributing to the OSS community. For one, it is an opportunity to learn and practice multiple skills such as coding and other technology related abilities, but also fundamental skills such as communication and collaboration and practical skills needed to excel in technology related fields such as issue tracking or version control. Instead of learning through a classroom or a job, learning through contributing to OSS allows participants to learn at their own pace and follow what interests them. When contributing to OSS, the contributor can learn the current industry best practices, technology and trends and even have the opportunity to contribute to the next big innovation as OSS grows increasingly popular within the tech field. Contributing to OSS without payment means there is no threat of being fired, though reputations can take a hit. On the other hand, a huge motivation to contribute to OSS is the reputation gained as one grows one's public portfolio.[29]
Disparities
[edit]Even though programming was originally seen as a female profession, there remains a large gap in computing.[57] Social identity tends to be a large concern as women in the tech industry face insecurity about attracting unwanted male attention and harassment or being unfeminine in their technology knowledge, having a large impact on confidence.[40] Some male tech participants make clear that they believe women fitting in within the culture is impossible, furthering the insecurity for women and their place in the tech industry. Additionally, even in a voluntary contribution environment like open source software, women tend to end up doing the less technical aspects of projects, such as manual testing or documentation despite women and men showing the same productivity in OSS contributions. Explicit biases include longer feedback time, more scrutinization of code and lower acceptance rate of code.[54] Specifically in the open-source software community, women report that sexually offensive language is common and the women's identity as female is given more attention that as an OSS contributor Bias is hard to address due to the belief that gender should not matter, with most contributors feeling that women getting special treatment is unfair and success should be dependent on skill, preventing any changes to be more inclusive.[40]
Adoption and application
[edit]Key projects
[edit]Open source software projects are built and maintained by a network of programmers, who may often be volunteers, and are widely used in free as well as commercial products.[58]
- Unix: Unix is an operating system created by AT&T that began as a precursor to open source software in that the free and open-source software revolution began when developers began trying to create operating systems without Unix code. Unix was created in the 1960s, before the commercialization of software and before the concept of open source software was necessary, therefore it was not considered a true open source software project. It started as a research project before being commercialized in the mid 1980s. Before its commercialization, it represented many of the ideals held by the Free and Open source software revolution, including the decentralized collaboration of global users, rolling releases and a community culture of distaste towards proprietary software.[24]
- BSD: Berkeley Software Distribution (BSD) is an operating system that began as a variant of Unix in 1978 that mixed Unix code with code from Berkeley labs to increase functionality. As BSD was focused on increasing functionality, it would publicly share its greatest innovations with the main Unix operating system. This is an example of the free public code sharing that is a central characteristic of FOSS today. As Unix became commercialized in the 1980s, developers or members of the community who did not support proprietary software began to focus on BSD and turning it into an operating system that did not include any of Unix's code. The final version of BSD was released in 1995.[24]
- GNU: GNU is a free operating system created by Richard Stallman in 1984 with its name meaning Gnu's Not Unix. The idea was to create a Unix alternative operating system that would be available for anyone to use and allow programmers to share code freely between them. However, the goal of GNU was not to only replace Unix, but to make a superior version that had more technological capabilities. It was released before the philosophical beliefs of the Free and Open source software revolution were truly defined. Because of its creation by prominent FOSS programmer Richard Stallman, GNU was heavily involved in FOSS activism, with one of the greatest achievements of GNU being the creation of the GNU General Public License or GPL, which allowed developers to release software that could be legally shared and modified.[24]
- Linux: Linux is an operating system kernel that was introduced in 1991 by Linus Torvalds. Linux was inspired by making a better version of the for profit operating service Minux. It was radically different than what other hackers were producing at the time due to it being totally free of cost and being decentralized. Later, Linux was put under the GPL license, allowing people to make money with Linux and bringing Linux into the FOSS community.[24]
- Apache: Apache began in 1995 as a collaboration between a group of developers releasing their own web server due to their frustration with NCSA HTTPd code base. The name Apache was used because of the several patches they applied to this code base. Within a year of its release, it became the worldwide leading web server. Soon, Apache came out with its own license, creating discord in the greater FOSS community, though ultimately proving successful. The Apache license allowed permitted members to directly access source code, a marked difference from GNU and Linux's approaches.[24]
Extensions for non-software use
[edit]While the term open source applied originally only to the source code of software, it is now being applied to many other areas such as open-source ecology, a movement to decentralize technologies so that any human can use them.[13][59] However, it is often misapplied to other areas that have different and competing principles, which overlap only partially.[40]
The same principles that underlie open-source software can be found in many other ventures, such as open source, open content, and open collaboration.[60][3]
This "culture" or ideology takes the view that the principles apply more generally to facilitate concurrent input of different agendas, approaches, and priorities, in contrast with more centralized models of development such as those typically used in commercial companies.[15]
Value
[edit]More than 90 percent of companies use open-source software as a component of their proprietary software.[61] The decision to use open-source software, or even engage with open-source projects to improve existing open-source software, is typically a pragmatic business decision.[62][63] When proprietary software is in direct competition with an open-source alternative, research has found conflicting results on the effect of the competition on the proprietary product's price and quality.[64]
For decades, some companies have made servicing of an open-source software product for enterprise users their business model. These companies control an open-source software product, and instead of charging for licensing or use, charge for improvements, integration, and other servicing.[65] Software as a service (SaaS) products based on open-source components are increasingly common.[66]
Open-source software is preferred for scientific applications, because it increases transparency and aids in the validation and acceptance of scientific results.[67]
See also
[edit]- Comparison of free and open-source software licenses
- Free software
- Free-software license
- Free software movement
- List of free and open-source software packages
- Free content
- Open-source hardware
- Open Source Initiative
- Open-source license
- Open-source software advocacy
- Open Source Software Institute
- Open-source software security
- Open-source video game
- All articles with titles containing "Open source"
- Proprietary software
- Shared Source Initiative
- Timeline of free and open-source software
- Software composition analysis
- Digital public goods
References
[edit]- ^ St. Laurent, Andrew M. (2008). Understanding Open Source and Free Software Licensing. O'Reilly Media. p. 4. ISBN 978-0-596-55395-1. Archived from the original on 22 April 2023. Retrieved 21 March 2023.
- ^ Corbly, James Edward (25 September 2014). "The Free Software Alternative: Freeware, Open Source Software, and Libraries". Information Technology and Libraries. 33 (3): 65. doi:10.6017/ital.v33i3.5105. ISSN 2163-5226. Archived from the original on 1 May 2021. Retrieved 28 April 2021.
- ^ a b Levine, Sheen S.; Prietula, Michael J. (30 December 2013). "Open Collaboration for Innovation: Principles and Performance". Organization Science. 25 (5): 1414–1433. arXiv:1406.7541. doi:10.1287/orsc.2013.0872. ISSN 1047-7039. S2CID 6583883.
- ^ Hoffmann, Manuel; Nagle, Frank; Zhou, Yanuo (2024). "The Value of Open Source Software". SSRN Electronic Journal. doi:10.2139/ssrn.4693148. ISSN 1556-5068.
- ^ "International Authority & Recognition". Opensource.org. 21 April 2015. Archived from the original on 23 July 2019. Retrieved 7 December 2017.
- ^ Perens, Bruce. Open Sources: Voices from the Open Source Revolution Archived 15 September 2014 at the Wayback Machine. O'Reilly Media. 1999.
- ^ Dibona, Chris; Ockman, Sam (January 1999). The Open Source Definition by Bruce Perens. O'Reilly. ISBN 978-1-56592-582-3.
- ^ "The Open Source Definition". 7 July 2006. Archived from the original on 15 October 2013. Retrieved 24 August 2008., The Open Source Definition according to the Open Source Initiative
- ^ "How Many Open Source Licenses Do You Need? – Slashdot". News.slashdot.org. 16 February 2009. Archived from the original on 17 July 2013. Retrieved 25 March 2012.
- ^ Open Source Initiative (24 July 2006). "The Open Source Definition (Annotated)". opensource.org. Archived from the original on 5 May 2021. Retrieved 22 July 2016.
- ^ Feller, Joseph; Fitzgerald, Brian; Hissam, Scott; Lakhani, Karim R. (2005). "Introduction". Perspectives on Free and Open Source Software. Cambridge, MA: The MIT Press. p. xvii. ISBN 0-262-06246-1.
- ^ Tiemann, Michael. "History of the OSI". Open Source Initiative. Archived from the original on 24 September 2006. Retrieved 13 May 2014.
- ^ a b c d e f g Stallman, Richard (2007). "Why Open Source Misses the Point of Free Software".
- ^ Stallman, Richard (19 June 2007). "Why "Free Software" is better than "Open Source"". Philosophy of the GNU Project. Free Software Foundation. Archived from the original on 27 March 2021. Retrieved 23 July 2007.
- ^ a b c d Raymond, Eric (2005). "The Cathedral and the Bazaar (originally published in Volume 3, Number 3, March 1998)". First Monday. doi:10.5210/fm.v0i0.1472. ISSN 1396-0466.
- ^ a b c d e f g h i j k Robles, Gregorio (2006). "Empirical Software Engineering Research on Free/Libre/Open Source Software". 2006 22nd IEEE International Conference on Software Maintenance. pp. 347–350. doi:10.1109/icsm.2006.25. ISBN 0-7695-2354-4. S2CID 6589566.
- ^ a b c d e f g Napoleao, Bianca M.; Petrillo, Fabio; Halle, Sylvain (2020). "Open Source Software Development Process: A Systematic Review". 2020 IEEE 24th International Enterprise Distributed Object Computing Conference (EDOC). IEEE. pp. 135–144. arXiv:2008.05015. doi:10.1109/EDOC49727.2020.00025. ISBN 978-1-7281-6473-1.
- ^ a b c d e Napoleao, Bianca M.; Petrillo, Fabio; Halle, Sylvain (2020). "Open Source Software Development Process: A Systematic Review". 2020 IEEE 24th International Enterprise Distributed Object Computing Conference (EDOC). IEEE. pp. 135–144. arXiv:2008.05015. doi:10.1109/EDOC49727.2020.00025. ISBN 978-1-7281-6473-1.
- ^ US Department of Defense. "Open Source Software FAQ". Chief Information Officer. Archived from the original on 28 August 2016. Retrieved 22 July 2016.
- ^ Sharma, Srinarayan; Vijayan Sugumaran; Balaji Rajagopalan (2002). "A framework for creating hybrid-open source software communities" (PDF). Information Systems Journal. 12: 7–25. doi:10.1046/j.1365-2575.2002.00116.x. S2CID 5815589. Archived (PDF) from the original on 30 October 2008. Retrieved 8 September 2008.
- ^ a b Reynolds, Carl; Jeremy Wyatt (February 2011). "Open Source, Open Standards, and Health Care Information Systems". Journal of Medical Internet Research. 13 (1): e24. doi:10.2196/jmir.1521. PMC 3221346. PMID 21447469.
- ^ Landry, John; Rajiv Gupta (September 2000). "Profiting from Open Source". Harvard Business Review. doi:10.1225/F00503 (inactive 12 July 2025).
{{cite journal}}: CS1 maint: DOI inactive as of July 2025 (link) - ^ Nagle, Frank (3 March 2019). "Government Technology Policy, Social Value, and National Competitiveness" (PDF). Information Systems Journal. 12. doi:10.2139/ssrn.3355486. S2CID 85509685. SSRN 3355486.
- ^ a b c d e f g h i Tozzi, Christopher (2017). For Fun and Profit: A History of the Free and Open Source Software Revolution. United States: MIT Press. ISBN 978-0-262-34118-9.
- ^ Plotkin, Hal (December 1998). "What (and Why) you should know about open-source software". Harvard Management Update: 8–9.
- ^ Payne, Christian (February 2002). "On the Security of Open Source Software". Information Systems Journal. 12 (1): 61–78. doi:10.1046/j.1365-2575.2002.00118.x. S2CID 8123076.
- ^ Shirey, Russell G. (1 March 2015). "Git as an Encrypted Distributed Version Control System" (PDF). Defense Technical Information Center. p. 38. Retrieved 5 September 2025.
- ^ Zolkifli, Nazatul Nurlisa; Ngah, Amir; Deraman, Aziz (2018). "Version Control System: A Review". Procedia Computer Science. 135: 408–415. doi:10.1016/j.procs.2018.08.191.
- ^ a b c d e Brasseur, V. M. (2018). Forge your future with open source: build your skills, build your network, build the future of technology. The pragmatic programmers. Raleigh, North Carolina: The Pragmatic Bookshelf. ISBN 978-1-68050-301-2.
- ^ "Open Source". Open Collective. 20 October 2022. Retrieved 28 May 2024.
- ^ "Technologies". Sovereign Tech Fund. Retrieved 28 May 2024.
- ^ "NSF invests over $26 million in open-source projects | NSF - National Science Foundation". new.nsf.gov. 25 October 2023. Retrieved 28 May 2024.
- ^ a b c Spinellis, Diomidis; Giannikas, Vaggelis (2012). "Organizational adoption of open source software". Journal of Systems and Software. 85 (3): 666–682. doi:10.1016/j.jss.2011.09.037.
- ^ Zhang, Yiming; Malhotra, Baljeet; Chen, Cheng (2018). "Industry-Wide Analysis of Open Source Security". 2018 16th Annual Conference on Privacy, Security and Trust (PST). IEEE. pp. 1–10. doi:10.1109/PST.2018.8514185. ISBN 978-1-5386-7493-2. S2CID 53234981.
- ^ a b c d e f g Brock, Amanda (2023). Open Source Law, Policy and Practice (2nd ed.). UK: Oxford University Press. ISBN 978-0-19-886234-5.
- ^ a b c d Wynants, M., & Cornelis, J. (Eds.). (2005). How open is the future? : Economic, social and cultural scenarios inspired by free and open-source software. ASP.
- ^ Cabinet Office, All about Open Source: An Introduction to Open Source Software for Government IT, Version 2.0, page 8, published in April 2012, accessed on 1 October 2025
- ^ a b Pannier, Alice (2022). Software Power: The Economic and Geopolitical Implications of Open Source Software. Études de l'Ifri. ISBN 979-10-373-0641-8.
- ^ Maracke, Catharina (2019). "Free and Open Source Software and FRAND-based patent licenses: How to mediate between Standard Essential Patent and Free and Open Source Software". The Journal of World Intellectual Property. 22 (3–4): 78–102. doi:10.1111/jwip.12114. ISSN 1422-2213.
- ^ a b c d e f Bretthauer, David (2001). "Open Source Software: A History". Information Technology and Libraries. 21 (1).
- ^ a b "International Authority & Recognition". Open Source Initiative. 21 April 2015. Retrieved 18 December 2023.
- ^ a b Fogel, Karl (2006). Producing open source software: how to run a successful free software project (1. Aufl., [Nachdr.] ed.). Beijing Köln: O'Reilly. ISBN 978-0-596-00759-1.
- ^ Kelty, Christopher (2008). Two Bits: The Cultural Significance of Free Software. Duke University Press. ISBN 978-0-8223-8900-2.
- ^ Miller, Keith W.; Voas, Jeffrey; Costello, Tom (2010). "Free and Open Source Software". IT Professional. 12 (6): 14–16. Bibcode:2010ITPro..12f..14M. doi:10.1109/mitp.2010.147. ISSN 1520-9202. S2CID 265508713.
- ^ Zhu, Kevin Xiaoguo; Zhou, Zach Zhizhong (2012). "Research Note —Lock-In Strategy in Software Competition: Open-Source Software vs. Proprietary Software". Information Systems Research. 23 (2): 536–545. doi:10.1287/isre.1110.0358. ISSN 1047-7047.
- ^ "The Open Source Definition (Annotated)". Open Source Initiative. 24 July 2006. Retrieved 18 December 2023.
- ^ a b c Stallman, Richard M.; Gay, Joshua (2002). Free software, free society. Boston (Mass.): Free software foundation. ISBN 978-1-882114-98-6.
- ^ Fortunato, Laura; Galassi, Mark (17 May 2021). "The case for free and open source software in research and scholarship". Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences. 379 (2197) rsta.2020.0079. Bibcode:2021RSPTA.37900079F. doi:10.1098/rsta.2020.0079. ISSN 1364-503X. OSTI 1836982. PMID 33775148. S2CID 232387092.
- ^ a b c Pinto, Gustavo; Steinmacher, Igor; Dias, Luiz Felipe; Gerosa, Marco (2018). "On the challenges of open-sourcing proprietary software projects". Empirical Software Engineering. 23 (6): 3221–3247. doi:10.1007/s10664-018-9609-6. ISSN 1382-3256. S2CID 254467440.
- ^ a b Ågerfalk; Fitzgerald (2008). "Outsourcing to an Unknown Workforce: Exploring Opensurcing as a Global Sourcing Strategy". MIS Quarterly. 32 (2): 385. doi:10.2307/25148845. ISSN 0276-7783. JSTOR 25148845.
- ^ a b c Wachs, Johannes; Nitecki, Mariusz; Schueller, William; Polleres, Axel (March 2002). "The Geography of Open Source Software: Evidence from GitHub". Technological Forecasting and Social Change. 176 121478. arXiv:2107.03200. doi:10.1016/j.techfore.2022.121478.
- ^ Rastogi, Ayushi; Nagappan, Nachiappan; Gousios, Georgios; van der Hoek, André (11 October 2018). "Relationship between geographical location and evaluation of developer contributions in github". Proceedings of the 12th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. ACM. pp. 1–8. doi:10.1145/3239235.3240504. ISBN 978-1-4503-5823-1. S2CID 215822439.
- ^ a b c d Gonzalez-Barahona, Jesus M.; Robles, Gregorio; Andradas-Izquierdo, Roberto; Ghosh, Rishab Aiyer (August 2008). "Geographic origin of libre software developers". Information Economics and Policy. 20 (4): 356–363. doi:10.1016/j.infoecopol.2008.07.001.
- ^ a b Bosu, Amiangshu; Sultana, Kazi Zakia (2019). "Diversity and Inclusion in Open Source Software (OSS) Projects: Where do We Stand?". 2019 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM). IEEE. pp. 1–11. doi:10.1109/ESEM.2019.8870179. ISBN 978-1-7281-2968-6. S2CID 197640269.
- ^ Nafus, Dawn (June 2012). "'Patches don't have gender': What is not open in open source software". New Media & Society. 14 (4): 669–683. doi:10.1177/1461444811422887. ISSN 1461-4448. S2CID 206727320.
- ^ Trinkenreich, Bianca; Wiese, Igor; Sarma, Anita; Gerosa, Marco; Steinmacher, Igor (31 October 2022). "Women's Participation in Open Source Software: A Survey of the Literature". ACM Transactions on Software Engineering and Methodology. 31 (4): 1–37. arXiv:2105.08777. doi:10.1145/3510460. ISSN 1049-331X. S2CID 234778104.
- ^ Albusays, Khaled; Bjorn, Pernille; Dabbish, Laura; Ford, Denae; Murphy-Hill, Emerson; Serebrenik, Alexander; Storey, Margaret-Anne (April 2021). "The Diversity Crisis in Software Development". IEEE Software. 38 (2): 19–25. Bibcode:2021ISoft..38b..19A. doi:10.1109/MS.2020.3045817. ISSN 0740-7459.
- ^ Popp, Karl Michael, ed. (2020). Best practices for commercial use of open source software: business models, processes and tools for managing open source software. Synomic Academy. Norderstedt: BoD – Books on Demand. ISBN 978-3-7386-1909-6.
- ^ Powers, Stephen M.; Hampton, Stephanie E. (2019). "Open science, reproducibility, and transparency in ecology". Ecological Applications. 29 (1): e01822. Bibcode:2019EcoAp..29E1822P. doi:10.1002/eap.1822. ISSN 1051-0761. PMID 30362295.
- ^ Cheliotis, Giorgos (2009). "From open source to open content: Organization, licensing and decision processes in open cultural production". Decision Support Systems. 47 (3): 229–244. doi:10.1016/j.dss.2009.02.006. ISSN 0167-9236.
- ^ Butler et al. 2022, p. 1.
- ^ Butler et al. 2022, p. 11152.
- ^ Davila 2015, p. 7.
- ^ Zhou & Choudhary 2022, p. 731.
- ^ August et al. 2021, pp. 1–2.
- ^ August et al. 2021, p. 1.
- ^ Morin et al. 2012, Compatibility, Proliferation, Fragmentation, and Directionality.
Further reading
[edit]- Androutsellis-Theotokis, Stephanos; Spinellis, Diomidis; Kechagia, Maria; Gousios, Georgios (2010). "Open source software: A survey from 10,000 feet" (PDF). Foundations and Trends in Technology, Information and Operations Management. 4 (3–4): 187–347. doi:10.1561/0200000026. ISBN 978-1-60198-484-5.
- August, Terrence; Chen, Wei; Zhu, Kevin (2021). "Competition Among Proprietary and Open-Source Software Firms: The Role of Licensing in Strategic Contribution". Management Science. 67 (5): 3041–3066. doi:10.1287/mnsc.2020.3674.
- Birkinbine, Benjamin J. (2020). Incorporating the Digital Commons: Corporate Involvement in Free and Open Source Software. University of Westminster Press. hdl:20.500.12657/37226. ISBN 978-1-912656-43-1.
- Butler, Simon; Gamalielsson, Jonas; Lundell, Björn; Brax, Christoffer; Mattsson, Anders; Gustavsson, Tomas; Feist, Jonas; Kvarnström, Bengt; Lönroth, Erik (2022). "Considerations and challenges for the adoption of open source components in software-intensive businesses". Journal of Systems and Software. 186 111152. Elsevier BV. doi:10.1016/j.jss.2021.111152. ISSN 0164-1212.
- Coleman, E. Gabriella. Coding Freedom: The Ethics and Aesthetics of Hacking (Princeton UP, 2012)
- Davila, Jacinto (2015). "The Political Logic of Free, Libre, Open Source Software". Societal Benefits of Freely Accessible Technologies and Knowledge Resources. IGI Global. pp. 1–24. ISBN 978-1-4666-8337-2.
- Fadi P. Deek; James A. M. McHugh (2008). Open Source: Technology and Policy. Cambridge: Cambridge University Press. ISBN 978-0-511-36775-5.
- Chris DiBona and Sam Ockman and Mark Stone, ed. (1999). Open Sources: Voices from the Open Source Revolution. O'Reilly. ISBN 978-1-56592-582-3.
- Joshua Gay, ed. (2002). Free Software, Free Society: Selected Essays of Richard M. Stallman. Boston: GNU Press, Free Software Foundation. ISBN 978-1-882114-98-6.
- Understanding FOSS | editor = Sampathkumar Coimbatore India
- Benkler, Yochai (2002), "Coase's Penguin, or, Linux and The Nature of the Firm." Yale Law Journal 112.3 (Dec 2002): p367(78) (in Adobe pdf format)
- v. Engelhardt, Sebastian (2008). ""The Economic Properties of Software", Jena Economic Research Papers, Volume 2 (2008), Number 2008-045" (PDF). Jena Economics Research Papers.
- Lerner, J. & Tirole, J. (2002): 'Some simple economics on open source', Journal of Industrial Economics 50(2), p 197–234
- Välimäki, Mikko (2005). The Rise of Open Source Licensing: A Challenge to the Use of Intellectual Property in the Software Industry (PDF). Turre Publishing. Archived from the original (PDF) on 4 March 2009.
- Morin, Andrew; Urban, Jennifer; Sliz, Piotr (2012). "A Quick Guide to Software Licensing for the Scientist-Programmer". PLOS Computational Biology. 8 (7) e1002598. Bibcode:2012PLSCB...8E2598M. doi:10.1371/journal.pcbi.1002598. ISSN 1553-7358. PMC 3406002. PMID 22844236.
- Polley, Barry (11 December 2007). "Open Source Discussion Paper – version 1.0" (PDF). New Zealand Open Source Society. New Zealand Ministry of Justice. Archived from the original (PDF) on 23 February 2018. Retrieved 12 December 2007.
- Rossi, M. A. (2006): Decoding the free/open-source software puzzle: A survey of theoretical and empirical contributions, in J. Bitzer P. Schröder, eds, 'The Economics of Open Source Software Development', p 15–55.
- Open Sources: Voices from the Open Source Revolution — an online book containing essays from prominent members of the open-source community
- Berry, D M (2004). The Contestation of Code: A Preliminary Investigation into the Discourse of the Free Software and Open Software Movement, Critical Discourse Studies, Volume 1(1).
- Schrape, Jan-Felix (2017). "Open Source Projects as Incubators of Innovation. From Niche Phenomenon to Integral Part of the Software Industry" (PDF). Stuttgart: Research Contributions to Organizational Sociology and Innovation Studies 2017-03.
- Sustainable Open Source, a Confluence article providing guidelines for fair participation in the open source ecosystem, by Radovan Semancik
- Zhou, Zach Zhizhong; Choudhary, Vidyanand (2022). "Impact of Competition from Open Source Software on Proprietary Software". Production and Operations Management. 31 (2): 731–742. doi:10.1111/poms.13575.
External links
[edit]- The Open Source Initiative's definition of open source
- Free / Open Source Research Community — Many online research papers about Open Source
Open-source software
View on GrokipediaCore Concepts
Definition and Principles
Open-source software consists of computer programs whose source code is made publicly available under a license that permits users to study, modify, and distribute the code, either as-is or in altered form, while adhering to specified conditions. This model contrasts with proprietary software, where source code access is restricted to protect intellectual property. The Open Source Initiative (OSI), established in 1998, serves as the primary steward of the open-source label, certifying licenses that meet its criteria to ensure interoperability and broad applicability.[7][8] The foundational principles of open-source software are enshrined in the Open Source Definition (OSD), a set of ten criteria derived from the Debian Free Software Guidelines of July 1997 and formally adopted by the OSI in 1998. These criteria emphasize practical freedoms over ideological mandates, focusing on enabling collaborative development and widespread use without imposing restrictive conditions. Key among them is the requirement for free redistribution, allowing the program and its source to be sold or given away without royalties or fees accruing to original authors. Source code must be included or readily derivable, with permissions to create and distribute derivative works, though licenses may require that modifications be clearly marked to preserve the integrity of the original author's code.[1][9] Further principles prohibit discrimination against individuals, groups, or specific fields of endeavor, ensuring the software's applicability across personal, commercial, and institutional contexts. The license itself must accompany distributions, remain product-agnostic, avoid limiting integration with other software, and apply neutrally across technologies rather than favoring particular hardware or platforms. These rules facilitate a merit-based ecosystem where contributions are evaluated on technical merit, fostering innovation through decentralized participation, as evidenced by the widespread adoption of OSI-approved licenses in projects handling billions of lines of code annually. While the OSD prioritizes usability and non-restriction, critics from the free software movement, such as the Free Software Foundation, argue it permits "non-free" practices like non-copyleft licensing, which allows proprietary derivatives, potentially undermining long-term openness.[1][10]Distinction from Related Terms
Open-source software differs from free software in its foundational philosophy and licensing scope. Free software, as articulated by the Free Software Foundation (FSF) since 1985, mandates four user freedoms— to run the program for any purpose, study and modify its workings, redistribute copies, and distribute modified versions— with an ideological commitment to ensuring these rights extend to all recipients via copyleft mechanisms.[11] Open-source software, formalized by the Open Source Initiative (OSI) in 1998, emphasizes pragmatic benefits such as accelerated development, reliability through peer review, and economic incentives, approving licenses that include permissive ones without copyleft, which the FSF rejects for potentially allowing proprietary derivatives.[1] Consequently, all free software meets open-source criteria, but certain OSI-approved licenses, like the MIT or Apache 2.0, fail FSF standards by not guaranteeing perpetual freedoms in downstream works.[12] In contrast to source-available software, open-source software requires adherence to OSI-defined freedoms for modification, distribution, and commercial use without undue restrictions. Source-available models, emerging prominently in the 2010s among venture-backed firms, provide source code visibility for inspection or limited adaptation but often bar competitive redistribution, SaaS deployment, or paid feature extensions, as seen in licenses like Business Source License (BSL) or Commons Clause.[13] Examples include Redis adopting source-available terms in 2024 to curb "free-riding" by cloud providers, disqualifying it from OSI approval.[14] This distinction preserves developer control at the expense of communal reuse, positioning source-available as a hybrid between proprietary and fully open models.[15] Open-source software also contrasts with public domain releases, which waive copyright entirely, allowing unrestricted use without a license. While public domain software permits viewing and modification akin to permissive open-source licenses, it lacks explicit grant language ensuring enforceability across jurisdictions, leading the OSI to deem it incompatible with open-source certification since 2017.[16] For instance, public domain works may face ambiguity in patent grants or trademark issues, whereas open-source licenses standardize protections like those in the BSD license.[17] The open-core model, a commercialization strategy since the early 2000s, differentiates by offering a basic open-source component while reserving advanced features as proprietary extensions. Companies like GitLab and Elastic employ this to monetize via subscriptions for enterprise tools, ensuring the core complies with OSI licenses but gating scalability or integrations behind closed code.[18] This approach, while leveraging open-source collaboration for the base, limits full transparency and forks of premium functionality, unlike pure open-source projects where all code is modifiable and redistributable.[19] Finally, open-source software fundamentally opposes proprietary software, where source code remains inaccessible to users, restricting inspection, modification, or independent redistribution. Proprietary models, dominant pre-1980s, rely on binary distribution and end-user license agreements (EULAs) enforcing vendor control, as in Microsoft Windows, prioritizing intellectual property protection over collaborative evolution.[20] Open-source, by contrast, derives value from transparency, enabling audits for security—evidenced by over 90% of cloud infrastructure running on Linux kernels by 2023—and community-driven fixes, absent in closed ecosystems.[21]Open-Source Projects
Open-source projects are the collaborative entities through which open-source software is created, maintained, and evolved. They bring together developers, organizations, and users worldwide to build software in a transparent and inclusive manner.Implementation Principles
Open-source projects operate on principles of openness, collaboration, and community governance. Key aspects include:- Public Source Code and Transparency: All project code, history, issues, and discussions are publicly accessible, allowing anyone to inspect, learn from, or contribute to the software.
- Distributed Version Control: Most projects use Git for tracking changes, enabling branching for new features or fixes and merging via pull requests.
- Collaborative Workflow: Contributors fork repositories, develop in isolated branches, and submit pull requests. Maintainers review code for quality, security, and alignment with project goals before merging.
- Merit-Based Governance: Contributions are judged on technical merit through code reviews, often supported by automated testing and continuous integration (CI) pipelines.
- Community Tools and Platforms: Platforms like GitHub, GitLab, or self-hosted solutions provide issue tracking, wikis, forums, and CI/CD integration to facilitate global participation.
Usage Scenarios
Open-source projects support a broad range of applications and industries:- Web and Cloud Infrastructure: Projects like Apache HTTP Server, Nginx, and Kubernetes power websites, APIs, and container orchestration for scalable cloud deployments.
- Operating Systems and Servers: Linux distributions serve as the foundation for servers, supercomputers, embedded devices, and Android mobile OS.
- Databases and Data Management: Solutions such as PostgreSQL, MySQL, and MongoDB handle data storage and querying for applications of all scales.
- AI and Machine Learning: Frameworks like TensorFlow and PyTorch enable model development, training, and deployment in research and production.
- Developer Tools and Frameworks: Languages (Python, Rust), libraries, and tools accelerate software development across domains.
- Enterprise and Government: Organizations adopt OSS for cost savings, customization, security auditing, and avoiding vendor lock-in.
- Education, Research, and Personal Use: Free access supports learning, experimentation, and hobby projects without licensing barriers.
Licensing Fundamentals
Open-source software licenses are legal agreements that grant users specific freedoms to run, study, modify, and redistribute the software, provided the terms of the license are followed. These licenses must conform to the Open Source Definition (OSD), a set of ten criteria established by the Open Source Initiative (OSI) in 1997 to ensure software distribution promotes collaborative development while avoiding restrictions that hinder innovation or access.[1] The OSD requires free redistribution without fees to recipients, provision of source code, allowance for derived works under the same terms, and no discrimination against persons, groups, or fields of endeavor.[1] Licenses are broadly categorized into permissive and copyleft types, differing primarily in how they handle derivative works. Permissive licenses, such as the MIT License (first published in 1988 by the Massachusetts Institute of Technology), the BSD licenses (originating from the University of California, Berkeley in the 1980s), and the Apache License 2.0 (released by the Apache Software Foundation in 2004), allow users to modify and redistribute the software, including in proprietary products, with minimal obligations beyond retaining copyright notices and disclaimers. The MIT License, for instance, permits commercial use, modification, and distribution without requiring the release of source code for derivatives, making it highly compatible with closed-source software. Apache 2.0 adds an explicit patent grant and requires notices for modifications, providing stronger protections against patent litigation compared to simpler BSD variants. Copyleft licenses, exemplified by the GNU General Public License (GPL), enforce reciprocity by mandating that derivative works be licensed under the same terms, ensuring continued openness. The GPL family, developed by the Free Software Foundation starting with version 1 in 1989, version 2 in 1991, and version 3 in 2007, guarantees the four essential freedoms: to run the program, study and modify it, redistribute copies, and distribute modified versions.[22] Strong copyleft like GPL v3 propagates to combined works, preventing proprietary enclosure, while weaker variants like the GNU Lesser General Public License (LGPL) allow linking with proprietary code without forcing its source release.[22] This "viral" aspect of copyleft has been both praised for preserving communal access and criticized for limiting adoption in commercial contexts.[22] All open-source licenses typically disclaim warranties, stating the software is provided "as is" without guarantees of fitness or merchantability, and require attribution to original authors. Compatibility between licenses is crucial; for example, permissive licenses are broadly compatible, but mixing GPL with permissive code may require relicensing under GPL for distributions. The OSI maintains an approved list of over 80 licenses as of 2023, certifying compliance with the OSD, though not all free software licenses qualify as open source due to additional restrictions.[1] Selection of a license involves balancing developer intent for openness against practical needs for adoption and integration.Historical Evolution
Early Origins and Precursors
In the 1950s, computer software was typically bundled with hardware purchases from manufacturers like IBM, with source code often provided at no additional cost to enable customization by institutional users, who were primarily researchers and scientists focused on advancing computational capabilities rather than commercial exploitation.[23] This practice reflected the era's emphasis on collaborative problem-solving in academia and industry labs, where software served as a tool for scientific inquiry. A key institutional precursor emerged in 1955 with the formation of SHARE, a volunteer user group initiated by users of IBM's 701 and 704 mainframes in the Los Angeles area to facilitate the exchange of programs, documentation, and technical information among members.[24] SHARE's activities, including software libraries and meetings to discuss modifications, established early norms for peer-to-peer code sharing and collective influence on vendor development, predating formalized licensing by decades.[25] The 1960s saw these practices evolve with the advent of timesharing systems, which allowed multiple users to interact concurrently with a single machine, fostering incremental collaborative development in academic settings like MIT's Project MAC.[26] Researchers exchanged source code via physical media such as magnetic tapes, enabling iterative improvements without proprietary barriers, as software was viewed as a communal resource for experimentation rather than a marketable product.[27] The deployment of ARPANET in 1969 further accelerated this by connecting research institutions, permitting distributed collaboration on code across geographically separated teams and laying infrastructural groundwork for networked software distribution.[28] By the 1970s, these precedents crystallized in projects like Unix, initially developed at Bell Labs between 1969 and 1971 by Ken Thompson and Dennis Ritchie on a PDP-7 minicomputer, with subsequent versions ported to PDP-11 systems.[29] AT&T distributed Unix source code non-commercially to universities and research entities starting in the early 1970s, often via tape, which spurred widespread modifications and variants such as the Berkeley Software Distribution (BSD) released in 1977 by the University of California, Berkeley.[30] This distribution model emphasized source availability for adaptation, mirroring earlier sharing ethos but scaling it through minicomputers' affordability, thus providing a direct technical precursor to later open-source paradigms by demonstrating the viability of community-driven evolution absent restrictive copyrights.[31]Institutionalization of the OSS Movement
The institutionalization of the open-source software (OSS) movement gained momentum in the late 1990s through the establishment of formal organizations dedicated to standardization, advocacy, and governance. The Open Source Initiative (OSI) was founded in 1998 by figures including Eric S. Raymond and Bruce Perens during a strategy session in Palo Alto, California, to reframe the collaborative software development ethos in terms appealing to businesses and developers focused on pragmatic outcomes rather than ideological purity.[32] This shift addressed the limitations of the earlier "free software" terminology, which emphasized user freedoms and ethical imperatives as articulated by Richard Stallman via the Free Software Foundation (FSF), established in 1985.[10] Stallman criticized the "open source" branding for diluting these principles by highlighting development efficiencies over moral obligations, yet it enabled wider institutional acceptance by decoupling software sharing from political connotations.[10] Central to this institutionalization was the OSI's creation of the Open Source Definition (OSD) in 1998, adapted from the Debian Free Software Guidelines, which outlined ten criteria for licenses to qualify as open source, including free redistribution, source code availability, and allowance for derived works.[1] The OSI established a review process to approve licenses meeting these standards, with initial approvals including the GNU General Public License (GPL), Berkeley Software Distribution (BSD) license, and Mozilla Public License by the late 1990s, providing a certification mechanism that assured compatibility and legal clarity for contributors and adopters.[33] By standardizing terminology and criteria, the OSI fostered interoperability across projects and mitigated risks associated with proprietary restrictions, laying groundwork for scalable collaboration.[8] Parallel to OSI efforts, non-profit foundations emerged to provide fiscal sponsorship, legal support, and project stewardship, transitioning ad-hoc hacker collectives into structured entities. Software in the Public Interest (SPI) was incorporated on June 16, 1997, in New York to support Debian and other initiatives with infrastructure and tax-exempt status.[34] The Apache Software Foundation (ASF) followed in 1999, incorporating as a 501(c)(3) entity to oversee the Apache HTTP Server project, which had originated informally in 1995, and to enforce merit-based governance models emphasizing community consensus.[35] These organizations enabled sustainable funding through donations and sponsorships, professionalized contributor agreements, and protected intellectual property while preserving openness, marking a causal shift from volunteer-driven chaos to resilient institutional frameworks that supported OSS's expansion.[36] This period of formalization, spanning 1997–1999, correlated with increased corporate engagement, as evidenced by initial investments in OSS infrastructure, though it also introduced tensions over commercialization versus purity, with empirical growth in project maturity underscoring the efficacy of structured oversight.Growth and Mainstream Adoption (2000s-2010s)
The Apache HTTP Server solidified its position as the dominant web server software during the 2000s, powering over 50% of active websites by the early part of the decade and maintaining market shares often exceeding 60% through modular extensibility and community-driven enhancements.[37] Concurrently, Linux distributions gained substantial traction in server environments, with Linux achieving 27% of the server operating system market share in 2000, up from 25% the prior year, driven by cost efficiencies and reliability in enterprise deployments.[38] Companies like IBM increased investments, committing billions to Linux development by 2003, which accelerated adoption in data centers and high-performance computing.[38] Enterprise adoption of open-source software expanded notably in the mid-2000s, facilitated by user-friendly distributions such as Ubuntu, released in October 2004 by Canonical, which emphasized ease of installation and regular updates to broaden appeal beyond technical users. Surveys of U.S. Fortune 1000 firms indicated growing integration of open-source components for infrastructure, with factors like reduced licensing costs and customization flexibility cited as primary drivers.[39] By 2010, private sector penetration reached 44%, reflecting mainstream acceptance in business operations despite lingering concerns over support and security.[40] The 2010s marked explosive growth in mobile and collaborative ecosystems, propelled by Android's open-source foundation. Announced in 2005 and first commercially released in September 2008 via the Open Handset Alliance, Android's shipments surged, capturing nearly 900% year-over-year growth from 2009 to 2010 and reaching 65.9% of the global mobile operating system market by 2015 through fragmentation-tolerant licensing and hardware partnerships.[41][42] Platforms like GitHub, launched in April 2008, further democratized development, hosting millions of repositories by the early 2010s and enabling distributed version control that scaled open-source contributions across global teams.[43] This era also saw open-source integration in cloud infrastructure precursors, underscoring causal links between permissive licensing, rapid iteration, and market dominance in high-volume sectors.[44]Contemporary Developments (2020s)
In the early 2020s, open-source software (OSS) adoption accelerated dramatically, powering 96% of modern applications by late 2024 according to a comprehensive study aggregating data from scanned codebases.[45] Annual OSS package downloads reached projections of over 6.6 trillion in 2024, reflecting sustained growth driven by cloud-native architectures and containerization tools like Kubernetes, which saw enterprise deployments expand amid hybrid work shifts post-2020.[46] The OSS services market expanded to an estimated $50 billion by 2025, with a 15% compound annual growth rate, fueled by demand for alternatives to proprietary monopolies, prioritized by 49% of global stakeholders in 2024 surveys.[47][48] A pivotal development was the integration of OSS with artificial intelligence, particularly large language models (LLMs), where open-source variants proliferated to counter proprietary dominance. By mid-2025, nearly all software developers had experimented with open AI models, and 63% incorporated them into production workflows, enabling cost reductions of up to 60% compared to closed alternatives per enterprise surveys.[49][50] Frameworks like LangChain and AutoGen emerged as key enablers for agentic AI applications, while initiatives from organizations including the Linux Foundation promoted transparent model releases to foster innovation without vendor lock-in.[51] This shift democratized AI capabilities, with open models addressing privacy concerns through auditable codebases, though it amplified dependencies on community-maintained components.[52] Security challenges intensified alongside growth, with open-source supply chain attacks tripling since 2019 due to expanded attack surfaces from unvetted dependencies.[53] The average application incorporated over 16,000 OSS files by 2025, a threefold increase from 2020 levels, heightening vulnerability risks from poor oversight and legacy code persistence.[54][55] In response, trends toward long-term support (LTS) models gained traction among enterprises by 2025, emphasizing sustained maintenance to mitigate exploits, while tools for automated vulnerability scanning proliferated.[56] These developments underscored causal trade-offs: OSS's collaborative model accelerated innovation but required rigorous governance to counter adversarial insertions in widely used libraries.Development and Collaboration
Open Development Model
The open development model of open-source software involves decentralized, collaborative processes where source code is maintained in public repositories, enabling global contributors to propose, review, and integrate changes through transparent mechanisms. This approach relies on distributed version control systems, such as Git introduced in 2005 by Linus Torvalds for Linux kernel development, which facilitate branching for experimental work, forking to create independent variants, and pull requests for submitting modifications. Discussions and decision-making occur via public channels like mailing lists, issue trackers, and code review platforms, ensuring that contributions are evaluated on technical merit rather than contributor identity.[57] Eric S. Raymond formalized aspects of this model in his 1997 essay "The Cathedral and the Bazaar," contrasting it with centralized proprietary methods by advocating "release early, release often" to harness collective debugging, encapsulated in Linus Torvalds's principle that "given enough eyeballs, all bugs are shallow." Empirical observation of projects like the Linux kernel, which by 2023 incorporated over 30 million lines of code from thousands of contributors annually, demonstrates how frequent iterations and peer scrutiny accelerate defect identification and resolution compared to isolated teams. Core practices include automated testing via continuous integration tools, adherence to coding standards enforced through maintainer oversight, and modular design to lower barriers for partial contributions. Meritocracy governs acceptance, where maintainers—often volunteers or designated leads—apply criteria like functionality, efficiency, and compatibility, rejecting submissions that fail scrutiny regardless of origin. This has scaled to ecosystems like GitHub, hosting over 420 million repositories as of 2024, where fork-based experimentation allows parallel innovation without disrupting the mainline codebase.[58] Challenges arise from coordination overhead, as uncoordinated changes can introduce conflicts, necessitating tools like semantic versioning (introduced in 2010 by Tom Preston-Werner for semantic-release) to manage dependencies and API stability. Studies of Apache projects show that such models yield higher code churn rates—up to 10 times proprietary equivalents—but correlate with faster feature delivery due to diverse input.[59]Tools and Platforms
Git, a distributed version control system, serves as the foundational tool for managing source code in the majority of open-source projects, enabling developers to track changes, create branches, and merge contributions asynchronously. Developed by Linus Torvalds and released on April 7, 2005, initially to handle versioning for the Linux kernel, Git's design emphasizes speed, data integrity via cryptographic hashing, and decentralized workflows that reduce reliance on central servers.[60] Its adoption stems from these efficiencies, with over 90% of professional developers using it as of 2023 surveys, facilitating large-scale collaboration without performance bottlenecks seen in earlier centralized systems like Subversion.[61] Code hosting platforms build on Git to provide centralized repositories, social coding features, and integration ecosystems tailored for open-source workflows. GitHub, founded in 2008 by Tom Preston-Werner, Chris Wanstrath, and PJ Hyett, introduced pull requests in 2008 to streamline code reviews and forking mechanisms that lower barriers to contribution, hosting millions of repositories and powering events like Hacktoberfest to encourage participation.[62] Acquired by Microsoft in 2018 for $7.5 billion, it integrates GitHub Actions, a CI/CD service launched in 2019, allowing automated workflows directly within repositories using YAML-defined pipelines.[63] GitLab, emerging as an open-source Git repository manager in 2011 under Dmitriy Zaporozhets, differentiates through its all-in-one DevOps platform, including built-in CI/CD via GitLab CI introduced in 2014, which supports self-hosted instances and granular permissions, appealing to privacy-focused projects.[64] SourceForge, launched in 1999 by VA Linux Systems, pioneered web-based OSS hosting with support for multiple version control systems like CVS and later Git, but its popularity waned post-2010 due to perceived commercialization and slower innovation compared to GitHub.[65] Continuous integration and continuous delivery (CI/CD) tools automate testing and deployment, critical for maintaining open-source project velocity. Jenkins, an open-source automation server forked from Hudson in 2011 and maintained by the CloudBees community, dominates with over 1,800 plugins for extensible pipelines, used in projects like Apache Software Foundation repositories for build orchestration.[66] Travis CI, originating in 2011 and optimized for GitHub-hosted open-source repositories, provides hosted builds with simple YAML configuration, processing millions of builds annually before its 2021 acquisition by Idera's Kubermatic division shifted focus to enterprise.[67] These tools integrate with platforms to enforce code quality, with empirical data showing CI/CD adoption correlating to 20-30% faster release cycles in OSS ecosystems via reduced manual errors.[68] Other utilities, such as Docker for containerization (released 2013) and Kubernetes for orchestration (initially 2014), further enable reproducible builds across distributed contributors.[69]Contributor Participation
Contributions to open-source software projects encompass a range of activities beyond coding, including reporting bugs, writing documentation, translating materials, designing user interfaces, moderating discussions, and providing financial support. These non-code contributions often lower barriers for newcomers and sustain project health, with documentation improvements and issue triage comprising significant portions of activity in mature repositories.[70][71] In 2023, developers worldwide generated 301 million contributions to open-source projects hosted on GitHub, reflecting a surge driven by AI-related tools and broader developer engagement. Globally, approximately 2.5 million individuals actively contributed to open-source efforts that year, marking a 15% increase from prior periods amid rising adoption in emerging regions like Asia and Latin America. Corporate participation is substantial, with firms such as Google reporting that 10% of their full-time employees contributed in 2023, often to external projects comprising over 70% of their open-source output.[72][73][74] Demographic data indicate a skew toward male participants from North America and Europe, though shares from Asia, Eastern Europe, and Latin America have grown significantly since 2010, diversifying the contributor base. Empirical analyses of GitHub repositories confirm this geographic expansion correlates with increased project velocity in those areas. Motivations for participation blend intrinsic factors like skill enhancement and enjoyment with extrinsic ones such as reputational gains and career advancement; software-focused contributors, in particular, prioritize self-development and signaling expertise over ideological or reciprocal drivers.[75][76] Sustained participation faces hurdles, including poor onboarding documentation, maintainer overload, and a "contributor funnel" where most initial engagements fail to progress to meaningful commits due to unclear guidelines or rejection of novice pull requests. Studies highlight that only a small fraction of users—often under 20%—make repeated contributions, exacerbating dependency on core teams and risking burnout. Projects mitigate this through structured guides and mentorship, yet coordination challenges persist as contributor volume rises, introducing risks like code conflicts and security oversights.[77][78][79]Empirical Advantages
Innovation Acceleration
The open-source software (OSS) model accelerates innovation by enabling distributed, parallel development across global contributors, who can review, modify, and integrate code without centralized approval barriers, thereby shortening feedback loops and iteration times compared to proprietary systems confined to internal teams.[80] This structure facilitates forking, where developers create variants to experiment with novel features or fixes, merging successful changes back into the main project via mechanisms like pull requests, which empirically correlates with intensified synchronization between software contributions and patent filings among organizations.[81] For instance, analysis of 98 prominent OSS projects over 20 years shows that 1,556 organizations, representing 48% of contributions to these projects, aligned OSS activity with 26.6% of U.S. patents granted, with this linkage growing over time particularly in permissively licensed repositories.[81] Empirical studies confirm OSS's edge in development velocity, as projects often adopt rapid release cycles that do not proportionally increase defect rates. In the case of Mozilla Firefox, the transition to shorter release intervals starting in 2011—reducing from multi-month to six-week cycles—resulted in no significant rise in pre- or post-release bugs on a percentage basis, allowing quicker delivery of security updates and features to compete against proprietary browsers like Internet Explorer.[82] Similarly, OSS firms demonstrate accelerated market traction, raising Series A funding 20% faster and Series B 34% faster than proprietary counterparts, with 91% advancing from seed to Series A versus 48% industry-wide, attributing this to transparent collaboration that signals robust innovation potential to investors.[83] Domain-specific accelerations are evident in fields like artificial intelligence, where OSS frameworks enable community-driven model improvements at paces unattainable in closed ecosystems. Meta's Llama models, for example, achieved 1.2 billion downloads by mid-2025, establishing industry benchmarks through collective refinements that outstrip proprietary timelines, while broader OSS AI adoption has been linked to over 50% cost reductions in business applications via faster, collaborative enhancements.[80] Projects like the Linux kernel further exemplify this, evolving through daily integration of thousands of patches from disparate contributors since its 1991 inception, powering innovations in cloud computing and embedded systems that proprietary alternatives struggled to match in adaptability.[84] These dynamics underscore OSS's causal role in compressing innovation timelines, though outcomes depend on community scale and license permissiveness.[81]Productivity and Cost Benefits
Open-source software (OSS) eliminates proprietary licensing fees, enabling organizations to deploy robust systems without recurring costs that can exceed millions annually for enterprise-scale implementations. For instance, a 2024 Harvard Business School analysis estimated that the freely available OSS codebase underpinning global software infrastructure equates to $8.8 trillion in avoided development expenses if firms were required to replicate it independently. This valuation derives from applying economic replacement cost models to OSS contributions tracked via repositories like GitHub, highlighting direct fiscal relief particularly for startups and resource-constrained entities. In scientific and research domains, OSS adoption yields quantified savings of up to 87% in tool acquisition and maintenance relative to proprietary alternatives, as evidenced by a 2020 review of empirical cases across disciplines including bioinformatics and data analysis.[85] These reductions stem from zero upfront costs and communal maintenance, though total ownership costs may include internal integration efforts; nonetheless, net savings persist due to scalable reuse without vendor lock-in.[85] On productivity, OSS facilitates accelerated development cycles through modular reuse and community-driven enhancements, yielding measurable firm-level gains. A 2018 study in Management Science analyzed U.S. firm data and found that nonpecuniary OSS adoption correlates with significant value-added productivity increases, attributed to reduced reinvention of core functionalities and enhanced interoperability.[86] Complementing this, empirical research on software economics demonstrates that OSS integration boosts development productivity by enabling faster prototyping and bug resolution via distributed contributions, with organizations reporting up to 20-30% efficiency improvements in controlled adoption scenarios.[87] Recent surveys underscore these dynamics in enterprise contexts, where OSS drives faster time-to-market; for example, Linux Foundation research from 2023 identifies expedited development as a top benefit, with 60% of respondents citing reduced timelines due to pre-built, customizable components.[88] In AI subsets, open-source models further amplify productivity, enabling 50%+ reductions in business unit development costs through shared benchmarks and iterative improvements.[80] These advantages hold across scales, from individual developers leveraging libraries like TensorFlow to corporations optimizing infrastructure with Linux distributions.[89]Evidence from Economic Studies
A 2024 Harvard Business School working paper estimated the economic value of open-source software (OSS) at $8.8 trillion annually for U.S. firms, based on the replacement cost of OSS codebases that appear in 96% of commercial applications; this figure reflects avoided development expenses and enhanced productivity from freely accessible code. The analysis, drawing from code scanning data across industries, attributes these savings to OSS's role as a public good that reduces duplication of effort in foundational software layers. Research from the Linux Foundation, surveying over 1,000 technical decision-makers in 2023, identified cost savings as the top benefit of OSS adoption, cited by 70% of respondents, followed by accelerated development cycles that shorten time-to-market by an average of 25-50% in enterprise environments; these gains were reported to exceed implementation costs for 85% of organizations.[88] Led by open innovation scholar Henry Chesbrough, the study emphasized OSS's facilitation of interoperability and reduced vendor lock-in as causal drivers of net positive returns on investment.[88] Empirical analyses of OSS in software development processes have demonstrated productivity uplifts, with one 2009 study of adopting firms finding statistically significant reductions in per-module development costs—up to 30% lower than proprietary equivalents—due to reusable code and community-driven debugging efficiencies.[87] In scientific and research applications, a 2020 review of 20 tools across domains reported average cost savings of 87% from OSS versus closed-source alternatives, primarily through eliminated licensing fees and scalable maintenance.[85] Broader macroeconomic assessments, such as a 2020 report on U.S. OSS impacts, calculated that OSS sustains 1.3 million jobs with wages 40% above national averages, contributing $121 billion in annual value added through skill diffusion and innovation spillovers across sectors.[90] These findings align with models treating OSS as intangible capital, where a 2018 IMF framework quantified its economy-wide productivity boost via lowered barriers to software customization and integration.[91]Criticisms and Limitations
Security and Vulnerability Risks
Open-source software's publicly accessible source code facilitates scrutiny by security researchers, potentially enabling rapid identification and remediation of flaws, yet it simultaneously exposes code to adversaries who can analyze it for exploitable weaknesses without barriers.[92] This dual nature has led to documented vulnerabilities proliferating at rates exceeding those in closed-source counterparts, with empirical analyses indicating an annual growth of 98% in reported open-source vulnerabilities from 2015 to 2023, compared to a 25% baseline across all software.[93] Such escalation stems partly from the widespread adoption of open-source components, amplifying the attack surface through transitive dependencies in modern applications, where a single project may incorporate thousands of libraries prone to unpatched issues.[94] High-profile incidents underscore these risks, including the Log4Shell vulnerability (CVE-2021-44228) in the Apache Log4j library, disclosed on December 9, 2021, which permitted remote code execution via crafted log messages and affected millions of systems globally due to Log4j's ubiquity in Java-based applications.[95] Exploitation attempts surged into the millions within days, highlighting delays in patching across under-resourced volunteer-maintained projects, where initial fixes required coordinated efforts from organizations like Apache and vendors such as Red Hat.[96] Similarly, the XZ Utils backdoor (CVE-2024-3094), uncovered on March 29, 2024, involved a state-affiliated actor embedding malicious code over two years to gain maintainer trust, enabling remote code execution in Linux distributions via compromised compression utilities integrated into core systems.[97] This supply-chain compromise evaded detection through gradual code alterations, revealing vulnerabilities in governance for low-contributor projects.[98] Supply-chain attacks exploit open-source ecosystems' reliance on unvetted contributions and automated dependency resolution, as seen in cases where malicious packages infiltrate repositories like npm or PyPI, propagating to downstream users.[99] Empirical studies of GitHub-hosted projects identify persistent weaknesses, such as inadequate vulnerability disclosure practices, with over 3,600 analyzed patches from the National Vulnerability Database showing delays averaging weeks to months in volunteer-driven fixes.[100] While proprietary software obscures flaws, potentially concealing equivalent risks, open-source transparency mandates public CVE listings, inflating visible counts—yet underfunding and contributor burnout exacerbate unmitigated exposures, as evidenced by OWASP's classification of known unpatched components as the top open-source risk.[101] Comparative data suggest no inherent superiority in vulnerability density, but open-source's scale demands rigorous scanning and software bills of materials (SBOMs) to manage inherited flaws.[102]Sustainability Challenges
Open-source software sustainability is undermined by chronic underfunding and overreliance on voluntary contributions, which expose projects to risks of stagnation or abandonment. A majority of maintainers report operating without dedicated budgets, forcing them to balance project upkeep against personal or professional demands, often resulting in delayed updates or unresolved issues. For instance, in cloud-native ecosystems, corporate consumption vastly outpaces reciprocal contributions, creating a "free-rider" dynamic where beneficiaries extract value without sustaining the underlying codebases.[103] Maintainer burnout represents a core vulnerability, driven by the psychological toll of uncompensated labor amid escalating expectations for security patches, feature enhancements, and compatibility fixes. Surveys of open-source maintainers indicate that burnout rates are elevated due to this imbalance, with many citing exhaustion from handling user demands without proportional support; in one analysis of critical infrastructure projects, maintainers described quitting as a direct outcome of unsustainable workloads.[104][105] This issue is compounded by the concentration of effort: empirical data from large repositories show that a small cadre—often fewer than 10 individuals—shoulders the bulk of maintenance for widely used libraries, amplifying single points of failure when individuals depart.[106] Long-term viability is further threatened by end-of-life (EOL) decisions and skills gaps, as organizations struggle to allocate resources for legacy OSS amid shifting priorities. A 2025 industry report notes that enterprises express low confidence in managing OSS lifecycles, with many projects reaching EOL without viable successors due to depleted contributor pools.[107][108] While funding models like corporate sponsorships (e.g., via foundations) have emerged, they cover only a fraction of needs; for example, less than 20% of projects receive substantial financial backing, leaving the majority vulnerable to entropy despite technical efficiencies in code modularity. These dynamics underscore a causal mismatch: the public-good nature of OSS incentivizes widespread adoption but disincentivizes proportional investment, perpetuating cycles of crisis.Quality and Fragmentation Issues
Open-source software (OSS) projects often face quality challenges arising from decentralized development processes, where code contributions from diverse, sometimes uncoordinated volunteers can introduce inconsistencies and defects. Empirical analyses of OSS repositories, such as those examining bug-tracking data from projects like Apache and Mozilla, reveal patterns of higher initial defect densities compared to proprietary counterparts, attributed to the absence of centralized quality assurance teams and rigorous pre-release testing protocols.[109] For instance, a study of nine general-purpose OSS systems found elevated vulnerability rates linked to code complexity and irregular review cycles, underscoring how volunteer-driven maintenance can lag behind professionalized proprietary workflows.[110] While popular OSS like the Linux kernel benefits from large contributor pools enabling rapid fixes, smaller or niche projects frequently suffer from incomplete documentation, unaddressed edge cases, and stalled updates due to contributor burnout or shifting priorities. Systematic reviews of OSS quality models highlight that metrics such as maintainability and reliability vary widely, with many projects lacking formal metrics for usability or performance optimization, leading to perceptions of lower polish in user-facing applications.[111] In comparisons, proprietary software typically enforces uniform standards through vendor-controlled releases, reducing variability but at the cost of flexibility; OSS, conversely, trades this for adaptability, though empirical evidence from adoption studies shows quality shortfalls deterring enterprise uptake in mission-critical scenarios.[112] Fragmentation in OSS manifests as the proliferation of forks, variants, and distributions, diluting resources and complicating interoperability. In the Linux ecosystem, over 270 active distributions as of 2019 exemplify this, resulting in duplicated development efforts, inconsistent patching timelines, and heightened complexity for hardware vendors seeking broad compatibility.[113] This leads to slower bug resolution and feature rollout across variants, as maintainers split focus rather than converging on upstream improvements, a dynamic Linux creator Linus Torvalds has cited as a barrier to desktop market penetration.[114] Such fragmentation extends beyond kernels to libraries and applications, where incompatible forks—evident in web development tools like competing JavaScript utilities—exacerbate integration challenges and inflate support costs for users and enterprises. Reports from industry bodies note that while fragmentation fosters experimentation, its downsides include elevated testing burdens and vulnerability to unpatched divergences, particularly in ecosystems like Android where vendor customizations fragment security updates.[115] Strategies to mitigate this, such as upstream prioritization and modular standards, remain unevenly adopted, perpetuating inefficiencies in resource-constrained OSS communities.[116]Legal Framework
Major License Types
Permissive licenses, such as the MIT License and Apache License 2.0, impose minimal restrictions on the use, modification, and distribution of software, allowing recipients to incorporate the code into proprietary products without requiring the disclosure of modifications or source code beyond basic attribution.[117] These licenses prioritize broad accessibility and compatibility with closed-source development, fostering adoption in commercial environments.[118] The MIT License, first formulated at the Massachusetts Institute of Technology in the late 1980s for projects like X Window System distributions, requires only that the original copyright notice and permission statement be included in all copies or substantial portions of the software.[119] [120] Its brevity—spanning fewer than 200 words—has contributed to its status as one of the most popular licenses, used in over 40% of open-source projects on platforms like GitHub as of 2023.[121] The Apache License 2.0, introduced by the Apache Software Foundation in 2004, extends permissive terms with explicit grants of patent rights from contributors, protecting users against future patent litigation by original developers, and mandates notices for any changes made to the licensed material. [122] This makes it suitable for enterprise software, as evidenced by its use in projects like Android and Hadoop, where patent clarity reduces legal risks in collaborative ecosystems.[123] Other permissive variants, such as the BSD licenses, similarly emphasize few obligations beyond disclaiming warranties and retaining copyright notices.[17] Copyleft licenses, exemplified by the GNU General Public License (GPL), enforce reciprocity by mandating that any derivative works or distributions incorporating the software must be released under the same license, thereby preserving the availability of source code for all users.[22] The GPL version 1 was published by the Free Software Foundation on February 25, 1989, to ensure freedoms to run, study, modify, and redistribute software while preventing proprietary enclosures.[124] Version 2, released in June 1991, clarified compatibility with other licenses and addressed distribution requirements, such as providing source code alongside binaries or offering access to it.[125] By 2023, GPLv2 powered core components of the Linux kernel, which runs on over 90% of public cloud workloads.[118] GPL version 3, issued on June 29, 2007, strengthened protections against "tivoization"—hardware restrictions blocking user modifications—and added patent retaliation clauses to counter software patent threats.[22] The GNU Lesser General Public License (LGPL), version 2.1 from 1991 and version 3 from 2007, relaxes these rules for libraries, permitting linkage with proprietary code without forcing the entire application open, thus enabling hybrid developments like dynamically linked libraries in desktop applications. The Affero GPL variant extends copyleft to network-deployed software, requiring source disclosure for web-accessible modifications, addressing SaaS models where traditional GPL enforcement is limited.[118]| License | Category | Core Permissions | Key Obligations | Notable Adoption Example |
|---|---|---|---|---|
| MIT | Permissive | Use, modify, distribute (including proprietary) | Retain copyright/license notice | Node.js, Ruby on Rails |
| Apache 2.0 | Permissive | Use, modify, distribute; patent grant | Notice changes, state contributions separately | Apache HTTP Server, Kubernetes |
| GPLv2 | Strong Copyleft | Use, modify, distribute if source provided | Derivatives under GPLv2; source with binaries | Linux kernel |
| GPLv3 | Strong Copyleft | As GPLv2, plus anti-tivoization | As GPLv2, plus install/modify rights on hardware | MySQL (dual-licensed) |
| LGPLv3 | Weak Copyleft | Link to proprietary; relinkable libraries | Libraries modifiable/replaceable; source for changes | GTK+, FFmpeg libraries |
Compliance and Disputes
Open-source software compliance requires organizations to identify all incorporated components, verify their licenses, and fulfill obligations such as attributing copyrights, providing source code for copyleft-licensed modifications (e.g., under the GNU General Public License version 2 or 3), and avoiding incompatible combinations like pairing GPL code with proprietary binaries without disclosure.[128] Failure to comply can expose entities to breach-of-contract claims, as courts in multiple jurisdictions have upheld open-source licenses as enforceable agreements.[129] A 2024 report found license conflicts in 53% of audited codebases, often stemming from untracked dependencies or misinterpretations of terms like "derivative works."[130] Best practices include maintaining a software bill of materials (SBOM) for dependency tracking, conducting automated scans with tools like Black Duck or FOSSology to detect obligations, and establishing internal policies for review gates in development pipelines.[131] [132] Regular audits mitigate risks, particularly for copyleft licenses requiring source distribution upon binary release, while permissive licenses like MIT demand only notices.[133] Non-compliance often arises from incomplete inventories or assumptions that open-source use imposes no restrictions, leading to inadvertent violations in embedded systems or SaaS products.[134] Disputes frequently involve copyleft enforcement by copyright holders or organizations like the Software Freedom Conservancy (SFC) and Software Freedom Law Center (SFLC). In Entr'ouvert v. Orange (2011–2024), a French appeals court ruled Orange violated GPLv2 by distributing modified LASSO software without source code in its public portal, awarding €800,000 in damages plus interest, affirming individual standing to sue.[135] Similarly, SFC v. Vizio (2021) alleged GPLv2 breaches in smart TV firmware lacking required source releases, with the case testing third-party enforcement rights under California law.[136] SFLC's 2009 BusyBox suits against firms like Best Buy, Samsung, and Westinghouse settled multiple claims of undistributed sources in devices, yielding compliance commitments without public damage figures.[137] Other cases highlight escalation risks: CoKinetic Systems v. Panasonic Avionics (2020) sought $100 million for alleged GPLv2 violations in avionics software, underscoring potential financial penalties.[138] In Sebastian Steck v. AVM (recently affirmed), a German court enforced GPL terms, reinforcing that non-compliance can result in injunctions, back-payments, and reputational damage across Europe and the U.S.[129] These disputes demonstrate causal links between poor tracking and litigation, with outcomes varying by jurisdiction but consistently validating license conditions as binding, prompting enterprises to prioritize proactive scanning over reactive fixes.[139]Intellectual Property Interactions
Open-source software (OSS) fundamentally engages with intellectual property (IP) rights through copyright mechanisms, as OSS licenses operate as permissive or restrictive grants under copyright law, allowing users to access, modify, and redistribute source code while requiring attribution and, in copyleft variants like the GNU General Public License (GPL) version 2 released in 1991, preservation of freedoms for derivatives.[140] These licenses do not eliminate copyright ownership—contributors retain it—but shift from exclusive control to conditional sharing, enabling collaborative development while imposing obligations to avoid proprietary enclosure of shared code.[141] Copyright in OSS protects the specific expression of code rather than underlying ideas, facilitating forks and improvements but risking infringement if unmodified proprietary elements are incorporated without compliance.[142] Patents introduce additional tensions, as software patents grant 20-year monopolies on inventions, potentially conflicting with OSS's disclosure ethos; however, licenses like Apache 2.0, introduced in 2004, explicitly include patent grants from contributors, promising non-assertion or licensing of related patents to recipients, thereby mitigating litigation risks in ecosystems like Android.[140] OSS code publication creates prior art that can invalidate subsequent patent claims, as seen in defenses against "patent trolls" targeting OSS users, though undisclosed patents held by contributors can still expose downstream adopters to enforcement, exemplified by cases where companies like Oracle asserted Java-related patents against Google in 2010 despite OSS elements in Android.[143] Empirical data from the Open Source Initiative indicates that explicit patent clauses in licenses have proliferated since the early 2000s to foster trust, yet surveys by Black Duck Software in 2023 reported that 96% of codebases contain OSS, heightening inadvertent patent infringement exposure without systematic audits.[144] Trademarks apply orthogonally to OSS, protecting project names, logos, and branding to prevent consumer confusion rather than code functionality; for instance, the Linux Foundation enforces trademarks on "Linux" since 1994, allowing free code use but prohibiting misleading commercial endorsements.[145] This preserves community goodwill without restricting source availability, though disputes arise when forks misuse marks, as in the 2005 SCO Group litigation alleging trademark and copyright violations against Linux distributors, ultimately dismissed in favor of OSS in 2010 rulings.[146] Legal challenges persist in hybrid environments, where integrating OSS with proprietary software risks "infection" under copyleft terms, compelling source disclosure and eroding trade secret value, as highlighted in a 2024 analysis of compliance failures leading to multimillion-dollar settlements.[144] Patent assertions against OSS, often by non-practicing entities, numbered over 1,000 annually by 2022 per RPX Corporation data, prompting defensive strategies like patent pools (e.g., LOT Network joined by over 800 firms since 2016) to neutralize threats collectively.[143] Such interactions underscore OSS's reliance on license enforcement over traditional IP exclusivity, with empirical studies showing reduced innovation barriers but elevated due diligence costs for adopters.[147]Economic Dimensions
Business Models and Funding
Open-source software projects sustain operations through diverse business models that leverage the freely available source code while monetizing complementary value, such as enterprise-grade support, proprietary extensions, or hosted services. A prominent model is the subscription-based support and services approach, exemplified by Red Hat, which provides certified updates, security patches, and technical assistance to enterprise customers under long-term contracts.[148] This model generated over $6.5 billion in annual revenue for Red Hat by 2024, following its 2019 acquisition by IBM for approximately $34 billion, marking the first instance of an open-source company surpassing $1 billion in revenue in 2012.[149][148] Another common strategy is the open-core model, where a basic version remains open-source to attract users and foster community contributions, while premium features, tools, or integrations are offered as proprietary add-ons for paying customers.[150] Companies like GitLab and MongoDB employ variations of this, combining community editions with enterprise subscriptions that include advanced scalability, compliance, and management capabilities. Dual licensing allows developers to offer the software under permissive open-source terms for non-commercial use and restrictive licenses for commercial redistribution, enabling revenue from those seeking to embed or resell the code without contributing back.[151] Funding for open-source development often flows through non-profit foundations that aggregate corporate sponsorships, individual donations, and grants to support maintainers and infrastructure. The Linux Foundation, for instance, channels contributions from members like Google, Microsoft, and Intel to fund projects such as the Linux kernel, with corporate backing ensuring alignment between business interests and code maintenance.[152] Similarly, the Apache Software Foundation and Eclipse Foundation rely on membership dues and grants to steward ecosystems, with Eclipse supporting Java and cloud tools through industry consortia.[153] Government initiatives, such as Germany's Sovereign Tech Fund, provide direct grants to maintainers—totaling millions of euros since 2021—for critical infrastructure projects, prioritizing sovereignty over vendor lock-in.[154] Venture capital has increasingly targeted commercial open-source startups (COSS), with investments focusing on scalable models like SaaS wrappers around open components; by 2024, the sector saw robust funding despite market volatility, driven by the estimated $8.8 trillion value of open-source code that firms would otherwise develop internally.[155][156] Crowdfunding platforms, including GitHub Sponsors and Open Collective, enable per-project donations, though these typically supplement rather than replace institutional support, with commercial services remaining the most scalable for large-scale sustainability.[154][157]Corporate Strategies
Corporations have increasingly integrated open-source software (OSS) into their operations as a strategic imperative, leveraging its cost efficiencies, interoperability, and innovation potential to align with broader business objectives such as accelerating development cycles and enhancing competitiveness in digital markets. A 2021 analysis by Boston Consulting Group emphasized that deploying OSS is essential in fast-evolving tech landscapes, enabling firms to reduce proprietary development costs—estimated at $8.8 trillion globally if OSS were recreated from scratch—and foster ecosystem dependencies that lock in customers.[158][156] This approach contrasts with earlier proprietary dominance, as companies like Microsoft shifted from opposition to active OSS embrace, becoming the largest contributor on GitHub following its 2018 acquisition of the platform for $7.5 billion, which facilitated broader code sharing and developer engagement.[159] Key strategies include upstream contributions to OSS projects, where firms invest engineering resources to influence core technologies underpinning their products, as exemplified by Google's annual OSS efforts in 2024, which supported infrastructure like Kubernetes and Android while advancing AI and cloud services through community-driven improvements.[160] Similarly, IBM's 2019 acquisition of Red Hat for $34 billion—the largest OSS-related deal to date—bolstered its hybrid cloud strategy by integrating Red Hat's enterprise Linux distributions and OpenShift platform, allowing IBM to offer certified, supported OSS stacks that generate revenue via subscriptions without altering upstream codebases.[161] These moves prioritize "upstream-first" development, ensuring corporate modifications feed back into communal repositories to avoid forking fragmentation and maintain vendor neutrality.[162] Another prevalent tactic is establishing formal OSS programs to manage risks and maximize returns, including compliance auditing, contributor incentives, and strategic participation in foundations like the Linux Foundation, which guides firms in linking OSS usage to goals like talent recruitment and supply chain resilience.[163] For instance, Microsoft's OSS program, formalized post-GitHub acquisition, enforces license adherence while enabling engineers to upstream code, yielding benefits in code reuse and reduced redevelopment—evident in projects like .NET Core released under MIT licensing in 2016.[164] However, such strategies can introduce corporate sway over project directions, as noted in critiques of enmeshed interests where dominant contributors like hyperscalers prioritize proprietary extensions over pure community governance.[165] Despite this, empirical outcomes show accelerated innovation, with contributors reporting higher motivation and faster feature delivery in OSS-reliant stacks.[166]| Company | Key OSS Strategy | Notable Example | Outcome |
|---|---|---|---|
| Upstream contributions to ecosystems | Kubernetes co-founding (2014); AI/ML tooling | Enhanced cloud dominance; widespread adoption in enterprise infra[167] | |
| IBM/Red Hat | Acquisition and enterprise hardening | $34B Red Hat buy (2019) | Hybrid cloud revenue growth; maintained OSS upstream model[168] |
| Microsoft | Platform acquisition and program integration | GitHub purchase (2018); .NET open-sourcing | Shift to "open by default"; top GitHub contributor status[159] |
Government and Institutional Use
Various governments have adopted policies promoting open-source software (OSS) to reduce dependency on proprietary vendors, enhance security through community scrutiny, and achieve cost efficiencies. In the United States, the Federal Source Code Policy, established under the Office of Management and Budget, mandates that agencies release at least 20% of new custom-developed source code as OSS annually to foster reuse and innovation.[169] The General Services Administration (GSA) pursues a "open first" approach, targeting 100% OSS for its codebases, as outlined in its OSS policy updated in recent years.[170] Similarly, the Centers for Medicare & Medicaid Services (CMS) maintains a policy governed by its Technology Review Board for OSS adoption in frameworks and solutions.[171] The Department of Defense provides guidance via its OSS FAQ, affirming legal permissibility for use in non-classified systems provided compliance with licenses.[172] In Europe, the European Commission implemented its OSS Strategy for 2020-2023, guided by six principles: think open by default, transform public services through OSS, share code, contribute to communities, secure software via open collaboration, and maintain control over key technologies.[173] This strategy aligns with broader digital sovereignty goals, emphasizing OSS in ICT security and governance.[174] National examples include Norway's extensive use of OSS in public IT projects and Italy's active GitHub repositories for government code.[175] The U.S. Cybersecurity and Infrastructure Security Agency (CISA) released an OSS Security Roadmap in September 2023, focusing on visibility into usage, vulnerability prioritization, and community support to mitigate risks in critical infrastructure.[176] Institutional adoption extends to education and healthcare sectors. U.S. universities increasingly establish Open Source Program Offices (OSPOs) to coordinate OSS development and usage, supporting research and teaching tools.[177] Platforms like Moodle and Sakai serve as OSS learning management systems in higher education, enabling customization without vendor lock-in.[178] In healthcare, hospitals deploy OSS for electronic health records and informatics, such as open-source EHR systems in medical curricula, though integration requires addressing compliance and security hurdles.[179][180] These implementations yield economic benefits, with studies indicating up to 87% savings in scientific and technical tools adaptable to institutional needs, primarily through avoided licensing fees and collaborative maintenance.[85] Governments and institutions cite OSS for enabling rapid customization to public sector requirements, though empirical data on net savings varies by implementation scale.[181]Comparative Analysis
Versus Proprietary Software
Open-source software (OSS) differs fundamentally from proprietary software in its licensing model, which permits free access, modification, and redistribution of source code, contrasting with proprietary software's restrictions on usage, alteration, and distribution to protect intellectual property and generate revenue through licenses. This distinction influences development dynamics, where OSS relies on distributed volunteer and corporate contributors, while proprietary software typically involves centralized teams funded by sales. Empirical analyses indicate OSS adoption has surged, with 96% of organizations increasing or maintaining its use as of 2025, driven by its role in infrastructure like cloud computing and servers, though proprietary software retains dominance in consumer desktops and specialized enterprise tools.[182] In terms of cost, OSS eliminates licensing fees, yielding significant savings; a 2024 Harvard Business School study estimated the global value of OSS at $8.8 trillion if reproduced proprietarily, reflecting avoided development expenses for firms.[156] Enterprise surveys report cost reduction as the primary adoption driver, rising to 53% in 2025 from 37% the prior year, particularly in government sectors where 51.5% cite no-license-cost benefits.[107] [183] However, proprietary software often bundles maintenance and updates into licensing, potentially lowering total ownership costs in scenarios requiring minimal customization, whereas OSS demands internal expertise or third-party support, which a 2025 cost-benefit analysis found can offset savings in high-complexity deployments if not managed efficiently.[184] Security comparisons reveal no universal superiority, as both models exhibit vulnerabilities influenced by code complexity and scrutiny levels. OSS benefits from "many eyes" enabling rapid community patches, with widely adopted projects showing fewer persistent bugs due to diverse auditing; this auditability makes OSS particularly preferred in privacy-sensitive scenarios, where public code enables community inspection for potential backdoors or surveillance features that cannot be verified in proprietary software. For instance, transparency facilitates proactive vulnerability disclosure.[185] [186] [187] Proprietary software leverages code obscurity and dedicated security teams for controlled fixes, sometimes deploying updates faster in vendor ecosystems, though this can delay public awareness of flaws.[188] Analyses of breaches, such as the 2021 Log4Shell in OSS Apache Log4j versus proprietary incidents like SolarWinds, underscore that OSS risks stem from supply chain dependencies and uneven maintenance, while proprietary risks arise from single-vendor failures, with empirical metrics like mean time to patch varying by project maturity rather than model alone.[189] On innovation and development speed, OSS fosters accelerated feature iteration through collaborative models, often incorporating cutting-edge advancements ahead of proprietary counterparts; a 2024 study noted OSS's private-collective approach enables firms to leverage community R&D, reducing solo innovation costs. [190] Frequent releases in popular OSS repositories correlate with higher user engagement, contrasting proprietary cycles constrained by profit-driven roadmaps and testing regimes.[191] Yet, proprietary software can achieve focused reliability in niche domains via proprietary algorithms, and evidence from software complexity studies shows OSS may incur higher quality assurance costs in fragmented ecosystems compared to streamlined proprietary builds.[189] Flexibility represents a core OSS advantage, allowing customization without vendor approval, mitigating lock-in risks evident in proprietary migrations like Oracle database shifts, which have prompted enterprise turnarounds to OSS alternatives.[192] Proprietary software, while offering seamless integration within ecosystems (e.g., Microsoft Office suite), enforces terms that limit interoperability, potentially increasing switching costs estimated at 20-30% of annual IT budgets in locked environments.[193] Overall, selection depends on use case: OSS excels in scalable, modifiable infrastructures like Linux servers powering 96.4% of top websites as of 2024, whereas proprietary suits standardized, support-reliant operations.[194]| Aspect | Open-Source Software | Proprietary Software |
|---|---|---|
| Cost Structure | No upfront licenses; savings up to 87% in tools per 2020 review, but integration expenses vary | Licensing fees offset by bundled support; predictable but higher for scale |
| Security Model | Community scrutiny accelerates fixes; risks from unpatched dependencies | Obscurity and vendor patches; single-point failures possible |
| Innovation Pace | Collaborative, rapid releases; cutting-edge via shared R&D | Controlled, roadmap-driven; excels in proprietary IP niches |
| Flexibility | Full modifiability; avoids lock-in | Limited changes; ecosystem integration strengths |
Versus Free Software Ideology
The free software movement, initiated by Richard Stallman in 1983 with the GNU Project, prioritizes an ethical framework centered on four essential freedoms: to run the program as desired, to study and modify it, to redistribute copies, and to distribute modified versions. This ideology views proprietary software as morally wrong because it imposes restrictions on users' control, advocating copyleft licenses like the GNU General Public License (GPL, first released in 1989) to ensure these freedoms propagate to derivative works. In contrast, the open-source software paradigm, formalized in 1998 by the Open Source Initiative (OSI), emphasizes pragmatic advantages such as accelerated development through collaborative access to source code, improved reliability via peer review, and economic efficiency, without mandating an ethical stance against proprietary elements.[32] Stallman has critiqued the open-source label for diluting the focus on user autonomy, arguing it promotes software merely for its practical benefits—such as faster innovation and lower costs—while sidestepping the principled opposition to non-free software that could undermine freedoms in downstream uses.[10] For instance, permissive open-source licenses approved by OSI, like the MIT License (dating to 1988) and Apache License 2.0 (2004), allow recipients to incorporate code into proprietary products without reciprocal source disclosure, a practice Stallman contends erodes the goal of universal software freedom.[126] Eric S. Raymond, a key OSI co-founder, countered in his 1998 essay that "open source" was deliberately chosen to appeal to businesses wary of the ideological connotations of "free software," facilitating events like Netscape's source code release under a Mozilla Public License variant in 1998, which spurred broader industry adoption.[195] Empirically, the open-source approach has correlated with greater commercial integration; by 2023, over 90% of Fortune 500 companies reportedly used open-source components, often via permissive licenses enabling hybrid models, whereas strict free software adherence remains dominant in niches like embedded systems requiring GPL enforcement. This divergence manifests in license preferences: the GPL family accounted for about 27% of GitHub repositories in 2022, while permissive licenses like MIT held around 45%, reflecting open-source's flexibility in fostering ecosystems like Kubernetes (Apache-licensed since 2014). Stallman maintains that such pragmatism risks a "bazaar" of code where freedoms are optional, potentially leading to user lock-in via non-free extensions, though open-source proponents cite evidence of superior outcomes, such as Linux's kernel growth from 1.0 in 1994 to over 30 million lines by 2023, driven by voluntary contributions unbound by ideology.[12]Versus Source-Available Models
Open-source software adheres to the Open Source Initiative's (OSI) Open Source Definition, which mandates freedoms including free redistribution (with or without modifications), availability of source code, allowance for derived works, and non-discrimination against any person, group, field of endeavor, or technology.[1] These criteria ensure users can study, modify, and distribute the software commercially or otherwise without vendor-imposed restrictions. In contrast, source-available models provide access to source code but under licenses that fail OSI approval, often imposing limits such as prohibitions on redistribution in cloud services, commercial competition, or modifications for certain uses.[13] The primary divergence lies in permissible uses and ecosystem dynamics. Open-source licenses like the Apache License 2.0 or GNU General Public License (GPL) enable unrestricted forking, commercial exploitation, and integration into proprietary products, fostering widespread adoption and innovation through community contributions.[1] Source-available licenses, such as the Business Source License (BSL) or Redis Source Available License (RSALv2), typically convert to open-source after a delay (e.g., four years in BSL) or restrict "as-a-service" offerings to prevent competitors from profiting without contributing.[13][196] This allows vendors to monetize via hosted services while sharing code for transparency and custom integrations, but it curtails the full collaborative potential of open source.| Aspect | Open-Source Models | Source-Available Models |
|---|---|---|
| License Compliance | Meets OSI's 10 criteria for freedoms | Provides code access but restricts freedoms |
| Redistribution | Allowed, including modified binaries | Often prohibited or limited (e.g., no SaaS) |
| Commercial Use | Unrestricted | Frequently barred for competitors |
| Community Forking | Encouraged, leading to alternatives | Discouraged, risking vendor control loss |
| Monetization Strategy | Relies on services, dual-licensing, support | Protects core IP via service exclusivity |
Adoption and Impact
Widespread Implementation
Open-source software underpins much of the global digital infrastructure, particularly in server environments where Linux distributions hold dominant positions. As of 2025, Linux operates approximately 96.3% of the top one million web servers and powers 100% of the world's top 500 supercomputers, enabling high-performance computing for scientific simulations, weather modeling, and AI training.[200] These implementations leverage the Linux kernel's modularity and community-driven optimizations, which facilitate scalability and cost efficiency compared to proprietary alternatives. In cloud computing, open-source tools such as Kubernetes orchestrate containerized workloads for hyperscalers like AWS and Google Cloud, with over 96% of enterprises reporting increased or maintained reliance on open-source components for hybrid and multi-cloud deployments.[4][201] Enterprise adoption reflects broad integration across industries, driven by economic incentives like reduced licensing costs and enhanced customizability. Surveys indicate that 96% of organizations either expanded or sustained their use of open-source software in 2025, with significant growth in AI and data infrastructure applications where tools like TensorFlow and Apache Kafka process petabytes of data daily.[4] Mobile ecosystems exemplify this reach, as the Android operating system—built on the open-source Linux kernel—powers over 3 billion active devices globally, supporting app development via frameworks like React Native.[202] In databases and web services, open-source solutions such as PostgreSQL and Nginx command substantial market shares, handling transactions for e-commerce giants and financial institutions with proven reliability under high loads. Governments have increasingly implemented open-source software to promote transparency and interoperability, often mandating its use in public procurement. In the United States, federal agencies deploy open-source projects via Code.gov, including analytics platforms like analytics.usa.gov for real-time data visualization and CMS-hosted repositories for healthcare modules.[170][171] European Union enterprises, representing a proxy for institutional adoption, show 45.2% utilizing cloud services—predominantly open-source underpinned—for email, storage, and office applications as of 2023 data extended into policy frameworks.[203] These deployments prioritize vendor neutrality, as evidenced by policies in over 65% of global government initiatives favoring open-source to mitigate lock-in risks.[204] Despite desktop market shares remaining modest at around 4-6% for Linux variants, server-side and embedded implementations underscore open-source software's foundational role in resilient, distributed systems.[205][202]Prominent Projects and Ecosystems
The Linux kernel, first released by Linus Torvalds on September 17, 1991, underpins major operating systems like Ubuntu and Fedora, powering approximately 96% of the world's top one million web servers as of 2024 surveys. Its ecosystem encompasses thousands of distributions, with over 500 active variants tracked by DistroWatch, fostering widespread adoption in servers, embedded systems, and supercomputers—running on 100% of the top 500 supercomputers per TOP500 lists. The kernel's modular design enables contributions from corporations like Intel and Red Hat, amassing over 20,000 contributors by 2023. Git, developed by Torvalds in April 2005 as a distributed version control system, revolutionized software development workflows and hosts repositories for billions of code commits on platforms like GitHub, which reported 120 million repositories in 2024. Its lightweight branching and merging capabilities support ecosystems like DevOps pipelines, with integrations in tools from Jenkins to GitLab, enabling collaborative development at scale across projects with millions of stars.[206] In web infrastructure, the Apache HTTP Server, launched in 1995 by the Apache Software Foundation, handles over 30% of global websites, forming the backbone of the LAMP stack (Linux, Apache, MySQL, PHP/Python/Perl) that dominated server deployments through the 2010s.[207] MySQL, originating in 1995 and now stewarded by Oracle with community editions, processes queries for platforms like Facebook and Twitter, with over 10 million active installations reported in enterprise audits. These components interlink in ecosystems supporting e-commerce and content management, such as WordPress, which relies on PHP and MySQL for 43% of websites. Cloud-native ecosystems, coordinated by the Cloud Native Computing Foundation (CNCF) under the Linux Foundation, feature Kubernetes as a flagship orchestrator with the largest contributor base among open-source projects, exceeding 5,000 active participants in 2025 mid-year metrics.[208] Graduated CNCF projects like Prometheus for monitoring and Envoy for service proxies underpin microservices architectures adopted by 70% of Fortune 500 companies for container management. This ecosystem emphasizes portability and scalability, with OpenTelemetry gaining traction for observability, recording the second-highest development velocity in 2025.[208] Machine learning frameworks exemplify specialized ecosystems, with TensorFlow, released by Google in November 2015, facilitating model training on datasets for applications from image recognition to natural language processing, amassing over 180,000 GitHub stars and integrations in production systems at scale.[209] PyTorch, developed by Meta AI in 2016, complements it with dynamic computation graphs, powering research cited in thousands of academic papers annually and adopted in 60% of AI surveys for flexibility. These tools foster communities around Jupyter notebooks and Hugging Face hubs, aggregating models under permissive licenses for collaborative advancement.| Project | Foundation/Ecosystem | Key Metric (as of 2025) |
|---|---|---|
| Linux Kernel | Linux Foundation | Powers 100% TOP500 supercomputers |
| Kubernetes | CNCF | Largest contributor base (>5,000 active)[208] |
| Apache HTTP Server | Apache Software Foundation | Serves 30%+ of websites[207] |
| Git | Independent (Linux Foundation affiliate) | Hosts 120M+ GitHub repos |
| TensorFlow | Independent (Google origins) | 180K+ GitHub stars[209] |
