Foundation model

In artificial intelligence, a foundation model (FM), also known as large x model (LxM, where "x" is a variable representing any text, image, sound, etc.), is a machine learning or deep learning model trained on vast datasets so that it can be applied across a wide range of use cases.^[1] Generative AI applications like large language models (LLM) are common examples of foundation models.^[1]

Building foundation models is often highly resource-intensive, with the most advanced models costing hundreds of millions of dollars to cover the expenses of acquiring, curating, and processing massive datasets, as well as the compute power required for training.^[2] These costs stem from the need for sophisticated infrastructure, extended training times, and advanced hardware, such as GPUs. In contrast, adapting an existing foundation model for a specific task or using it directly is far less costly, as it leverages pre-trained capabilities and typically requires only fine-tuning on smaller, task-specific datasets.

Early examples of foundation models are language models like OpenAI's GPT series and Google's BERT.^[3]^[4] Beyond text, foundation models have been developed across a range of modalities—including DALL-E, Stable diffusion, and Flamingo^[5] for images, MusicGen^[6] and LLark^[7] for music, and RT-2^[8] for robotic control. Foundation models are also being developed for fields like astronomy,^[9] radiology,^[10] genomics,^[11] coding,^[12] times-series forecasting,^[13] mathematics,^[14] and chemistry.^[15]

Definitions

The Stanford Institute for Human-Centered Artificial Intelligence's (HAI) Center for Research on Foundation Models (CRFM) coined the term "foundation model" in August 2021^[16] to mean "any model that is trained on broad data (generally using self-supervision at scale) that can be adapted (e.g., fine-tuned) to a wide range of downstream tasks".^[17] This was based on their observation that preexisting terms, while overlapping, were not adequate, stating that "'large language model' was too narrow given the focus is not only language; 'self-supervised model' was too specific to the training objective; and 'pretrained model' suggested that the noteworthy action all happened after 'pretraining."^[18] The term "foundation model" was chosen over "foundational model"^[19] because "foundational" implies that these models provide fundamental principles in a way that "foundation" does not.^[20] The term vision-language model (VLM) is also used as a near-synonym.

As governments regulate foundation models, new legal definitions have emerged.

In the United States, the Executive Order on the Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence defines a foundation model as "an AI model that is trained on broad data; generally uses self-supervision; contains at least tens of billions of parameters; is applicable across a wide range of contexts".^[21]
In the United States, the proposed AI Foundation Model Transparency Act of 2023^[22] by House Representatives Don Beyer (D–VA) and Anna Eshoo (D, CA) defines a foundation model as "an artificial intelligence model trained on broad data, generally uses self supervision, generally contains at least 1,000,000,000 parameters, is applicable across a wide range of contexts, and exhibits, or could be easily modified to exhibit, high levels of performance at tasks that could pose a serious risk to security, national economic security, national public health or safety, or any combination of those matters."
In the European Union, the European Parliament's negotiated position on the E.U. AI Act defines a foundation model as an "AI model that is trained on broad data at scale, is designed for generality of output, and can be adapted to a wide range of distinctive tasks".
In the United Kingdom, the Competition and Markets Authority's AI Foundation Models: Initial Report^[1] defines foundations model as "a type of AI technology that are trained on vast amounts of data that can be adapted to a wide range of tasks and operations."

The United States's definitions are the only ones to make reference to the size of a foundation model, and differ on magnitude. Beyer and Eshoo's definition also specifies that foundation models must achieve a level of performance as to be a potential danger. In contrast, the E.U. definition requires the model to be designed for generality of output. All definitions agree that foundation models must be trained on a broad range of data with potential applications in many domains.

History

Technologically, foundation models are built using established machine learning techniques like deep neural networks, transfer learning, and self-supervised learning. Foundation models differ from previous techniques as they are general purpose models that function as a reusable infrastructure, instead of bespoke and one-off task-specific models.

Advances in computer parallelism (e.g., CUDA GPUs) and new developments in neural network architecture (e.g., Transformers), and the increased use of training data with minimal supervision all contributed to the rise of foundation models. Foundation models began to materialize as the latest wave of deep learning models in the late 2010s.^[23] Relative to most prior work on deep learning, these language models demonstrated the potential of training on much larger web-sourced datasets using self-supervised objectives (e.g. predicting the next word in a large corpus of text). These approaches, which draw upon earlier works like word2vec and GloVe, deviated from prior supervised approaches that required annotated data (e.g. crowd-sourced labels).

The 2022 releases of Stable Diffusion and ChatGPT (initially powered by the GPT-3.5 model) led to foundation models and generative AI entering widespread public discourse. Further, releases of LLaMA, Llama 2, and Mistral in 2023 contributed to a greater emphasis placed on how foundation models are released with open foundation models garnering a lot of support^[24] and scrutiny.^[25]

Related concepts

Frontier models

Certain highly advanced foundation models are termed "frontier models", which have the potential to "possess dangerous capabilities sufficient to pose severe risks to public safety."^[26] These "dangerous capabilities" stem from the accidental or intentional misuse of such models, which in conjunction with their powerful nature can lead to severe harms. As foundation models continue to improve, some AI researchers speculate that almost all next-generation foundation models will be considered frontier models.

Since the concept of dangerous capabilities is inherently subjective, there is no strict designation for what foundation models qualify as frontier models. However, some generally held ideas for sufficiently dangerous capabilities include:

Designing and synthesizing new biological or chemical weapons^[27]
Producing and propagating convincing, tailored disinformation with minimal user instruction^[28]
Harnessing unprecedented offensive cyber capabilities^[29]
Evading human control through deceptive means^[30]

Due to frontier models' unique capabilities, it is difficult to effectively regulate their development and deployment. Because of their emergent nature, new dangerous capabilities can appear on their own in frontier models, both in the development stage and after being deployed.^[26] Additionally, since frontier models continue to adapt after deployment, it remains difficult to mitigate all harms that arise from already-deployed models. If a frontier model happens to be open-source or is released online, the model can also disseminate rapidly, further hampering regulators by creating a lack of accountability.

General-purpose AI

Due to their adaptability to a wide range of use-cases, foundation models are sometimes considered to be examples of general-purpose AI. In designing the EU AI Act, the European Parliament has stated that a new wave of general-purpose AI technologies shapes the overall AI ecosystem.^[31] The fuller structure of the ecosystem, in addition to the properties of specific general-purpose AI systems, influences the design of AI policy and research.^[32] General-purpose AI systems also often appear in people's everyday lives through applications and tools like ChatGPT or DALL-E.

Government agencies like EU Parliament have identified regulation of general-purpose AI, such as foundation models, to be a high priority. General-purpose AI systems are often characterized by large size, opacity, and potential for emergence, all of which can create unintended harms. Such systems also heavily influence downstream applications, which further exacerbates the need for regulation. In regards to prominent legislation, a number of stakeholders have pushed for the EU AI Act to include restrictions on general-purpose AI systems, all of which would also apply to foundation models.

World models

World models are sometimes described as foundation models.^[33]^[34] World models are a representation of an environment intended to predict the state of that environment after taking a set of actions,^[35]^[36] as well as to implicitly model physical concepts such as gravity.^[36] Input prompts for world models can include text or images,^[37]^[38] as well as videos or 3D scenes,^[39] and the resulting 3D environments can be exported.^[39] World models, alongside embodied AI, multi-agent models, and neuroscience models of the brain, are seen as alternatives to large language models for achieving general artificial intelligence.^[40]

World models do not have a fully agreed definition, but have been divided into two scopes: one for representing and understanding the current environment, and another for predicting the future state of that environment. In the former view, world models are developed using model-based reinforcement learning and a Markov decision process, using model predictive control or Monte Carlo tree search to create policies. With the latter, (multimodal) large language models or video generation models can be used. In addition, these environments can be immersive simulations for training AI agents that can interact in the real world.^[41]

History

Quanta Magazine traced world models back to a 1943 publication by Kenneth Craik on mental models and the blocks world of SHRDLU in the 1960s.^[42] Business Insider traced world models to a 1971 paper by Jay Wright Forrester.^[40] A related idea of organizing world knowledge, the frame representation, was proposed by Marvin Minsky in 1974.^[41]

In 2018, researchers David Ha and Jürgen Schmidhuber defined world models in the context of reinforcement learning: an agent with a variational autoencoder model V for representing visual observations, a recurrent neural network model M for representing memory, and a linear model C for making decisions. They suggested that agents trained on world models in environments that simulate reality could be applied to real world settings.^[43]

In 2022, Yann LeCun saw a world model (defined by him as a neural network that acts as a mental model for aspects of the world that are seen as relevant) as part of a larger system of cognitive architecture – other neural networks that are analogous to different regions of the brain. In his view, this framework could lead to commonsense reasoning.^[44]^[45] LeCun has estimated that world models would be fully functional by the late 2020s^[46] to mid 2030s.^[47]

Training

World models are trained on a variety of data modalities, including text, images, audio and video, and have been applied to video generation.^[48] One open source dataset for world models includes 1 billion data points across multiple modalities (text, images, audio, video and point clouds), including 1 million manual annotations.^[36]

Examples

TechCrunch saw Sora as an example of a world model,^[48] while in January 2025, Nvidia released its own set of world models.^[49]^[34] The South China Morning Post wrote that Manycore Tech was another example of companies aiming to build a world model, viewing their work as an example of spatial intelligence.^[50] In May 2025, Mohamed bin Zayed University of Artificial Intelligence released a world model for building simulations to test AI agents.^[51]

Google DeepMind has also released two world models in two-dimensional space and three-dimensional space, respectively, that were trained on video data, with Google claiming that the latter can be a training environment for AI agents.^[52]^[53] Meta released a world model in June 2025,^[54] Tencent released an open source world model in July 2025.^[55] Niantic, Inc. spinoff, Niantic Spatial, is developing a world model using anonymized player scans from Pokémon GO.^[56]^[57] Other companies that are planning as of 2025 to build world models include ByteDance^[55] and xAI.^[58]

Applications

Fei-Fei Li views world models as applying to robotics and creative works. Due to the complexity of these models, she advocates for more complex strategies in data acquisition, data engineering, data processing, and synthesizing data.^[59] She co-founded a startup on building world models, which, as of 2024, planned to do so in three phases: incorporating an understanding of three-dimensional space along with time; support for augmented reality; and support for robotics.^[60] Her startup, World Labs, released its commercial world model, Marble, in November 2025.^[61]

World models are intended for use in interactive media (such as video games and movies^[62]) and environment simulation.^[63] Proposed use cases for world models include action planning and outcome prediction.^[61] Other applications include social simulacra to simulate social systems.^[41] Wired compared world models to the metaverse,^[60] while Business Insider noted possible military applications.^[59]

In 2025, world models are being applied to drone warfare, robotics, and self-driving vehicles. The Wall Street Journal speculated that world models could improve spatial reasoning of artificial intelligence models and successfully automate both blue-collar and white-collar jobs.^[64] As of October 2025, research has shown mixed results in the spatial reasoning capabilities of text-to-video models (in particular, Veo 3).^[65]

Concerns

TechCrunch noted that world models could use more data than large language models and would require significantly more computational power (including the use of thousands of GPUs for training and inference).^[45]^[48] It also noted the risk of hallucinations, coverage bias and algorithmic bias.^[48] Similarly, The Financial Times noted the difficulty and expense in collecting data to simulate the world and training models to use that data.^[58]

Creative professionals have expressed concern that world models could disrupt jobs in their industries.^[63]

Other concerns include data privacy,^[41] simulation of harmful situations,^[41] misinformation and disinformation,^[41] emergent behaviors,^[66] and copyright.^[62]

Technical details

Modeling

For a foundation model to effectively generalize, it must acquire rich representations of the training data. As a result, expressive model architectures that efficiently process large-scale data are often preferred in building foundation models.^[17] Currently, the Transformer architecture is the de facto choice for building foundation models across a range of modalities.^[67]

Training

Foundation models are built by optimizing a training objective(s), which is a mathematical function that determines how model parameters are updated based on model predictions on training data.^[68] Language models are often trained with a next-tokens prediction objective, which refers to the extent at which the model is able to predict the next token in a sequence. Image models are commonly trained with contrastive learning or diffusion training objectives. For contrastive learning, images are randomly augmented before being evaluated on the resulting similarity of the model's representations. For diffusion models, images are noised and the model learns to gradually de-noise via the objective. Multimodal training objectives also exist, with some separating images and text during training, while others examine them concurrently.^[69] In general, the training objectives for foundation models promote the learning of broadly useful representations of data.

With the rise of foundation models and the larger datasets that power them, a training objective must be able to parse through internet-scale data for meaningful data points. Additionally, since foundation models are designed to solve a general range of tasks, training objectives ought to be domain complete, or able to solve a broad set of downstream capabilities within the given domain. Lastly, foundation model training objectives should seek to scale well and be computationally efficient. With model size and compute power both being relevant constraints, a training objective must be able to overcome such bottlenecks.

Data

Foundation models are trained on a large quantity of data, working under the maxim "the more data, the better."^[70] Performance evaluation does show that more data generally leads to better performance, but other issues arise as data quantity grows. Tasks like managing the dataset, integrating data across new applications, ensuring adherence to data licenses, and maintaining data quality all become more difficult as data size grows. The specific demands of foundation models have only exacerbated such issues, as it remains the norm for large foundation models to use public web-scraped data. Foundation models include also search engines data and SEO meta tags data. Public web data remains a plentiful resource, but it also demands stringent moderation and data processing from foundation model developers before it can be successfully integrated into the training pipeline.^[71]

Training foundation models often runs the risk of violating user privacy, as private data can be disclosed, collected, or used in ways beyond the stated scope. Even if no private data is leaked, models can still inadvertently compromise security through learned behavior in the resulting foundation model.^[72] Data quality is another key point, as web-scraped data frequently contains biased, duplicate, and toxic material. Once foundation models are deployed, ensuring high-quality data is still an issue, as undesirable behavior can still emerge from small subsets of data.

Systems

The size of foundation models also brings about issues with the computer systems they run on. The average foundation model is too large to be run within a single accelerator's memory and the initial training process requires an expensive amount of resources.^[73] Such issues are predicted to further exacerbate in future as foundation models grow to new heights. Due to this constraint, researchers have begun looking into compressing model size through tight model inference.

GPUs are the most common choice of compute hardware for machine learning, due to high memory storage and strong power. Typical foundation model training requires many GPUs, all connected in parallel with fast interconnects. Acquiring a sufficient amount of GPUs of requisite compute efficiency is a challenge for many foundation model developers, one that has led to an increasing dilemma in the field. Larger models require greater compute power, but often at the cost of improved compute efficiency. Since training remains time-consuming and expensive, the tradeoff between compute power and compute efficiency has led only a few select companies to afford the production costs for large, state of the art foundation models. Some techniques like compression and distillation can make inference more affordable, but they fail to completely shore up this weakness.

Scaling

The accuracy and capabilities of foundation models often scale predictably with the size of the model and the amount of the training data. Specifically, scaling laws have been discovered, which are data-based empirical trends that relate resources (data, model size, compute usage) to model capabilities. Particularly, a model's scale is defined by compute, dataset size, and the number of parameters, all of which exhibit a power-law relationship with end performance.

However, broken scaling laws^[74] have been discovered in which this relationship smoothly transitions (at points referred to as break(s)) from a power law with one exponent to a power law with another (different) exponent. When one does not collect any points near (or after) the break(s), it can be difficult to obtain an accurate extrapolation.

Adaptation

Foundation models are inherently multi-purpose: to use these model for a specific use case requires some form of adaptation. At a minimum, models need to be adapted to perform the task of interest (task specification), but often better performance can be achieved by more extensive adaptation to the domain of interest (domain specialization).

A variety of methods (e.g. prompting, in-context learning, fine-tuning, LoRA) provide different tradeoffs between the costs of adaptation and the extent to which models are specialized. Some major facets to consider when adapting a foundation model are compute budget and data availability. Foundation models can be very large, up to trillions of parameters in size, so adapting the entirety of a foundation model can be computationally expensive. Therefore, developers sometimes adapt only the last neural layer or only the bias vectors to save time and space.^[75] For particularly niche applications, specific data may also not be available to adapt the foundation model sufficiently. In such circumstances, data must be manually labeled, which is costly and can demand expert knowledge.

Evaluation

Evaluation is a key part of developing foundation models. Not only does evaluation allow for tracking progress of high-performance models, it also creates benchmarks for future model development. Stakeholders rely on evaluations to understand model behaviors and gain insight into their various attributes. Traditionally, foundation models are evaluated relative to each other through standardized task benchmarks like MMLU,^[76] MMMU,^[77] HumanEval,^[78] and GSM8K.^[79] Given that foundation models are multi-purpose, increasingly meta-benchmarks are developed that aggregate different underlying benchmarks. Examples include LM-Harness,^[80] BIG-Bench,^[81] HELM,^[82] OpenLLM Leaderboard,^[83] DecodingTrust,^[84] and HEIM.^[85]

Since foundation models' utility depends on their own general capabilities and the performance of fine-tuned applications, evaluation must cover both metrics. Proper evaluation examines both a foundation model's downstream applications in aggregate and the direct properties the foundation model holds. To ensure further equity in evaluation, certain existing evaluation frameworks account for all adaptation resources, which leads to more informed analyses for the benefit of all stakeholders.^[86]

Supply chain

Foundation models' general capabilities allow them to fulfill a unique role in the AI ecosystem,^[87] fueled by many upstream and downstream technologies.^[1] Training a foundation model requires several resources (e.g. data, compute, labor, hardware, code), with foundation models often involving immense amounts of data and compute (also referred to as computational power). Due to foundation models' large development costs and inexpensive adaptation requirements, the AI landscape has shifted to a small subset of AI companies making foundation models for downstream adaptation.^[88] Thus, most foundation model companies outsource this step to specialized data providers (e.g. Scale AI,^[89] Surge^[90]) and compute providers (e.g. Amazon Bedrock, Google Cloud, Microsoft Azure).

The foundation model developer itself will then take the data and use the supplied compute to actually train the foundation model. After the foundation model is completely built, much of the data and labor requirements abate. In this development process, hardware and compute are the most necessary, and also the most exclusive resources. To train larger and more complex AI, a sufficient amount of compute is key. However, compute is consolidated in the hands of a few, select entities, which most foundation model developers depend on. As such, the foundation model pipeline is concentrated heavily around these providers. Compute is also costly; in 2023, AI companies spent more than 80% of total capital on compute resources.^[92]

Foundation models require a large amount of general data to power their capabilities. Early foundation models scraped from subsets of the internet to provide this data information. As the size and scope of foundation models grows, larger quantities of internet scraping becomes necessary, resulting in higher likelihoods of biased or toxic data. This toxic or biased data can disproportionately harm marginalized groups and exacerbate existing prejudices.^[93]

To address this issue of low-quality data that arose with unsupervised training, some foundation model developers have turned to manual filtering. This practice, known as data labor, comes with its own host of issues.^[94] Such manual data detoxification is often outsourced to reduce labor costs, with some workers making less than $2 per hour.^[95]

The foundation model will then be hosted online either via the developer or via an external organization. Once released, other parties can create applications based on the foundation model, whether through fine-tuning or wholly new purposes. People can then access these applications to serve their various means, allowing one foundation model to power and reach a wide audience.

Release strategies

After a foundation model is built, it can be released in one of many ways. There are many facets to a release: the asset itself, who has access, how access changes over time, and the conditions on use.^[96] All these factors contribute to how a foundation model will affect downstream applications.^[97] In particular, the two most common forms of foundation model release are through APIs and direct model downloads.

When a model is released via an API, users can query the model and receive responses, but cannot directly access the model itself. Comparatively, the model could be directly downloadable for users to access and modify. Both release strategies are often classified as an open release. The exact definition of an open release is disputed, but widely accepted requirements are provided by the Open Source Initiative.

Some open foundation models are: PaLM 2, Llama 2, Granite, and Mistral. While open foundation models can further research and development more easily, they are also more susceptible to misuse. Open foundation models can be downloaded by anyone, and particularly powerful models can be fine-tuned to intentionally or unintentionally cause harm.^{[citation needed]}

During a closed release, the foundation model cannot be accessed by the public, but is used internally by an organization. Such releases are considered safer, but offer no additional value to the research community or the public at large.

Some foundation models like Google DeepMind's Flamingo^[98] are fully closed, meaning they are available only to the model developer; others, such as OpenAI's GPT-4, are limited access, available to the public but only as a black box; and still others, such as Meta's Llama 2 are open, with broadly available model weights enabling downstream modification and scrutiny.

Practices and applications

In practice, foundation models are often embedded into everyday software workflows as general-purpose services for drafting and summarizing text, answering questions, and generating structured output. Microsoft's documentation for Microsoft 365 Copilot describes the system as coordinating large language models to "understand, summarize, predict, and generate content" across applications such as Word, Excel, Outlook, and Teams.^[99] In customer support operations, IBM has described using foundation models for automatic call summarization and topic extraction to update customer-relationship-management (CRM) systems and reduce pre- and post-call workload.^[100]

Organizations also adapt foundation models to specialized domains in response to constraints such as cost, latency, and governance of sensitive data. In 2024, NeuralFabric co-founder John deVadoss argued that "Foundation models are the new applications," describing domain-specific foundation models as a new metaphor for software and emphasizing issues such as data sovereignty and the cost of training and inference in enterprise deployments.^[101]^[102]

Software development is another prominent application area, where foundation models are used for code generation, refactoring, and multi-step "agentic" assistance. IEEE Spectrum described a competitive market of AI coding tools in which AI-first integrated development environments (IDEs) such as Cursor (a fork of Visual Studio Code) and model-provider tools such as Anthropic's Claude Code both seek to become central to developer workflows.^[103] Reporting on Anthropic's Claude Code noted that Anthropic's models already powered third-party coding tools such as Cursor, while describing Claude Code as an "agentic" tool able to search and read code, edit files, write and run tests, and interact with version-control and command-line tooling.^[104] Anthropic later released a native Visual Studio Code extension for Claude Code, further integrating a first-party model-provider tool into the IDE environment and overlapping with capabilities offered by standalone AI-first editors.^[105]

Claude Code is an agentic coding assistant developed by Anthropic that runs in a command-line interface and is designed to help users carry out software-development tasks via natural-language instructions, including reading and editing project files and executing commands as part of a workflow.^[106]^[107] Comparable tools include GitHub Copilot, which provides code completion and chat-based assistance integrated into development environments and related tooling, as well as Google's Gemini Code Assist and AWS's Amazon Q Developer, which are positioned as generative-AI assistants that support multiple parts of the software development lifecycle.^[108]^[109]^[110] Such assistants have also been deployed through collaboration-platform integrations; for example, Anthropic introduced a Slack integration that routes tagged messages and thread context to Claude Code as a "research preview".^[111]

Although products like Claude Code are often described as programming tools, Anthropic has reported internal use cases that extend into adjacent knowledge-work domains, including automating routine data engineering and operational troubleshooting, and enabling finance staff to execute data workflows described in plain text; the same report describes usage across departments such as marketing and legal.^[112] Related model-based assistants have been integrated into mainstream productivity software, including Microsoft 365 Copilot and a Microsoft Excel worksheet function (COPILOT) that invokes an AI language model from within a cell formula for tasks such as summarization, while noting that outputs can be incorrect and are not intended for tasks requiring reproducible accuracy.^[113]^[114] Surveys and systematic reviews have discussed additional applications of foundation-model and large-language-model systems outside software engineering, including healthcare (e.g., diagnostics, personalized treatment, and operational efficiency, alongside privacy and bias concerns) and education (e.g., intelligent tutoring systems, alongside issues such as over-reliance, fairness, and privacy).^[115]^[116]

References

^ ^a ^b ^c ^d Competition and Markets Authority (18 September 2023). "AI Foundation Models: Initial Report" (PDF). gov.uk.
^ Nestor Maslej, Loredana Fattorini, Erik Brynjolfsson, John Etchemendy, Katrina Ligett, Terah Lyons, James Manyika, Helen Ngo, Juan Carlos Niebles, Vanessa Parli, Yoav Shoham, Russell Wald, Jack Clark, and Raymond Perrault, "The AI Index 2023 Annual Report," AI Index Steering Committee, Institute for Human-Centered AI, Stanford University, Stanford, California, April 2023.
^ Rogers, Anna; Kovaleva, Olga; Rumshisky, Anna (2020). "A Primer in BERTology: What we know about how BERT works". arXiv:2002.12327 [cs.CL].
^ Haddad, Mohammed. "How does GPT-4 work and how can you start using it in ChatGPT?". Al Jazeera. Retrieved 20 October 2024.
^ Tackling multiple tasks with a single visual language model, 28 April 2022, retrieved 13 June 2022
^ Copet, Jade; Kreuk, Felix; Gat, Itai; Remez, Tal; Kant, David; Synnaeve, Gabriel; Adi, Yossi; Défossez, Alexandre (7 November 2023). "Simple and Controllable Music Generation". arXiv:2306.05284 [cs.SD].
^ "LLark: A Multimodal Foundation Model for Music". Spotify Research. 13 October 2023. Retrieved 11 December 2023.
^ "Speaking robot: Our new AI model translates vision and language into robotic actions". Google. 28 July 2023. Retrieved 11 December 2023.
^ Nguyen, Tuan Dung; Ting, Yuan-Sen; Ciucă, Ioana; O'Neill, Charlie; Sun, Ze-Chang; Jabłońska, Maja; Kruk, Sandor; Perkowski, Ernest; Miller, Jack (12 September 2023). "AstroLLaMA: Towards Specialized Foundation Models in Astronomy". arXiv:2309.06126 [astro-ph.IM].
^ Tu, Tao; Azizi, Shekoofeh; Driess, Danny; Schaekermann, Mike; Amin, Mohamed; Chang, Pi-Chuan; Carroll, Andrew; Lau, Chuck; Tanno, Ryutaro (26 July 2023). "Towards Generalist Biomedical AI". arXiv:2307.14334 [cs.CL].
^ Zvyagin, Maxim; Brace, Alexander; Hippe, Kyle; Deng, Yuntian; Zhang, Bin; Bohorquez, Cindy Orozco; Clyde, Austin; Kale, Bharat; Perez-Rivera, Danilo (2023). "GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics". The International Journal of High Performance Computing Applications. 37 (6): 683–705. bioRxiv 10.1101/2022.10.10.511571. doi:10.1177/10943420231201154.
^ Li, Raymond; Allal, Loubna Ben; Zi, Yangtian; Muennighoff, Niklas; Kocetkov, Denis; Mou, Chenghao; Marone, Marc; Akiki, Christopher; Li, Jia (9 May 2023). "StarCoder: may the source be with you!". arXiv:2305.06161 [cs.CL].
^ Se, Ksenia; Spektor, Ian (5 April 2024). "Revolutionizing Time Series Forecasting: Interview with TimeGPT's creators". Turing Post. Retrieved 11 April 2024.
^ Azerbayev, Zhangir; Schoelkopf, Hailey; Paster, Keiran; Santos, Marco Dos; McAleer, Stephen; Jiang, Albert Q.; Deng, Jia; Biderman, Stella; Welleck, Sean (30 November 2023). "Llemma: An Open Language Model For Mathematics". arXiv:2310.10631 [cs.CL].
^ "Orbital". Archived from the original on 3 September 2024. Retrieved 5 September 2024.
^ "Introducing the Center for Research on Foundation Models (CRFM)". Stanford HAI. 18 August 2021. Retrieved 11 June 2022.
^ ^a ^b Bommasani, Rishi; et al. (18 August 2021). On the Opportunities and Risks of Foundation Models (Report). arXiv:2108.07258.
^ "Reflections on Foundation Models". Stanford HAI. 18 October 2021. Retrieved 22 May 2023.
^ Bommasani, Rishi; Liang, Percy (18 October 2021). "Reflections on Foundation Models". Stanford CRFM. Retrieved 11 December 2023.
^ Marcus, Gary (11 September 2021). "Has AI found a new Foundation?". The Gradient. Retrieved 11 December 2023.
^ "Executive Order on the Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence". The White House. 30 October 2023. Retrieved 12 February 2024.
^ "AI Foundation Model Transparency Act" (PDF).
^ Liang, Percy; Bommasani, Rishi; Lee, Tony; Tsipras, Dimitris; Soylu, Dilara; Yasunaga, Michihiro; Zhang, Yian; Narayanan, Deepak; Wu, Yuhuai (1 October 2023), "Holistic Evaluation of Language Models", Annals of the New York Academy of Sciences, 1525 (1): 140–146, arXiv:2211.09110, Bibcode:2023NYASA1525..140B, doi:10.1111/nyas.15007, PMID 37230490
^ "Joint Statement on AI Safety and Openness". Mozilla. 31 October 2023. Retrieved 12 February 2024.
^ "Hawley and Blumenthal Demand Answers from Meta, Warn of Misuse After 'Leak' of Meta's AI Model". Senator Josh Hawley. 6 June 2023. Retrieved 12 February 2024.
^ ^a ^b Anderljung, Markus; Barnhart, Joslyn; Korinek, Anton; Leung, Jade; O'Keefe, Cullen; Whittlestone, Jess; Avin, Shahar; Brundage, Miles; Bullock, Justin (7 November 2023), Frontier AI Regulation: Managing Emerging Risks to Public Safety, arXiv:2307.03718
^ Singhal, Karan; Azizi, Shekoofeh; Tu, Tao; Mahdavi, S. Sara; Wei, Jason; Chung, Hyung Won; Scales, Nathan; Tanwani, Ajay; Cole-Lewis, Heather; Pfohl, Stephen; Payne, Perry; Seneviratne, Martin; Gamble, Paul; Kelly, Chris; Babiker, Abubakr (August 2023). "Large language models encode clinical knowledge". Nature. 620 (7972): 172–180. arXiv:2212.13138. Bibcode:2023Natur.620..172S. doi:10.1038/s41586-023-06291-2. ISSN 1476-4687. PMC 10396962. PMID 37438534.
^ Nori, Harsha; King, Nicholas; McKinney, Scott Mayer; Carignan, Dean; Horvitz, Eric (12 April 2023), Capabilities of GPT-4 on Medical Challenge Problems, arXiv:2303.13375
^ "Generative AI and the New Frontier in Cybersecurity". AI Business. 7 February 2024.
^ Pillay, Tharin (15 December 2024). "New Tests Reveal AI's Capacity for Deception". TIME. Retrieved 15 January 2025.
^ "General-purpose artificial intelligence | Think Tank | European Parliament". www.europarl.europa.eu. Retrieved 12 February 2024.
^ Bommasani, Rishi; Soylu, Dilara; Liao, Thomas I.; Creel, Kathleen A.; Liang, Percy (28 March 2023), Ecosystem Graphs: The Social Footprint of Foundation Models, arXiv:2303.15772
^ Bellan, Rebecca (5 August 2025). "DeepMind thinks its new Genie 3 world model presents a stepping stone toward AGI". TechCrunch. Archived from the original on 5 August 2025. Retrieved 8 November 2025.
^ ^a ^b Takahashi, Dean (7 January 2025). "Nvidia launches Cosmos world foundation model platform to accelerate physical AI". VentureBeat. Archived from the original on 8 January 2025. Retrieved 15 June 2025.
^ Pearl, Mike (15 November 2025). "'Imagine a Cube Floating in the Air': The New AI Dream Allegedly Driving Yann LeCun Away from Meta". Gizmodo. Archived from the original on 16 November 2025. Retrieved 28 November 2025.
^ ^a ^b ^c Fried, Ina (17 November 2025). "AI's next big leap is models that understand the world". Axios. Archived from the original on 17 November 2025. Retrieved 28 November 2025.
^ Husain, Mishal (21 November 2025). "The Godmother of AI Didn't Expect It to Be This Massive". Bloomberg News. Archived from the original on 24 November 2025. Retrieved 28 November 2025.
^ Whitwam, Ryan (5 August 2025). "DeepMind reveals Genie 3 "world model" that creates real-time interactive simulations". Ars Technica. Archived from the original on 12 September 2025. Retrieved 28 November 2025.
^ ^a ^b Pillay, Tharin (9 December 2025). "Inside Fei-Fei Li's Plan to Build AI-Powered Virtual Worlds". TIME. Archived from the original on 10 December 2025. Retrieved 11 December 2025.
^ ^a ^b Gelling, Peter; Varanasi, Lakshmi. "The AI doomers are having their moment". Business Insider. Archived from the original on 25 August 2025. Retrieved 14 October 2025.
^ ^a ^b ^c ^d ^e ^f Ding, Jingtao; Zhang, Yunke; Shang, Yu; Zhang, Yuheng; Zong, Zefang; Feng, Jie; Yuan, Yuan; Su, Hongyuan; Li, Nian; Sukiennik, Nicholas; Xu, Fengli; Li, Yong (9 September 2025). "Understanding World or Predicting Future? A Comprehensive Survey of World Models". ACM Comput. Surv. 58 (3): 57:1–57:38. doi:10.1145/3746449. ISSN 0360-0300.
^ Pavlus, John (2 September 2025). "'World Models,' an Old Idea in AI, Mount a Comeback". Quanta Magazine. Archived from the original on 23 September 2025. Retrieved 14 October 2025.
^ Ha, David; Schmidhuber, Jürgen (3 December 2018). "Recurrent world models facilitate policy evolution". Proceedings of the 32nd International Conference on Neural Information Processing Systems. NIPS'18. Red Hook, NY, USA: Curran Associates Inc.: 2455–2467.
^ Heikkilä, Melissa; Douglas Heaven, Will (24 June 2022). "Yann LeCun has a bold new vision for the future of AI". MIT Technology Review. Archived from the original on 24 June 2022. Retrieved 15 June 2025.
^ ^a ^b Zeff, Maxwell (17 October 2024). "Meta's AI chief says world models are key to 'human-level AI' – but it might be 10 years out". TechCrunch. Archived from the original on 4 March 2025. Retrieved 15 June 2025.
^ Sawers, Paul (23 January 2025). "Meta's Yann LeCun predicts 'new paradigm of AI architectures' within 5 years and 'decade of robotics'". TechCrunch. Archived from the original on 28 September 2025. Retrieved 17 December 2025.
^ Zeff, Maxwell (17 October 2024). "Meta's AI chief says world models are key to 'human-level AI' — but it might be 10 years out". TechCrunch. Archived from the original on 20 September 2025. Retrieved 17 December 2025.
^ ^a ^b ^c ^d Wiggers, Kyle (14 December 2024). "What are AI 'world models,' and why do they matter?". TechCrunch. Archived from the original on 18 March 2025. Retrieved 15 June 2025.
^ Wiggers, Kyle (7 January 2025). "Nvidia releases its own brand of world models". TechCrunch. Archived from the original on 5 March 2025. Retrieved 15 June 2025.
^ Chen, Wency (5 May 2025). "China's answer to Autodesk is betting on AI to build a 'world model'". South China Morning Post. Archived from the original on 5 May 2025. Retrieved 15 June 2025.
^ Knight, Will. "A United Arab Emirates Lab Announces Frontier AI Projects—and a New Outpost in Silicon Valley". Wired. ISSN 1059-1028. Archived from the original on 22 May 2025. Retrieved 15 June 2025.
^ Orland, Kyle (5 March 2024). "Google's Genie game maker is what happens when AI watches 30K hrs of video games". Ars Technica. Archived from the original on 8 December 2024. Retrieved 15 June 2025.
^ Orland, Kyle (6 December 2024). "Google's Genie 2 "world model" reveal leaves more questions than answers". Ars Technica. Archived from the original on 7 December 2024. Retrieved 15 June 2025.
^ Browne, Ryan (11 June 2025). "Meta launches AI 'world model' to advance robotics, self-driving cars". CNBC. Archived from the original on 27 September 2025. Retrieved 14 October 2025.
^ ^a ^b Osawa, Juro; Liu, Qianer; Yang, Jing. "ByteDance's Upcoming 'World Model'; Altman's Thoughts on the GPT-5 Fiasco, Profitability, and Going Public". The Information. Archived from the original on 4 September 2025. Retrieved 14 October 2025.
^ Criddle, Cristina; Murphy, Hannah; Bradshaw, Tim (29 September 2025). "Big AI firms pump money into world models as LLM advances slow". Ars Technica. Archived from the original on 4 October 2025. Retrieved 14 October 2025.
^ Levine, Adam. "Pokémon Go Is Driving a $4 Billion Spinoff. Those Game Maps Could Be AI Gold". barrons.
^ ^a ^b Criddle, Cristina (12 October 2025). "Elon Musk's xAI joins race to build 'world models' to power video games". Financial Times. Archived from the original on 12 October 2025. Retrieved 14 October 2025.
^ ^a ^b Varanasi, Lakshmi. "Top AI researchers say language is limiting. Here's the new kind of model they are building instead". Business Insider. Archived from the original on 15 June 2025. Retrieved 21 June 2025.
^ ^a ^b Levy, Steven (13 September 2024). "The Godmother of AI Wants Everyone to Be a World Builder". Wired. ISSN 1059-1028. Archived from the original on 7 February 2025. Retrieved 21 June 2025.
^ ^a ^b Bellan, Rebecca (12 November 2025). "Fei-Fei Li's World Labs speeds up the world model race with Marble, its first commercial product". TechCrunch. Archived from the original on 19 November 2025. Retrieved 28 November 2025.
^ ^a ^b Wiggers, Kyle (6 January 2025). "Google is forming a new team to build AI that can simulate the physical world". TechCrunch. Archived from the original on 12 October 2025. Retrieved 3 December 2025.
^ ^a ^b Wiggers, Kyle (28 May 2025). "Odyssey's new AI model streams 3D interactive worlds". TechCrunch. Archived from the original on 30 May 2025. Retrieved 15 June 2025.
^ Mims, Christopher (26 September 2025). "What Are 'World Models'? The Key to the Next Big AI Leap". The Wall Street Journal. Archived from the original on 3 October 2025. Retrieved 14 October 2025.
^ Orland, Kyle (1 October 2025). "Can today's AI video models accurately model how the real world works?". Ars Technica. Archived from the original on 3 October 2025. Retrieved 14 October 2025.
^ Taniguchi, Tadahiro; Murata, Shingo; Suzuki, Masahiro; Ognibene, Dimitri; Lanillos, Pablo; Ugur, Emre; Jamone, Lorenzo; Nakamura, Tomoaki; Ciria, Alejandra; Lara, Bruno; Pezzulo, Giovanni (3 July 2023). "World models and predictive coding for cognitive and developmental robotics: frontiers and challenges". Advanced Robotics. 37 (13): 780–806. doi:10.1080/01691864.2023.2225232. hdl:10281/450879. ISSN 0169-1864.
^ Bommasani, Rishi; Klyman, Kevin; Longpre, Shayne; Kapoor, Sayash; Maslej, Nestor; Xiong, Betty; Zhang, Daniel; Liang, Percy (19 October 2023), The Foundation Model Transparency Index, arXiv:2310.12941
^ Claude Elwood, Shannon (July 1948). "A Mathematical Theory of Communication" (PDF). Bell System Technical Journal.
^ Radford, Alec; Kim, Jong Wook; Hallacy, Chris; Ramesh, Aditya; Goh, Gabriel; Agarwal, Sandhini; Sastry, Girish; Askell, Amanda; Mishkin, Pamela (26 February 2021), Learning Transferable Visual Models From Natural Language Supervision, arXiv:2103.00020
^ Kaplan, Jared; McCandlish, Sam; Henighan, Tom; Brown, Tom B.; Chess, Benjamin; Child, Rewon; Gray, Scott; Radford, Alec; Wu, Jeffrey (22 January 2020), Scaling Laws for Neural Language Models, arXiv:2001.08361
^ Jo, Eun Seo; Gebru, Timnit (27 January 2020). "Lessons from archives: Strategies for collecting sociocultural data in machine learning". Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. pp. 306–316. arXiv:1912.10389. doi:10.1145/3351095.3372829. ISBN 978-1-4503-6936-7.
^ Bender, Emily M.; Gebru, Timnit; McMillan-Major, Angelina; Shmitchell, Shmargaret (1 March 2021). "On the Dangers of Stochastic Parrots: Can Language Models be Too Big? 🦜". Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. FAccT '21. New York, New York, USA: Association for Computing Machinery. pp. 610–623. doi:10.1145/3442188.3445922. ISBN 978-1-4503-8309-7.
^ Brown, Tom B.; Mann, Benjamin; Ryder, Nick; Subbiah, Melanie; Kaplan, Jared; Dhariwal, Prafulla; Neelakantan, Arvind; Shyam, Pranav; Sastry, Girish (22 July 2020), Language Models are Few-Shot Learners, arXiv:2005.14165
^ Caballero, Ethan; Gupta, Kshitij; Rish, Irina; Krueger, David (2022). "Broken Neural Scaling Laws". International Conference on Learning Representations (ICLR), 2023.
^ Zaken, Elad Ben; Ravfogel, Shauli; Goldberg, Yoav (5 September 2022), BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models, arXiv:2106.10199
^ "Papers with Code – MMLU Benchmark (Multi-task Language Understanding)". paperswithcode.com. Retrieved 21 April 2024.
^ Yue, Xiang; Ni, Yuansheng; Zhang, Kai; Zheng, Tianyu; Liu, Ruoqi; Zhang, Ge; Stevens, Samuel; Jiang, Dongfu; Ren, Weiming (20 December 2023), MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI, arXiv:2311.16502
^ "Papers with Code – HumanEval Benchmark (Code Generation)". paperswithcode.com. Retrieved 21 April 2024.
^ "Papers with Code – GSM8K Benchmark (Arithmetic Reasoning)". paperswithcode.com. Retrieved 21 April 2024.
^ EleutherAI/lm-evaluation-harness, EleutherAI, 21 April 2024, retrieved 21 April 2024
^ Srivastava, Aarohi; Rastogi, Abhinav; Rao, Abhishek; Shoeb, Abu Awal Md; Abid, Abubakar; Fisch, Adam; Brown, Adam R.; Santoro, Adam; Gupta, Aditya (12 June 2023), Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models, arXiv:2206.04615
^ "Holistic Evaluation of Language Models (HELM)". crfm.stanford.edu. Retrieved 21 April 2024.
^ "open-llm-leaderboard (Open LLM Leaderboard)". huggingface.co. 9 November 2023. Retrieved 21 April 2024.
^ "DecodingTrust Benchmark". decodingtrust.github.io. Retrieved 21 April 2024.
^ "Holistic Evaluation of Image Models (HEIM)". crfm.stanford.edu. Retrieved 21 April 2024.
^ Linzen, Tal (July 2020). Jurafsky, Dan; Chai, Joyce; Schluter, Natalie; Tetreault, Joel (eds.). "How Can We Accelerate Progress Towards Human-like Linguistic Generalization?". Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Online: Association for Computational Linguistics: 5210–5217. arXiv:2005.00955. doi:10.18653/v1/2020.acl-main.465.
^ "Ecosystem Graphs for Foundation Models". crfm.stanford.edu. Retrieved 13 February 2024.
^ Vipra, Jai; Korinek, Anton (2 November 2023), Market Concentration Implications of Foundation Models, arXiv:2311.01550
^ "Accelerate the Development of AI Applications | Scale AI". scale.com. Retrieved 21 April 2024.
^ "Surge AI | World's Most Powerful Data Labeling Platform". www.surgehq.ai. Retrieved 21 April 2024.
^ "2024 AI Index – chapter 1" (PDF). 15 April 2024. pp. 37–39.
^ "Computational Power and AI". AI Now Institute. 27 September 2023. Retrieved 13 February 2024.
^ Tiku, Nitasha; Schaul, Kevin; Chen, Szu Yu. "These fake images reveal how AI amplifies our worst stereotypes". Washington Post. Retrieved 13 February 2024.
^ "How the AI industry profits from catastrophe". MIT Technology Review. Retrieved 13 February 2024.
^ "Exclusive: The $2 Per Hour Workers Who Made ChatGPT Safer". TIME. 18 January 2023. Archived from the original on 19 January 2023. Retrieved 13 February 2024.
^ Liang, Percy; Bommasani, Rishi; Creel, Kathleen (17 May 2022). "The Time is Now to Develop Community Norms for the Release of Foundation Models". Stanford CRFM.
^ Solaiman, Irene (5 February 2023), The Gradient of Generative AI Release: Methods and Considerations, arXiv:2302.04844
^ Alayrac, Jean-Baptiste; Donahue, Jeff; Luc, Pauline; Miech, Antoine; Barr, Iain; Hasson, Yana; Lenc, Karel; Mensch, Arthur; Millican, Katie (15 November 2022), Flamingo: a Visual Language Model for Few-Shot Learning, arXiv:2204.14198.
^ "Microsoft 365 Copilot overview". Microsoft Learn. Microsoft. 7 November 2025. Retrieved 4 January 2026.
^ "Transforming customer service: How generative AI is changing the game". IBM Think. IBM. 17 July 2023. Retrieved 4 January 2026.
^ "The NeuralFabric Generative AI Platform Pioneers Micro-Foundation Models to Decrease Costs, Ensure Data Sovereignty and Democratize AI". PR Newswire. 29 February 2024. Retrieved 4 January 2026.
^ Mathews, Anshika (1 March 2024). "NeuralFabric Launches Groundbreaking Micro-Foundation Models in Generative AI". AIM Research. Retrieved 4 January 2026.
^ Smith, Matthew S. (21 June 2025). "The Best AI Coding Tools You Can Use Right Now". IEEE Spectrum. Retrieved 4 January 2026.
^ Heath, Alex (24 February 2025). "Anthropic's new 'hybrid reasoning' AI model is its smartest yet". The Verge. Retrieved 4 January 2026.
^ "Enabling Claude Code to work more autonomously". Anthropic. 29 September 2025. Retrieved 4 January 2026.
^ "Claude Code overview". Claude Code Docs. Anthropic. Retrieved 4 January 2026.
^ "anthropics/claude-code". GitHub. Anthropic. Retrieved 4 January 2026.
^ "GitHub Copilot · Your AI pair programmer". GitHub. Retrieved 4 January 2026.
^ "Gemini Code Assist overview". Google Developers. Google Inc. Retrieved 4 January 2026.
^ "Amazon Q Developer". Amazon Web Services. Amazon. Retrieved 4 January 2026.
^ Bonifield, Stevie (8 December 2025). "Anthropic is bringing Claude Code to Slack". The Verge. Retrieved 4 January 2026.
^ How Anthropic teams use Claude Code (PDF) (Report). Anthropic. Retrieved 4 January 2026.
^ "Microsoft to now include Copilot in Microsoft 365 for consumers". Reuters. 16 January 2025. Retrieved 4 January 2026.
^ "COPILOT function". Microsoft Support. Microsoft. Retrieved 4 January 2026.
^ Timilsina, Mohan; Buosi, Samuele; Razzaq, Muhammad Asif; Haque, Rafiqul; Judge, Conor; Curry, Edward (2025). "Harmonizing foundation models in healthcare: A comprehensive survey of their roles, relationships, and impact in artificial intelligence's advancing terrain". Computers in Biology and Medicine. 189 109925. doi:10.1016/j.compbiomed.2025.109925. PMID 40081208.
^ Shi, Yuhong; Yu, Kun; Dong, Yifei; Chen, Fang (June 2026). "Large language models in education: a systematic review of empirical applications, benefits, and challenges". Computers and Education: Artificial Intelligence. 10 100529. doi:10.1016/j.caeai.2025.100529.

[:1-1] Competition and Markets Authority (18 September 2023). "AI Foundation Models: Initial Report" (PDF). gov.uk.

[2] Nestor Maslej, Loredana Fattorini, Erik Brynjolfsson, John Etchemendy, Katrina Ligett, Terah Lyons, James Manyika, Helen Ngo, Juan Carlos Niebles, Vanessa Parli, Yoav Shoham, Russell Wald, Jack Clark, and Raymond Perrault, "The AI Index 2023 Annual Report," AI Index Steering Committee, Institute for Human-Centered AI, Stanford University, Stanford, California, April 2023.

[3] Rogers, Anna; Kovaleva, Olga; Rumshisky, Anna (2020). "A Primer in BERTology: What we know about how BERT works". arXiv:2002.12327 [cs.CL].

[4] Haddad, Mohammed. "How does GPT-4 work and how can you start using it in ChatGPT?". Al Jazeera. Retrieved 20 October 2024.

[deepmind_20220428-5] Tackling multiple tasks with a single visual language model, 28 April 2022, retrieved 13 June 2022

[6] Copet, Jade; Kreuk, Felix; Gat, Itai; Remez, Tal; Kant, David; Synnaeve, Gabriel; Adi, Yossi; Défossez, Alexandre (7 November 2023). "Simple and Controllable Music Generation". arXiv:2306.05284 [cs.SD].

[7] "LLark: A Multimodal Foundation Model for Music". Spotify Research. 13 October 2023. Retrieved 11 December 2023.

[8] "Speaking robot: Our new AI model translates vision and language into robotic actions". Google. 28 July 2023. Retrieved 11 December 2023.

[9] Nguyen, Tuan Dung; Ting, Yuan-Sen; Ciucă, Ioana; O'Neill, Charlie; Sun, Ze-Chang; Jabłońska, Maja; Kruk, Sandor; Perkowski, Ernest; Miller, Jack (12 September 2023). "AstroLLaMA: Towards Specialized Foundation Models in Astronomy". arXiv:2309.06126 [astro-ph.IM].

[10] Tu, Tao; Azizi, Shekoofeh; Driess, Danny; Schaekermann, Mike; Amin, Mohamed; Chang, Pi-Chuan; Carroll, Andrew; Lau, Chuck; Tanno, Ryutaro (26 July 2023). "Towards Generalist Biomedical AI". arXiv:2307.14334 [cs.CL].

[11] Zvyagin, Maxim; Brace, Alexander; Hippe, Kyle; Deng, Yuntian; Zhang, Bin; Bohorquez, Cindy Orozco; Clyde, Austin; Kale, Bharat; Perez-Rivera, Danilo (2023). "GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics". The International Journal of High Performance Computing Applications. 37 (6): 683–705. bioRxiv 10.1101/2022.10.10.511571. doi:10.1177/10943420231201154.

[12] Li, Raymond; Allal, Loubna Ben; Zi, Yangtian; Muennighoff, Niklas; Kocetkov, Denis; Mou, Chenghao; Marone, Marc; Akiki, Christopher; Li, Jia (9 May 2023). "StarCoder: may the source be with you!". arXiv:2305.06161 [cs.CL].

[13] Se, Ksenia; Spektor, Ian (5 April 2024). "Revolutionizing Time Series Forecasting: Interview with TimeGPT's creators". Turing Post. Retrieved 11 April 2024.

[14] Azerbayev, Zhangir; Schoelkopf, Hailey; Paster, Keiran; Santos, Marco Dos; McAleer, Stephen; Jiang, Albert Q.; Deng, Jia; Biderman, Stella; Welleck, Sean (30 November 2023). "Llemma: An Open Language Model For Mathematics". arXiv:2310.10631 [cs.CL].

[15] "Orbital". Archived from the original on 3 September 2024. Retrieved 5 September 2024.

[CRFM-16] "Introducing the Center for Research on Foundation Models (CRFM)". Stanford HAI. 18 August 2021. Retrieved 11 June 2022.

[Bommasani_20210818-17] Bommasani, Rishi; et al. (18 August 2021). On the Opportunities and Risks of Foundation Models (Report). arXiv:2108.07258.

[18] "Reflections on Foundation Models". Stanford HAI. 18 October 2021. Retrieved 22 May 2023.

[19] Bommasani, Rishi; Liang, Percy (18 October 2021). "Reflections on Foundation Models". Stanford CRFM. Retrieved 11 December 2023.

[20] Marcus, Gary (11 September 2021). "Has AI found a new Foundation?". The Gradient. Retrieved 11 December 2023.

[21] "Executive Order on the Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence". The White House. 30 October 2023. Retrieved 12 February 2024.

[22] "AI Foundation Model Transparency Act" (PDF).

[:2-23] Liang, Percy; Bommasani, Rishi; Lee, Tony; Tsipras, Dimitris; Soylu, Dilara; Yasunaga, Michihiro; Zhang, Yian; Narayanan, Deepak; Wu, Yuhuai (1 October 2023), "Holistic Evaluation of Language Models", Annals of the New York Academy of Sciences, 1525 (1): 140–146, arXiv:2211.09110, Bibcode:2023NYASA1525..140B, doi:10.1111/nyas.15007, PMID 37230490

[24] "Joint Statement on AI Safety and Openness". Mozilla. 31 October 2023. Retrieved 12 February 2024.

[25] "Hawley and Blumenthal Demand Answers from Meta, Warn of Misuse After 'Leak' of Meta's AI Model". Senator Josh Hawley. 6 June 2023. Retrieved 12 February 2024.

[:0-26] Anderljung, Markus; Barnhart, Joslyn; Korinek, Anton; Leung, Jade; O'Keefe, Cullen; Whittlestone, Jess; Avin, Shahar; Brundage, Miles; Bullock, Justin (7 November 2023), Frontier AI Regulation: Managing Emerging Risks to Public Safety, arXiv:2307.03718

[27] Singhal, Karan; Azizi, Shekoofeh; Tu, Tao; Mahdavi, S. Sara; Wei, Jason; Chung, Hyung Won; Scales, Nathan; Tanwani, Ajay; Cole-Lewis, Heather; Pfohl, Stephen; Payne, Perry; Seneviratne, Martin; Gamble, Paul; Kelly, Chris; Babiker, Abubakr (August 2023). "Large language models encode clinical knowledge". Nature. 620 (7972): 172–180. arXiv:2212.13138. Bibcode:2023Natur.620..172S. doi:10.1038/s41586-023-06291-2. ISSN 1476-4687. PMC 10396962. PMID 37438534.

[28] Nori, Harsha; King, Nicholas; McKinney, Scott Mayer; Carignan, Dean; Horvitz, Eric (12 April 2023), Capabilities of GPT-4 on Medical Challenge Problems, arXiv:2303.13375

[29] "Generative AI and the New Frontier in Cybersecurity". AI Business. 7 February 2024.

[30] Pillay, Tharin (15 December 2024). "New Tests Reveal AI's Capacity for Deception". TIME. Retrieved 15 January 2025.

[31] "General-purpose artificial intelligence | Think Tank | European Parliament". www.europarl.europa.eu. Retrieved 12 February 2024.

[32] Bommasani, Rishi; Soylu, Dilara; Liao, Thomas I.; Creel, Kathleen A.; Liang, Percy (28 March 2023), Ecosystem Graphs: The Social Footprint of Foundation Models, arXiv:2303.15772

[33] Bellan, Rebecca (5 August 2025). "DeepMind thinks its new Genie 3 world model presents a stepping stone toward AGI". TechCrunch. Archived from the original on 5 August 2025. Retrieved 8 November 2025.

[:9-34] Takahashi, Dean (7 January 2025). "Nvidia launches Cosmos world foundation model platform to accelerate physical AI". VentureBeat. Archived from the original on 8 January 2025. Retrieved 15 June 2025.

[35] Pearl, Mike (15 November 2025). "'Imagine a Cube Floating in the Air': The New AI Dream Allegedly Driving Yann LeCun Away from Meta". Gizmodo. Archived from the original on 16 November 2025. Retrieved 28 November 2025.

[:10-36] Fried, Ina (17 November 2025). "AI's next big leap is models that understand the world". Axios. Archived from the original on 17 November 2025. Retrieved 28 November 2025.

[37] Husain, Mishal (21 November 2025). "The Godmother of AI Didn't Expect It to Be This Massive". Bloomberg News. Archived from the original on 24 November 2025. Retrieved 28 November 2025.

[38] Whitwam, Ryan (5 August 2025). "DeepMind reveals Genie 3 "world model" that creates real-time interactive simulations". Ars Technica. Archived from the original on 12 September 2025. Retrieved 28 November 2025.

[:16-39] Pillay, Tharin (9 December 2025). "Inside Fei-Fei Li's Plan to Build AI-Powered Virtual Worlds". TIME. Archived from the original on 10 December 2025. Retrieved 11 December 2025.

[:12-40] Gelling, Peter; Varanasi, Lakshmi. "The AI doomers are having their moment". Business Insider. Archived from the original on 25 August 2025. Retrieved 14 October 2025.

[:13-41] ^ ^a ^b ^c ^d ^e ^f Ding, Jingtao; Zhang, Yunke; Shang, Yu; Zhang, Yuheng; Zong, Zefang; Feng, Jie; Yuan, Yuan; Su, Hongyuan; Li, Nian; Sukiennik, Nicholas; Xu, Fengli; Li, Yong (9 September 2025). "Understanding World or Predicting Future? A Comprehensive Survey of World Models". ACM Comput. Surv. 58 (3): 57:1–57:38. doi:10.1145/3746449. ISSN 0360-0300.

[42] Pavlus, John (2 September 2025). "'World Models,' an Old Idea in AI, Mount a Comeback". Quanta Magazine. Archived from the original on 23 September 2025. Retrieved 14 October 2025.

[43] Ha, David; Schmidhuber, Jürgen (3 December 2018). "Recurrent world models facilitate policy evolution". Proceedings of the 32nd International Conference on Neural Information Processing Systems. NIPS'18. Red Hook, NY, USA: Curran Associates Inc.: 2455–2467.

[44] Heikkilä, Melissa; Douglas Heaven, Will (24 June 2022). "Yann LeCun has a bold new vision for the future of AI". MIT Technology Review. Archived from the original on 24 June 2022. Retrieved 15 June 2025.

[:3-45] Zeff, Maxwell (17 October 2024). "Meta's AI chief says world models are key to 'human-level AI' – but it might be 10 years out". TechCrunch. Archived from the original on 4 March 2025. Retrieved 15 June 2025.

[46] Sawers, Paul (23 January 2025). "Meta's Yann LeCun predicts 'new paradigm of AI architectures' within 5 years and 'decade of robotics'". TechCrunch. Archived from the original on 28 September 2025. Retrieved 17 December 2025.

[47] Zeff, Maxwell (17 October 2024). "Meta's AI chief says world models are key to 'human-level AI' — but it might be 10 years out". TechCrunch. Archived from the original on 20 September 2025. Retrieved 17 December 2025.

[:4-48] Wiggers, Kyle (14 December 2024). "What are AI 'world models,' and why do they matter?". TechCrunch. Archived from the original on 18 March 2025. Retrieved 15 June 2025.

[49] Wiggers, Kyle (7 January 2025). "Nvidia releases its own brand of world models". TechCrunch. Archived from the original on 5 March 2025. Retrieved 15 June 2025.

[50] Chen, Wency (5 May 2025). "China's answer to Autodesk is betting on AI to build a 'world model'". South China Morning Post. Archived from the original on 5 May 2025. Retrieved 15 June 2025.

[51] Knight, Will. "A United Arab Emirates Lab Announces Frontier AI Projects—and a New Outpost in Silicon Valley". Wired. ISSN 1059-1028. Archived from the original on 22 May 2025. Retrieved 15 June 2025.

[52] Orland, Kyle (5 March 2024). "Google's Genie game maker is what happens when AI watches 30K hrs of video games". Ars Technica. Archived from the original on 8 December 2024. Retrieved 15 June 2025.

[53] Orland, Kyle (6 December 2024). "Google's Genie 2 "world model" reveal leaves more questions than answers". Ars Technica. Archived from the original on 7 December 2024. Retrieved 15 June 2025.

[54] Browne, Ryan (11 June 2025). "Meta launches AI 'world model' to advance robotics, self-driving cars". CNBC. Archived from the original on 27 September 2025. Retrieved 14 October 2025.

[:8-55] Osawa, Juro; Liu, Qianer; Yang, Jing. "ByteDance's Upcoming 'World Model'; Altman's Thoughts on the GPT-5 Fiasco, Profitability, and Going Public". The Information. Archived from the original on 4 September 2025. Retrieved 14 October 2025.

[56] Criddle, Cristina; Murphy, Hannah; Bradshaw, Tim (29 September 2025). "Big AI firms pump money into world models as LLM advances slow". Ars Technica. Archived from the original on 4 October 2025. Retrieved 14 October 2025.

[57] Levine, Adam. "Pokémon Go Is Driving a $4 Billion Spinoff. Those Game Maps Could Be AI Gold". barrons.

[:7-58] Criddle, Cristina (12 October 2025). "Elon Musk's xAI joins race to build 'world models' to power video games". Financial Times. Archived from the original on 12 October 2025. Retrieved 14 October 2025.

[:5-59] Varanasi, Lakshmi. "Top AI researchers say language is limiting. Here's the new kind of model they are building instead". Business Insider. Archived from the original on 15 June 2025. Retrieved 21 June 2025.

[:6-60] Levy, Steven (13 September 2024). "The Godmother of AI Wants Everyone to Be a World Builder". Wired. ISSN 1059-1028. Archived from the original on 7 February 2025. Retrieved 21 June 2025.

[:11-61] Bellan, Rebecca (12 November 2025). "Fei-Fei Li's World Labs speeds up the world model race with Marble, its first commercial product". TechCrunch. Archived from the original on 19 November 2025. Retrieved 28 November 2025.

[:14-62] Wiggers, Kyle (6 January 2025). "Google is forming a new team to build AI that can simulate the physical world". TechCrunch. Archived from the original on 12 October 2025. Retrieved 3 December 2025.

[:15-63] Wiggers, Kyle (28 May 2025). "Odyssey's new AI model streams 3D interactive worlds". TechCrunch. Archived from the original on 30 May 2025. Retrieved 15 June 2025.

[64] Mims, Christopher (26 September 2025). "What Are 'World Models'? The Key to the Next Big AI Leap". The Wall Street Journal. Archived from the original on 3 October 2025. Retrieved 14 October 2025.

[65] Orland, Kyle (1 October 2025). "Can today's AI video models accurately model how the real world works?". Ars Technica. Archived from the original on 3 October 2025. Retrieved 14 October 2025.

[66] Taniguchi, Tadahiro; Murata, Shingo; Suzuki, Masahiro; Ognibene, Dimitri; Lanillos, Pablo; Ugur, Emre; Jamone, Lorenzo; Nakamura, Tomoaki; Ciria, Alejandra; Lara, Bruno; Pezzulo, Giovanni (3 July 2023). "World models and predictive coding for cognitive and developmental robotics: frontiers and challenges". Advanced Robotics. 37 (13): 780–806. doi:10.1080/01691864.2023.2225232. hdl:10281/450879. ISSN 0169-1864.

[67] Bommasani, Rishi; Klyman, Kevin; Longpre, Shayne; Kapoor, Sayash; Maslej, Nestor; Xiong, Betty; Zhang, Daniel; Liang, Percy (19 October 2023), The Foundation Model Transparency Index, arXiv:2310.12941

[68] Claude Elwood, Shannon (July 1948). "A Mathematical Theory of Communication" (PDF). Bell System Technical Journal.

[69] Radford, Alec; Kim, Jong Wook; Hallacy, Chris; Ramesh, Aditya; Goh, Gabriel; Agarwal, Sandhini; Sastry, Girish; Askell, Amanda; Mishkin, Pamela (26 February 2021), Learning Transferable Visual Models From Natural Language Supervision, arXiv:2103.00020

[70] Kaplan, Jared; McCandlish, Sam; Henighan, Tom; Brown, Tom B.; Chess, Benjamin; Child, Rewon; Gray, Scott; Radford, Alec; Wu, Jeffrey (22 January 2020), Scaling Laws for Neural Language Models, arXiv:2001.08361

[71] Jo, Eun Seo; Gebru, Timnit (27 January 2020). "Lessons from archives: Strategies for collecting sociocultural data in machine learning". Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. pp. 306–316. arXiv:1912.10389. doi:10.1145/3351095.3372829. ISBN 978-1-4503-6936-7.

[72] Bender, Emily M.; Gebru, Timnit; McMillan-Major, Angelina; Shmitchell, Shmargaret (1 March 2021). "On the Dangers of Stochastic Parrots: Can Language Models be Too Big? 🦜". Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. FAccT '21. New York, New York, USA: Association for Computing Machinery. pp. 610–623. doi:10.1145/3442188.3445922. ISBN 978-1-4503-8309-7.

[73] Brown, Tom B.; Mann, Benjamin; Ryder, Nick; Subbiah, Melanie; Kaplan, Jared; Dhariwal, Prafulla; Neelakantan, Arvind; Shyam, Pranav; Sastry, Girish (22 July 2020), Language Models are Few-Shot Learners, arXiv:2005.14165

[74] Caballero, Ethan; Gupta, Kshitij; Rish, Irina; Krueger, David (2022). "Broken Neural Scaling Laws". International Conference on Learning Representations (ICLR), 2023.

[75] Zaken, Elad Ben; Ravfogel, Shauli; Goldberg, Yoav (5 September 2022), BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models, arXiv:2106.10199

[76] "Papers with Code – MMLU Benchmark (Multi-task Language Understanding)". paperswithcode.com. Retrieved 21 April 2024.

[77] Yue, Xiang; Ni, Yuansheng; Zhang, Kai; Zheng, Tianyu; Liu, Ruoqi; Zhang, Ge; Stevens, Samuel; Jiang, Dongfu; Ren, Weiming (20 December 2023), MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI, arXiv:2311.16502

[78] "Papers with Code – HumanEval Benchmark (Code Generation)". paperswithcode.com. Retrieved 21 April 2024.

[79] "Papers with Code – GSM8K Benchmark (Arithmetic Reasoning)". paperswithcode.com. Retrieved 21 April 2024.

[80] EleutherAI/lm-evaluation-harness, EleutherAI, 21 April 2024, retrieved 21 April 2024

[81] Srivastava, Aarohi; Rastogi, Abhinav; Rao, Abhishek; Shoeb, Abu Awal Md; Abid, Abubakar; Fisch, Adam; Brown, Adam R.; Santoro, Adam; Gupta, Aditya (12 June 2023), Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models, arXiv:2206.04615

[82] "Holistic Evaluation of Language Models (HELM)". crfm.stanford.edu. Retrieved 21 April 2024.

[83] "open-llm-leaderboard (Open LLM Leaderboard)". huggingface.co. 9 November 2023. Retrieved 21 April 2024.

[84] "DecodingTrust Benchmark". decodingtrust.github.io. Retrieved 21 April 2024.

[85] "Holistic Evaluation of Image Models (HEIM)". crfm.stanford.edu. Retrieved 21 April 2024.

[86] Linzen, Tal (July 2020). Jurafsky, Dan; Chai, Joyce; Schluter, Natalie; Tetreault, Joel (eds.). "How Can We Accelerate Progress Towards Human-like Linguistic Generalization?". Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Online: Association for Computational Linguistics: 5210–5217. arXiv:2005.00955. doi:10.18653/v1/2020.acl-main.465.

[87] "Ecosystem Graphs for Foundation Models". crfm.stanford.edu. Retrieved 13 February 2024.

[88] Vipra, Jai; Korinek, Anton (2 November 2023), Market Concentration Implications of Foundation Models, arXiv:2311.01550

[89] "Accelerate the Development of AI Applications | Scale AI". scale.com. Retrieved 21 April 2024.

[90] "Surge AI | World's Most Powerful Data Labeling Platform". www.surgehq.ai. Retrieved 21 April 2024.

[91] "2024 AI Index – chapter 1" (PDF). 15 April 2024. pp. 37–39.

[92] "Computational Power and AI". AI Now Institute. 27 September 2023. Retrieved 13 February 2024.

[93] Tiku, Nitasha; Schaul, Kevin; Chen, Szu Yu. "These fake images reveal how AI amplifies our worst stereotypes". Washington Post. Retrieved 13 February 2024.

[94] "How the AI industry profits from catastrophe". MIT Technology Review. Retrieved 13 February 2024.

[95] "Exclusive: The $2 Per Hour Workers Who Made ChatGPT Safer". TIME. 18 January 2023. Archived from the original on 19 January 2023. Retrieved 13 February 2024.

[96] Liang, Percy; Bommasani, Rishi; Creel, Kathleen (17 May 2022). "The Time is Now to Develop Community Norms for the Release of Foundation Models". Stanford CRFM.

[97] Solaiman, Irene (5 February 2023), The Gradient of Generative AI Release: Methods and Considerations, arXiv:2302.04844

[98] Alayrac, Jean-Baptiste; Donahue, Jeff; Luc, Pauline; Miech, Antoine; Barr, Iain; Hasson, Yana; Lenc, Karel; Mensch, Arthur; Millican, Katie (15 November 2022), Flamingo: a Visual Language Model for Few-Shot Learning, arXiv:2204.14198.

[M365CopilotOverview-99] "Microsoft 365 Copilot overview". Microsoft Learn. Microsoft. 7 November 2025. Retrieved 4 January 2026.

[IBMCustomerService-100] "Transforming customer service: How generative AI is changing the game". IBM Think. IBM. 17 July 2023. Retrieved 4 January 2026.

[PRNewswireNeuralFabric2024-101] "The NeuralFabric Generative AI Platform Pioneers Micro-Foundation Models to Decrease Costs, Ensure Data Sovereignty and Democratize AI". PR Newswire. 29 February 2024. Retrieved 4 January 2026.

[AIMNeuralFabric2024-102] Mathews, Anshika (1 March 2024). "NeuralFabric Launches Groundbreaking Micro-Foundation Models in Generative AI". AIM Research. Retrieved 4 January 2026.

[IEEESpectrumCodingTools2025-103] Smith, Matthew S. (21 June 2025). "The Best AI Coding Tools You Can Use Right Now". IEEE Spectrum. Retrieved 4 January 2026.

[TheVergeClaudeCode2025-104] Heath, Alex (24 February 2025). "Anthropic's new 'hybrid reasoning' AI model is its smartest yet". The Verge. Retrieved 4 January 2026.

[AnthropicVSCodeExt2025-105] "Enabling Claude Code to work more autonomously". Anthropic. 29 September 2025. Retrieved 4 January 2026.

[106] "Claude Code overview". Claude Code Docs. Anthropic. Retrieved 4 January 2026.

[107] "anthropics/claude-code". GitHub. Anthropic. Retrieved 4 January 2026.

[108] "GitHub Copilot · Your AI pair programmer". GitHub. Retrieved 4 January 2026.

[109] "Gemini Code Assist overview". Google Developers. Google Inc. Retrieved 4 January 2026.

[110] "Amazon Q Developer". Amazon Web Services. Amazon. Retrieved 4 January 2026.

[111] Bonifield, Stevie (8 December 2025). "Anthropic is bringing Claude Code to Slack". The Verge. Retrieved 4 January 2026.

[112] How Anthropic teams use Claude Code (PDF) (Report). Anthropic. Retrieved 4 January 2026.

[113] "Microsoft to now include Copilot in Microsoft 365 for consumers". Reuters. 16 January 2025. Retrieved 4 January 2026.

[114] "COPILOT function". Microsoft Support. Microsoft. Retrieved 4 January 2026.

[115] Timilsina, Mohan; Buosi, Samuele; Razzaq, Muhammad Asif; Haque, Rafiqul; Judge, Conor; Curry, Edward (2025). "Harmonizing foundation models in healthcare: A comprehensive survey of their roles, relationships, and impact in artificial intelligence's advancing terrain". Computers in Biology and Medicine. 189 109925. doi:10.1016/j.compbiomed.2025.109925. PMID 40081208.

[116] Shi, Yuhong; Yu, Kun; Dong, Yifei; Chen, Fang (June 2026). "Large language models in education: a systematic review of empirical applications, benefits, and challenges". Computers and Education: Artificial Intelligence. 10 100529. doi:10.1016/j.caeai.2025.100529.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]

[52]

[53]

[54]

[55]

[56]

[57]

[58]

[59]

[60]

[61]

[62]

[63]

[64]

[65]

[66]

[67]

[68]

[69]

[70]

[71]

[72]

[73]

[74]

[75]

[76]

[77]

[78]

[79]

[80]

[81]

[82]

[83]

[84]

[85]

[86]

[87]

[88]

[89]

[90]

[91]

[92]

[93]

[94]

[95]

[96]

[97]

[98]

[99]

[100]

History

Foundation model

Recent from talks

Recent from talks

Contribute something

Contribute something

Media Pages

Timelines

Articles

Notes collections

Notes

Notes

Days in Chronicle

Foundation model

Definitions

History

Related concepts

Frontier models

General-purpose AI

World models

History

Training

Examples

Applications

Concerns

Technical details

Modeling

Training

Data

Systems

Scaling

Adaptation

Evaluation

Supply chain

Release strategies

Practices and applications

References