IBM Granite
from Wikipedia
Developer: IBM Research[1]
Initial release: November 7, 2023
Platform: IBM Watsonx (initially), GitHub, Hugging Face, RHEL AI
License: Proprietary; code models: open source (Apache 2.0)[2]
Website: ibm.com/granite

IBM Granite is a series of decoder-only AI foundation models created by IBM.[3] It was announced on September 7, 2023,[4][5] and an initial paper was published four days later.[6] The models were initially intended for use in Watsonx, IBM's cloud-based data and generative AI platform, alongside other models;[7] IBM later opened the source code of some of the code models.[8][9] Granite models are trained on datasets curated from the Internet, academic publications, code datasets, and legal and financial documents.[10][11][1]

Foundation models

A foundation model is an AI model trained on broad data at scale such that it can be adapted to a wide range of downstream tasks.[12]

Granite's first foundation models were Granite.13b.instruct and Granite.13b.chat. The "13b" in their names stands for 13 billion, the number of parameters each model has, which was fewer than most of the larger models of the time. Later models range from 3 to 34 billion parameters.[4][13]
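
The naming convention described above can be illustrated with a short sketch. The helper `parameter_count` is hypothetical (it is not part of any IBM tooling), and it assumes the size token always has the form of a number followed by "b", set off from the rest of the name by punctuation, as in the two names quoted in this section.

```python
import re

# Hypothetical helper: reads the "<number>b" size token out of a model name.
# Assumes the token is delimited by non-alphanumeric characters (e.g. dots),
# as in "Granite.13b.instruct"; other naming schemes are not handled.
_SIZE_TOKEN = re.compile(
    r"(?<![A-Za-z0-9])(\d+(?:\.\d+)?)b(?![A-Za-z0-9])",
    flags=re.IGNORECASE,
)


def parameter_count(model_name: str) -> int:
    """Return the parameter count encoded in a model name, in whole parameters."""
    match = _SIZE_TOKEN.search(model_name)
    if match is None:
        raise ValueError(f"no size token found in {model_name!r}")
    # "b" stands for billion, so scale the number by 10**9.
    return round(float(match.group(1)) * 1_000_000_000)


if __name__ == "__main__":
    for name in ("Granite.13b.instruct", "Granite.13b.chat"):
        print(name, parameter_count(name))
```

Under these assumptions, both names quoted above resolve to 13,000,000,000 parameters, and a name without a size token raises `ValueError` rather than guessing.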

On May 6, 2024, IBM released the source code of four variations of Granite Code Models under Apache 2.0, a permissive open-source license that allows free use, modification and sharing of the software, and published them on Hugging Face for public use.[14][15] According to IBM's own report, Granite 8b outperforms Llama 3 on several coding-related tasks among models with a similar number of parameters.[16][17]
