Hubbry Logo
LanguageToolLanguageToolMain
Open search
LanguageTool
Community hub
LanguageTool
logo
8 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
LanguageTool
LanguageTool
from Wikipedia
LanguageTool
DevelopersDaniel Naber and Marcin Miłkowski
Initial release15 August 2005; 20 years ago (2005-08-15)
Stable release
6.7[1] Edit this on Wikidata / 10 October 2025
Repository
Written inJava
PlatformJava SE
Size
  • Desktop app: 156 MB[2]
  • n-gram data: 8.34 GB[3]
TypeGrammar checker
LicenseGNU LGPL v2.1+
Websitelanguagetool.org Edit this on Wikidata

LanguageTool is a free and open-source grammar, style, and spell checker, and all its features are available for download.[4][5] The LanguageTool website connects to a proprietary sister project called LanguageTool Premium (formerly LanguageTool Plus), which provides improved error detection for English and German, as well as easier revision of longer texts, following the open-core model.

Overview

[edit]

LanguageTool was started by Daniel Naber for his diploma thesis[6] in 2003 (then written in Python). It now supports 31 languages, each developed by volunteer maintainers, usually native speakers of each language.[7] Based on error detection patterns, rules are created and then tested for a given text. The core app itself is free and open-source and can be downloaded for offline use. Some languages use 'n-gram' data,[8] which is massive and requires considerable processing power and I/O speed, for some extra detections. As such, LanguageTool is also offered as a web service that does the processing of 'n-grams' data on the server-side. LanguageTool "Premium" also uses n-grams as part of its freemium business model.

LanguageTool web service can be used via a web interface in a web browser, or via a specialized client-side plug-ins for Microsoft Office, LibreOffice, TeXstudio, Apache OpenOffice, Vim, Emacs, Firefox, Thunderbird, and Google Chrome.[5]

LanguageTool does not check a sentence for grammatical correctness, but whether it contains typical errors. Therefore, it is easy to invent ungrammatical sentences that LanguageTool will still accept. Error detection succeeds with a variety of rules based on XML or written in Java.[9] XML-based rules can be created using an online form.[10]

More recent developments rely on large n-gram libraries that offer suggestions for improving misspellings with the help of artificial neural networks.[11]

In April 2023 Learneo acquired LanguageTool.[12][13][14][15]

See also

[edit]

References

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
LanguageTool is a free and open-source proofreading software that uses AI to detect and correct grammar, spelling, style, and punctuation errors in texts across more than 30 languages and dialects. Developed as a multilingual tool, it goes beyond basic spell-checking by identifying complex issues such as stylistic inconsistencies, tonal mismatches, and typographical errors, making it suitable for both personal and professional writing enhancement. The software supports integrations with web browsers (via extensions for Chrome, Firefox, Edge, and others), Microsoft Office applications, and standalone desktop clients for Windows and macOS. Initiated in 2003 by Daniel Naber as part of his diploma thesis in at , LanguageTool began as an open-source project focused on international language support and has since evolved into a widely used service with millions of daily users. Naber, the founder and former primary maintainer, collaborated with a global community of over 140 contributors to expand its rule-based and machine learning-driven detection capabilities, with the first stable release occurring in 2005; in 2025, he handed over maintenance to the community, and the project adopted a snapshot-based release model. In 2023, the company behind LanguageTool, based in , , was acquired by Learneo, Inc., to accelerate its AI innovations and global reach, serving over 2,000 organizations including the BMW Group and the . Key to its functionality is a hybrid approach combining XML-defined pattern-matching rules for error detection—allowing non-programmers to contribute—and advanced models for context-aware suggestions, all built on Java 17 and hosted on for community-driven development. LanguageTool offers a free version with core features and a premium tier that includes enhanced paraphrasing, consistency checks, and priority support, amassing over 4 million users and 2 million active browser add-on installations worldwide. Its emphasis on privacy ensures that browser extensions do not store user texts, while enterprise solutions enable customized rule sets for specific industries.

History and Development

Founding and Early Development

LanguageTool originated in 2003 when Daniel Naber developed it as part of his diploma thesis at the Technische Fakultät of in . The project was conceived as a rule-based tool for detecting style and grammar errors in text, addressing limitations in existing spell checkers by incorporating linguistic rules for more sophisticated analysis. Initially implemented in Python, it focused on basic grammar checking capabilities, including and error , with the software released as open-source under the GNU Lesser General Public License. The project gained public visibility in 2004 through its registration on , marking the beginning of broader community involvement. By August 15, 2005, LanguageTool reached its initial public release as version 1.0, primarily supporting grammar rules for English and German to ensure reliable error detection in those languages. This version emphasized cross-platform usability and integrated with tools like , laying the groundwork for its adoption as a assistant. Early development encountered significant challenges in crafting precise rules, especially since many initial contributors were non-native speakers of the target languages, which sometimes led to inaccuracies in nuance detection. To address this, Naber established volunteer-based language teams, comprising linguists and enthusiasts who collaboratively developed and refined XML-based rules for error identification. These teams, numbering around 10 active members by the mid-2000s, played a crucial role in improving rule quality and fostering the tool's evolution into a multilingual resource.

Key Milestones and Acquisitions

LanguageTool achieved a significant milestone in with the expansion of its proofreading capabilities to support over 20 languages, enabling broader multilingual error detection and establishing it as a versatile open-source tool. This development coincided with the availability of its web interface, allowing users to access grammar and style checking online without local installation, which contributed to initial adoption among writers and developers. The tool maintained a consistent release cadence, with stable versions issued approximately every six months to incorporate community-contributed rules and optimizations. In March 2025, with the release of version 6.6, Daniel Naber handed over maintenance responsibilities to Stefan Viol at LanguageTooler , transitioning to a snapshot-based release model to support ongoing development. A notable recent update was version 6.7, released on October 10, 2025, which included refinements to suggestion algorithms, leveraging neural networks for more precise spellchecking and stylistic recommendations in supported languages. These enhancements built on earlier neural integrations, allowing for better handling of ambiguous phrasing through probabilistic modeling. In April 2023, LanguageTool was acquired by Learneo, Inc., marking a pivotal shift from its origins as a fully volunteer-driven open-source project—initiated in 2003 by a of linguists and developers—to a hybrid model combining professional engineering resources with ongoing community contributions. This acquisition facilitated accelerated innovation in AI-driven features while preserving the tool's open-source core, enabling integration with Learneo's suite of writing and learning platforms. User growth reflected these advancements, expanding from a few thousand active contributors and early adopters in the mid-2000s to millions of users by 2025, fueled by the 2010 launch and subsequent browser extensions that reached over 3 million Chrome users alone. The surge was further propelled by integrations into productivity tools and the rise of , positioning LanguageTool as a staple for multilingual writing assistance.

Core Functionality

Error Detection Mechanisms

LanguageTool detects errors across multiple categories, including , , , style, tonality, and , primarily through a that combines with part-of-speech (POS) tagging for contextual analysis. Rules are defined in XML format, allowing matches against specific word sequences, POS tags (e.g., or verb forms), and regular expressions to identify issues such as subject-verb agreement errors or inconsistent usage. Spelling errors are handled via integration with dictionaries like , while style and tonality checks target overuse of passive constructions or informal phrasing in formal contexts. For more probabilistic error detection, LanguageTool leverages an approximately 8 GB n-gram dataset derived from Google's n-gram collection, which analyzes word sequence probabilities to flag confusable terms in context, such as "their" versus "there" based on surrounding phrases. This method supports up to three-word n-grams and enhances accuracy for idiomatic or collocation-based errors in languages like English, German, French, and Spanish. Context-aware suggestions go beyond simple fixes by recommending rephrasings for improved clarity or formality; for instance, the tool can identify overuse of in sentences like "The report was written by the team" and suggest "The team wrote the report" to promote . Typography errors, such as improper hyphenation or spacing, are caught through pattern rules that enforce consistency. The system supports offline operation via its desktop application, which requires a download of about 252 MB for the standalone version, enabling local rule-based checks without internet connectivity. In comparison, cloud-based processing provides access to the full n-gram dataset and additional AI-driven analysis for more nuanced suggestions.

Language Support and Coverage

LanguageTool currently supports 31 languages and dialects, encompassing a wide range of linguistic diversity through its open-source framework. This includes comprehensive grammar checking for major languages such as English (with variants for US, UK, Canada, Australia, New Zealand, and South Africa), German, French, Spanish, Dutch, and Portuguese, where the tool performs advanced error detection beyond basic spelling. Partial support is available for other languages, including Russian and Arabic, which feature grammar rules but with fewer advanced checks compared to the primary languages. The development of language-specific rule sets is driven by a community of volunteer native speakers who contribute expertise to ensure cultural and idiomatic accuracy. In total, 143 contributors have participated in building and maintaining the technology, focusing on tailoring rules to the nuances of each language. This collaborative effort leverages the to adapt checks for syntactic, semantic, and stylistic elements unique to individual languages. The depth of analysis varies significantly across supported languages, reflecting resource allocation and community maturity. For top languages like English and German, LanguageTool offers deep checks including style suggestions, tonality adjustments, and confusion pair resolutions to enhance clarity and professionalism. In contrast, emerging or less-resourced languages, such as Swedish or Chinese, primarily provide basic and corrections, with limited style or advanced analysis due to fewer rules (e.g., 32 XML rules for Swedish). This tiered approach ensures broad while prioritizing robust support for high-demand languages. Community feedback plays a crucial role in ongoing enhancements, with over 20 million texts improved daily across all supported languages. Users report errors and suggest refinements through platforms like and the LanguageTool community forum, enabling iterative updates to rule sets and expanding coverage for underrepresented languages. This feedback loop has facilitated steady growth, with recent activity showing hundreds of rule changes in languages like Catalan and over the past six months.

Technical Architecture

Rule-Based System

LanguageTool's rule-based system forms the foundational engine for its error detection capabilities, implemented primarily in to ensure cross-platform compatibility and performance. This architecture processes input text by first tokenizing it into sentences and words, applying , and then applying rules defined in XML format for precise . Rules are stored in language-specific files, such as grammar.xml, where each rule specifies an error pattern, a corrective message, and examples for validation. This declarative approach allows for modular extension without altering the core codebase. The XML rules support flexible through <token> elements that can target exact words, lemmas, part-of-speech tags, or regular expressions, enabling detection of syntactic and stylistic issues. For instance, a simple regex-based rule might identify redundant prepositions by matching patterns like <token regexp="yes">in</token> <token>the</token> <token regexp="yes">beginning</token>, flagging phrases such as "in the beginning of" and suggesting "at the beginning of" as a correction. More complex rules incorporate logical operators like OR (|), (<exception> or ^), and antipatterns to avoid false positives, such as excluding matches within quotes or specific contexts. This regex integration, combined with linguistic annotations, allows rules to handle nuances like subject-verb agreement or idiomatic expressions efficiently. To facilitate community involvement, LanguageTool provides an online rule editor at community.languagetool.org/ruleEditor2, where volunteers can create, test, and refine rules interactively without needing to write code or compile the software. The editor generates XML output directly, simulating matches against sample sentences and offering previews of error highlighting and suggestions, which streamlines contributions for , style, and locale-specific checks across supported languages. For efficient parsing of text structures like sentences and clauses, the system leverages finite-state automata, particularly in components such as morphological analysis and dictionary lookups, to process tokenized input rapidly and scale to large documents. This approach compiles patterns and linguistic data into compact state machines, minimizing computational overhead during rule application. The is distributed under the GNU Lesser General Public License (LGPL) version 2.1 or later, which permits users to freely modify the source code, integrate it into other applications, and host custom servers for private or enterprise use while requiring that modifications remain open if redistributed.

AI and Machine Learning Integration

LanguageTool began incorporating artificial neural networks around 2017 to improve the detection of subtle style issues, such as word confusion pairs, by leveraging embeddings for contextual disambiguation in English, German, and . These neural components augment the core by providing adaptive suggestions based on learned patterns from large text corpora, enabling better handling of nuanced errors that fixed rules might overlook. The models are trained on extensive sentence corpora to capture contextual relationships and linguistic patterns, facilitating advanced features like , which distinguishes between formal, informal, fluent, or concise tones to offer tailored rephrasing suggestions. For instance, the system can rewrite informal phrases like "bummed out" to more professional alternatives such as "disappointed," enhancing overall text suitability for specific audiences. In recent developments, LanguageTool's AI-powered paraphrasing tool supports sentence-level rewriting to improve fluency, formality, or brevity, available primarily through Premium accounts with unlimited access in supported languages including English, German, French, Spanish, Dutch, Polish, and . Free users are limited to three rephrases per day and a 2,000-character text limit, while basic neural suggestions for error detection remain accessible in the open-source version without premium restrictions. This integration builds upon the rule-based foundation to deliver more sophisticated, context-aware enhancements, though advanced rewriting remains gated behind subscriptions.

Deployment and Integrations

Platforms and Accessibility

LanguageTool offers a dedicated desktop application compatible with Windows, macOS, and operating systems, enabling users to perform , , and style checks directly within various writing environments. The application operates in offline mode, requiring no connection once installed, which allows for local processing of texts to ensure and in disconnected settings. This standalone version leverages for cross-platform support, integrating seamlessly with text editors and providing real-time suggestions without relying on external servers. For web-based browsing, LanguageTool provides extensions for major browsers including , Chrome, Edge, , and , which automatically detect and correct errors in online forms, emails, and web applications. As of 2025, these browser add-ons have over 2 million active installations, reflecting widespread adoption among users seeking on-the-go proofreading capabilities. The extensions support both free and premium tiers, with the free version offering core functionality across supported languages. LanguageTool also offers an app for mobile . LanguageTool extends its reach through integrations with popular productivity tools, including , , and , where it functions as an add-on to enhance document editing workflows. Additionally, it connects with email clients such as , Outlook, and , allowing users to review and refine messages before sending. These integrations enable contextual error highlighting and corrections within native interfaces, improving efficiency for professional and personal writing tasks. Users can access LanguageTool via its web-based service at languagetool.org, which provides a straightforward interface for pasting and checking texts without installation. For those prioritizing data privacy, the open-source nature of LanguageTool supports self-hosting, where individuals or organizations can deploy a local server instance to process checks entirely on their own infrastructure. This option eliminates reliance on cloud services and accommodates custom configurations for enterprise use.

Premium Enhancements

LanguageTool Premium serves as a extension of the open-source project, launched in 2017 as LanguageTool Plus to provide advanced capabilities. It introduces deeper AI-driven checks, particularly for English and German, enabling unlimited sentence paraphrasing and sophisticated style improvements such as detecting repetition, wordiness, and informal phrasing. The service operates on a subscription-based model, with individual plans available at approximately €5.83 per month when billed annually (€69.90/year) or €19.90 per month for monthly billing, alongside team options starting at €11.07 per month for two users when billed annually (€132.80/year), as of November 2025. It is trusted by hundreds of thousands of across industry-leading companies for enhancing writing quality in professional contexts. Key exclusive features distinguish Premium from the free edition, including support for text lengths up to 100,000 characters per check and priority customer support with a 14-day money-back guarantee. These enhancements, along with access to over 20,000 additional rules for languages like English, German, French, and Spanish, are not available in the open-source version. While building directly on the core open-source codebase for fundamental grammar and spelling detection, Premium integrates proprietary, non-open-licensed AI models to deliver its specialized rephrasing and style analysis functionalities.

Community and Impact

Open-Source Contributions

LanguageTool's open-source nature has fostered a global community of contributors, with its primary repository hosted at github.com/languagetool-org/languagetool, where 143 developers worldwide have participated in enhancing the proofreading software. This repository serves as the central hub for code, rules, and documentation, enabling collaborative development of the tool's core functionality in Java. Contributions to LanguageTool emphasize community involvement, particularly through the submission of grammar and style rules. Users can propose new rules using the online Rule Editor, which generates XML-formatted rules ready for review and integration, or submit changes directly via pull requests on . These processes prioritize input from native speakers to ensure linguistic accuracy and cultural relevance in rule development. Following its acquisition by Learneo, Inc. in April 2023, LanguageTool adopted a remote-first team structure, combining professional linguists and experts with ongoing volunteer contributions from the open-source community. This hybrid model supports daily improvements to the tool while maintaining transparency through open-source practices. In 2025, the project switched to a snapshot-based release model in March and released a new Mac app in , further enhancing community-driven development. Comprehensive documentation at dev.languagetool.org guides developers in extending the Java-based , from adding custom rules to integrating new features. Language-specific enhancements, such as refined error detection for individual languages, often stem from these collaborative efforts.

Usage and Reception

LanguageTool has garnered widespread adoption, with over 4 million users worldwide relying on it for and style enhancement. It sees strong uptake in educational settings, where students utilize it for drafting theses and academic papers, supported by dedicated discounts for educational users. In , the tool integrates seamlessly into workflows via browser extensions and desktop applications, aiding tasks in environments like , Outlook, and collaborative platforms such as . The tool's reception is largely positive, particularly for its robust multilingual capabilities covering over 30 languages and its open-source foundation, which allows for community-driven improvements and self-hosting options. Reviews on Add-ons praise its automatic detection of complex errors beyond basic spelling, earning a 4.7 out of 5 rating from over 4,600 users. Similarly, users highlight its ease of integration with Chrome and effective language detection, contributing to an overall 4.2 out of 5 score from 45 verified reviews. Criticisms include occasional false positives in error detection and suggestions that can lead to user confusion, as noted in feedback where reviewers reported inaccuracies in checks. Additionally, while multilingual support is a strength, the depth of rules and AI-driven features is sometimes limited for non-major languages compared to English, resulting in less comprehensive coverage. LanguageTool's impact extends to specialized integrations, such as in author workflows recommended by Kindlepreneur for manuscripts, and corporate adoption by over 2,000 organizations, including Group and the , as of 2025. This broad usage underscores its role in enhancing communication across diverse sectors, bolstered by its acquisition by Learneo in 2023, which facilitated expanded scalability.

References

Add your contribution
Related Hubs
User Avatar
No comments yet.