LogoLogo
Tabnine websiteContact Sales
  • đź‘‹Welcome
    • Overview
      • Architecture
        • Deployment options
      • Security
      • Privacy
      • Protection
        • Provenance and Attribution
      • Personalization
        • Tabnine’s Personalization in Depth
        • Connection: Global Codebase Awareness
      • AI Models
        • Tabnine's private and protected Universal models
        • Tabnine's fine-tuned AI models
      • Integrations
        • Atlassian Jira
      • System Requirements
      • Supported Languages
      • Supported IDEs
      • Tabnine Subscription Plans
        • Dev Preview
        • Dev
        • Enterprise (SaaS)
        • Enterprise (private installation)
    • Support & Feedback
  • 🚀Getting started
    • Install
      • Client setup (SaaS)
        • VS Code
          • Install Tabnine in VS Code
          • Activate Tabnine in VS Code
        • JetBrains IDEs
          • Install Tabnine in a JetBrains IDE
          • Activate Tabnine in a JetBrains IDE
        • Visual Studio
          • Install Tabnine in Visual Studio 2022
          • Activate Tabnine in Visual Studio 2022
        • Eclipse
          • Install Tabnine in Eclipse
          • Activate Tabnine in Eclipse
        • Sign in
          • Using an email
          • Using an authentication token
      • Client setup (private installation)
        • Join your team (private installation)
        • VS Code (private installation)
        • JetBrains IDEs (private installation)
        • Visual Studio (private installation)
        • Eclipse (private installation)
    • Quickstart Guide
      • Menus and Icons
    • Getting the Most from Tabnine Code Completions
      • Code Acceptance Logs
      • Pause (snooze)
    • Getting the Most from Inline Actions
    • Getting the Most from Tabnine Chat
      • Launching Tabnine Chat
      • Interacting with Tabnine Chat
      • Reviewing suggestions
      • Writing prompts
      • Chat Context
        • Understanding Context
        • Jira Connection
        • Context Scoping
      • Conversing with Tabnine Chat
      • Switching between chat AI models
      • Image Prompts
      • Tabnine's Prompting Guide
        • Getting Started
        • Basic Prompting
          • Be specific and clear
          • Define the context
          • Start a fresh conversation as appropriate
          • Include necessary details
          • Ask for examples
          • Be concise but complete
  • đź’ŞSoftware Development with Tabnine
    • Overview
    • Plan
    • Create
    • Test
      • Intro to the Test Agent
      • Test Agent Workflow
      • Custom Commands
      • Generate Test Files with @Mentions
    • Review
    • Fix
    • Document
    • Explain
    • Maintain
  • 🏭Administering Tabnine
    • Start a team
    • Manage a team
    • SaaS
      • Enterprise (SaaS) team admin
        • Set up a Tabnine Enterprise (SaaS) account
        • Invite team members
        • Manage your team
        • AI models for Chat (Enterprise SaaS)
      • Enterprise (SaaS) team member
        • Join your Tabnine team by invitation email (team member)
        • Join Tabnine team by link (member)
    • Private installation
      • Server setup guide
        • Kubernetes (MicroK8s) Installation guide
        • Deployment guide
          • Tabnine update guide
        • Air-gapped deployment guide
      • Admin guide
        • Monitoring Tabnine
        • Prometheus Operator install
        • Audit logs
      • Managing your team
        • Tabnine teams
        • Roles in an enterprise
        • Inviting users to your team
        • Deactivating and reactivating users
        • Deleting PII data of a deactivated user
        • Reset user's password
        • Usage reports
          • Reports Glossary
          • CSV-based reports (V2)
            • Configuring scheduled CSV reports
            • CSV-based reports V1 (Depracted since version 5.7.0
          • Usage API
        • Settings
          • General
          • Single Sign-On (SSO)
          • Personalization (f.k.a. Workspace)
            • Connecting to External Code Repositories
          • Email
          • License
          • Models
          • Access Tokens
        • IdP Sync
      • Release Notes
  • 📣Product Updates
    • What's new?
      • What's new? (August 2024)
      • What's new? (July 2024)
      • What's new? (June 2024)
      • What's new? (May 2024)
      • What's new? (April 2024)
      • What's new? (March 2024)
      • What's new? (February 2024)
      • What's new? (January 2024)
Powered by GitBook
On this page
  • Tabnine’s Universal models: State-of-the-art, permissive, open source LLMs
  • Tabnine’s Universal models are trained on open source code with permissive licenses

Was this helpful?

  1. Welcome
  2. Overview
  3. AI Models

Tabnine's private and protected Universal models

Last updated 1 year ago

Was this helpful?

Tabnine’s Universal models: State-of-the-art, permissive, open source LLMs

These models are the result of our involvement in the open source LLM community.

We're regular contributors to many of the main open source LLM models. This gives us an advantage when selecting the best ones out there.

Our clients trust us to curate every option in the market for them. We assure them that each time we update the Universal model, it includes the latest and greatest. Being good open source citizens also allows us to have our proprietary add-on ready for any model. Our enhancement is model-agnostic.

Tabnine’s Universal models are trained on open source code with permissive licenses

Tabnine’s generative AI only uses open source code with permissive licenses for our Universal model trained on public code.

The licenses are as follows: MIT, MIT-0, Apache-2.0, BSD-2-Clause, BSD-3-Clause, Unlicense, CC0-1.0, CC-BY-3.0, CC-BY-4.0, RSA-MD, 0BSD, WTFPL, ISC.

We attribute all code suggested to the full list of codebases used for training our model. This list is publicly available and regularly updated in our .

We can guarantee that at the time of training the model, the licenses used belong to the above list only. We never train on data that isn't permissive to ensure that our clients aren’t at risk.

This decision has painful implications for Tabnine since it limits the amount of data with which our models can be trained. But our users’ peace of mind comes first, at any cost. Using only permissively licensed code, we guarantee our users can safely use the code that Tabnine generates in commercial projects without any license compliance uncertainty. Moreover, we're good open source users. By training our model on code with permissive licenses only, we adhere to and fully respect the original intent of those who contributed code to open source.

This applies to all the ways you can use Tabnine: code completions or chat, for any deployment (secure SaaS or private installations), and for any model (Universal or private model).

With regards to code privacy, the same principles still apply to Chat. We use a data set licensed as Apache-2, free from copyrighted materials. This, again, makes all Tabnine Chat suggestions safe to use.

For Chat we additionally use the English data set. If the model happens to recommend code that was already in its training data, Tabnine users don't need to worry. The code is under a permissive license, making Tabnine's code suggestions safe to use.

đź‘‹
Trust Center