Definition of BERT

Bidirectional Encoder Representations from Transformers (BERT) is a language representation model that pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers. Because the pre-trained representations are general-purpose, BERT can be fine-tuned for a wide range of downstream natural language processing tasks, such as question answering and language inference, by adding just one output layer rather than building a task-specific architecture.
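To make "jointly conditioning on both left and right context" concrete, the sketch below compares a toy self-attention layer in two modes: one where every token sees the whole sequence (the bidirectional setting described above) and one where each token sees only itself and the tokens to its left. It is an illustration under simplifying assumptions, not BERT's actual implementation: the function names `self_attention` and `first_token_changes` are hypothetical, the token vectors are made up, and the layer uses identity projections with no learned weights.

```python
import math


def self_attention(vectors, bidirectional=True):
    """Toy single-head self-attention with identity projections (no learned weights)."""
    n = len(vectors)
    dim = len(vectors[0])
    outputs = []
    for i in range(n):
        # Positions token i may attend to: every position (bidirectional)
        # or only positions 0..i (left-to-right).
        visible = range(n) if bidirectional else range(i + 1)
        scores = [
            sum(a * b for a, b in zip(vectors[i], vectors[j])) / math.sqrt(dim)
            for j in visible
        ]
        # Numerically stable softmax over the visible positions.
        peak = max(scores)
        weights = [math.exp(s - peak) for s in scores]
        total = sum(weights)
        # Output for token i is the attention-weighted average of visible vectors.
        outputs.append([
            sum(w / total * vectors[j][d] for w, j in zip(weights, visible))
            for d in range(dim)
        ])
    return outputs


def first_token_changes(bidirectional):
    """Report whether editing only the LAST token alters the FIRST token's output."""
    original = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    edited = [[1.0, 0.0], [0.0, 1.0], [-1.0, 2.0]]  # only the last vector differs
    before = self_attention(original, bidirectional)[0]
    after = self_attention(edited, bidirectional)[0]
    return any(abs(x - y) > 1e-9 for x, y in zip(before, after))


if __name__ == "__main__":
    print("bidirectional:", first_token_changes(True))
    print("left-to-right:", first_token_changes(False))
```

In the bidirectional mode the first token's representation depends on a token that appears after it, while in the left-to-right mode it does not. The real model stacks many such layers with learned projections, so this only shows the direction of information flow.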

Updated 2026-06-12

Tags

Data Science

Foundations of Large Language Models

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

D2L

Dive into Deep Learning @ D2L
