MemotivaLLM Engineer Interview Questions: Transformer Architecture, Self-Attention, and Modern LLM Foundations

What is a large language model?

LLM Engineer Interview Questions: Transformer Architecture, Self-Attention, and Modern LLM Foundations

Audio flashcard · 0:20

Nortren·

What is a large language model?

0:20

A large language model is a neural network trained on massive text datasets to predict the next token in a sequence. Modern LLMs are typically decoder-only transformers with billions or trillions of parameters. They learn statistical patterns of language during pretraining and can then generate text, answer questions, write code, and reason about problems through prompting.
en.wikipedia.org