Question

What is a large language model?

Accepted Answer

A large language model is a neural network trained on massive text datasets to predict the next token in a sequence. Modern LLMs are typically decoder-only transformers with billions or trillions of parameters. They learn statistical patterns of language during pretraining and can then generate text, answer questions, write code, and reason about problems through prompting.