Embers of autoregression show how large language models are shaped by the problem they are trained to solve.
McCoy, R Thomas; Yao, Shunyu; Friedman, Dan; Hardy, Mathew D; Griffiths, Thomas L.
Affiliation
  • McCoy RT; Department of Computer Science, Princeton University, Princeton, NJ 08542.
  • Yao S; Department of Computer Science, Princeton University, Princeton, NJ 08542.
  • Friedman D; Department of Computer Science, Princeton University, Princeton, NJ 08542.
  • Hardy MD; Department of Psychology, Princeton University, Princeton, NJ 08542.
  • Griffiths TL; Department of Computer Science, Princeton University, Princeton, NJ 08542.
Proc Natl Acad Sci U S A; 121(41): e2322420121, 2024 Oct 08.
Article in En | MEDLINE | ID: mdl-39365822
ABSTRACT
The widespread adoption of large language models (LLMs) makes it important to recognize their strengths and limitations. We argue that to develop a holistic understanding of these systems, we must consider the problem that they were trained to solve: next-word prediction over Internet text. By recognizing the pressures that this task exerts, we can make predictions about the strategies that LLMs will adopt, allowing us to reason about when they will succeed or fail. Using this approach, which we call the teleological approach, we identify three factors that we hypothesize will influence LLM accuracy: the probability of the task to be performed, the probability of the target output, and the probability of the provided input. To test our predictions, we evaluate five LLMs (GPT-3.5, GPT-4, Claude 3, Llama 3, and Gemini 1.0) on 11 tasks, and we find robust evidence that LLMs are influenced by probability in the hypothesized ways. Many of the experiments reveal surprising failure modes. For instance, GPT-4's accuracy at decoding a simple cipher is 51% when the output is a high-probability sentence but only 13% when it is a low-probability one, even though this is a deterministic task for which probability should not matter. These results show that AI practitioners should be careful about using LLMs in low-probability situations. More broadly, we conclude that we should not evaluate LLMs as if they are humans but should instead treat them as a distinct type of system, one that has been shaped by its own particular set of pressures.
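To make concrete why the cipher result is surprising, the following is a minimal Python sketch of a deterministic shift-cipher decode. The abstract says only "a simple cipher"; the use of a rot-13-style cipher here is an assumption for illustration, not a claim about the paper's exact setup. The point it demonstrates is that the correct output is fully determined by the input, so the plaintext's probability is irrelevant to any system that simply applies the rule.

    # Minimal sketch (assumed rot-13 shift cipher, for illustration only).
    # Decoding is deterministic: the rule recovers the plaintext exactly,
    # whether the plaintext is a high- or low-probability sentence.
    import codecs

    def rot13_decode(ciphertext: str) -> str:
        """Decode a rot-13 shift cipher: rotate each letter 13 places back."""
        return codecs.decode(ciphertext, "rot13")

    # A fluent (high-probability) sentence and a shuffled (low-probability) one.
    high_prob = "The quick brown fox jumps over the lazy dog."
    low_prob = "Dog lazy the over jumps fox brown quick the."

    for sentence in (high_prob, low_prob):
        encoded = codecs.encode(sentence, "rot13")
        assert rot13_decode(encoded) == sentence  # the rule never fails
        print(encoded, "->", rot13_decode(encoded))

Because the rule-based decoder succeeds on both sentences identically, any gap between the 51% and 13% accuracies reported for GPT-4 must come from the model's sensitivity to output probability rather than from the task itself.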
Subjects
Keywords

Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Language Limits: Humans Language: En Journal: Proc Natl Acad Sci U S A / Proc. Natl. Acad. Sci. U. S. A / Proceedings of the National Academy of Sciences of the United States of America Publication year: 2024 Document type: Article Country of publication: United States
