Artificial intelligence programs designed to process and generate text show remarkably high verbal reasoning abilities, but they struggle with visual and numerical puzzles. New research evaluating a variety of commercial and open-source models on traditional intelligence quotient tests revealed wide gaps in performance depending on the format of the questions. The findings were published in Computers in Human Behavior: Artificial Humans.
Large language models are computer algorithms trained…
