IN AI, IS BIGGER BETTER?. Ananthaswamy, A. Nature, 615(7951):202–205, March, 2023. Place: London Publisher: Nature Publishing Group
IN AI, IS BIGGER BETTER? [link]Paper  doi  abstract   bibtex   
In one early test of its reasoning abilities, ChatGPT scored just 26% when faced with a sample of questions from the 'MATH' data set of secondary-school-level mathematical problems1. The Minerva results hint at something that some researchers have long suspected: that training larger LLMs, and feeding them more data, could give them the ability, through pattern-recognition alone, to solve tasks that are supposed to require reasoning. [...]these models have major downsides. Besides concerns that their output cannot be trusted, and that they might exacerbate the spread of misinformation, they are expensive and suck up huge amounts of energy. In some instances, multiple power laws can govern how performance scales with model size, the researchers say.
@article{ananthaswamy_ai_2023,
	title = {{IN} {AI}, {IS} {BIGGER} {BETTER}?},
	volume = {615},
	issn = {00280836},
	url = {https://www.proquest.com/scholarly-journals/ai-is-bigger-better/docview/2786242522/se-2?accountid=14542},
	doi = {10.1038/d41586-023-00641-w},
	abstract = {In one early test of its reasoning abilities, ChatGPT scored just 26\% when faced with a sample of questions from the 'MATH' data set of secondary-school-level mathematical problems1.

The Minerva results hint at something that some researchers have long suspected: that training larger LLMs, and feeding them more data, could give them the ability, through pattern-recognition alone, to solve tasks that are supposed to require reasoning.

[...]these models have major downsides.

Besides concerns that their output cannot be trusted, and that they might exacerbate the spread of misinformation, they are expensive and suck up huge amounts of energy.

In some instances, multiple power laws can govern how performance scales with model size, the researchers say.},
	language = {English},
	number = {7951},
	journal = {Nature},
	author = {Ananthaswamy, Anil},
	month = mar,
	year = {2023},
	note = {Place: London
Publisher: Nature Publishing Group},
	keywords = {Artificial intelligence, Language, Reasoning, Datasets, Canada, Environmental Studies, Mathematical models, Montreal Quebec Canada, Pattern recognition, Power, Researchers},
	pages = {202--205},
	annote = {Copyright - Copyright Nature Publishing Group Mar 9, 2023},
	annote = {SubjectsTermNotLitGenreText - Montreal Quebec Canada; Canada},
	annote = {Última actualización - 2023-03-13},
}

Downloads: 0