Using Large Language Models to Help Train Machine Learning SDG Classifiers

United Nations Department of Economic and Social Affairs; Marcelo T. LaFleur

doi:10.18356/25206656-180

Authors: United Nations Department of Economic and Social Affairs and Marcelo T. LaFleur
Source: UN Department of Economic and Social Affairs (DESA) Working Papers, 30 Nov. 2023, 18 pages
DOI: https://doi.org/10.18356/25206656-180
Language: English

Abstract

This paper proposes the use of synthetic training data generated by large language models to improve machine learning SDG classifiers. It shows that supplementing existing training data with synthetic data produced by the ChatGPT tool improves the performance of the SDGClassy classifier. This addition of synthetic data is especially useful in building SDG classifiers given the limited availability of properly labeled data and the complex, interconnected nature of the SDGs. Synthetic data thus enable more effective machine-learning applications in this context.

Sustainable Development Goals:

Partnerships for the Goals

Related Topics: Economic and Social Development

JEL: C88: Mathematical and Quantitative Methods / Data Collection and Data Estimation Methodology ; Computer Programs / Other Computer Software ; O20: Economic Development, Innovation, Technological Change, and Growth / Development Planning and Policy / General


Published online: 30 Nov. 2023
