This work investigates socio-economic disparities and reduced utility for non-English speakers in the use of large language models (LLMs). We use the FLORES-200, Ethnologue, and World Development Indicators datasets to analyze the socio-economic disparities, and OpenAI's GPT-4 API to assess the reduced utility of LLMs for non-English speakers.
Name |
---|
Mozilla Public License |
Repository name | Type | URI |
---|---|---|
double-jeopardy-in-llms | GitHub | https://github.com/worldbank/double-jeopardy-in-llms/tree/main |
This work was developed on a MacBook Pro with an M1 Pro processor and 64GB of RAM. No GPU is needed for the computations.
Access to the OpenAI API is required.
Some of the notebooks are not publicly available because they handle proprietary data from Ethnologue. One of these notebooks computes the adjusted population from Ethnologue's historical figures and annual population growth rates.
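Because that notebook is not public, the exact adjustment it performs is not documented here. As a rough illustration only, one common way to carry a historical population figure forward is to compound it by each year's growth rate. The function name `adjust_population`, its inputs, and the compounding rule itself are assumptions made for this sketch, not the authors' code.

```python
from typing import Sequence


def adjust_population(base_population: float,
                      annual_growth_rates: Sequence[float]) -> float:
    """Compound a historical population figure forward, one year per rate.

    Hypothetical helper: rates are fractions (0.02 means 2% per year) and
    are listed in order from the year after the historical figure onward.
    """
    adjusted = base_population
    for rate in annual_growth_rates:
        # Each year multiplies the running total by (1 + growth rate).
        adjusted *= 1.0 + rate
    return adjusted
```

For example, `adjust_population(1000.0, [0.10, 0.10])` compounds two years of 10% growth. If the growth rates are reported in percent, divide them by 100 before passing them in.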
This repository uses poetry to manage dependencies. To install the dependencies, run the following command:
`poetry install`
To review the list of dependencies, please refer to the pyproject.toml file.
VS Code / Cursor users can use the Python extension to run the notebooks.
Use the following command to spin up a local Jupyter server:
`poetry run jupyter notebook`
It is recommended to use a virtual environment to run the code.
Additionally, the notebooks/compute-premium-costs.ipynb notebook uses the OpenAI API. To use the API, you need to set the OPENAI_API_KEY environment variable. You can create a .env file in the root of the repository and add the following:
`OPENAI_API_KEY=<your-openai-api-key>`
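How the notebook reads the .env file is not stated here. The following is a minimal sketch, using only the Python standard library, of loading `KEY=VALUE` pairs from such a file into the process environment. The helper name `load_env_file` is hypothetical; a package such as python-dotenv offers similar behavior, but whether this repository depends on it is an assumption to check against pyproject.toml.

```python
import os
from pathlib import Path


def load_env_file(path: str = ".env") -> dict:
    """Read simple KEY=VALUE lines and export them to os.environ.

    Hypothetical helper for illustration. Variables that are already set
    in the environment are left unchanged.
    """
    loaded = {}
    env_path = Path(path)
    if not env_path.is_file():
        return loaded  # No .env file: nothing to load.
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        value = value.strip().strip("'\"")  # Drop surrounding quotes.
        loaded[key] = value
        os.environ.setdefault(key, value)
    return loaded
```

Calling `load_env_file()` from the repository root before any API client is created makes `os.environ["OPENAI_API_KEY"]` available to the rest of the notebook. Keep the .env file out of version control so the key is not committed.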
Please cite our paper as follows when referencing this work.
@misc{solatorio2024doublejeopardyclimateimpact,
title={Double Jeopardy and Climate Impact in the Use of Large Language Models: Socio-economic Disparities and Reduced Utility for Non-English Speakers},
author={Aivin V. Solatorio and Gabriel Stefanini Vicente and Holly Krambeck and Olivier Dupriez},
year={2024},
eprint={2410.10665},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2410.10665},
}
Name | Affiliation |
---|---|
Aivin V. Solatorio | World Bank |
Gabriel Stefanini Vicente | World Bank |
Holly Krambeck | World Bank |
Olivier Dupriez | World Bank |
2024-10
Location | Code |
---|---|
World | WLD |
Name | URI |
---|---|
Mozilla Public License | https://www.mozilla.org/en-US/MPL/ |
Name |
---|
John Doe |
2025-03-14