Value Alignment

Our research explores ethical dilemmas through diverse prompt engineering.

A person with long hair wearing a red sweater is observing a whiteboard. On the whiteboard, there is an arrangement of multi-colored sticky notes with various words and phrases related to company values, such as 'equity', 'teamwork', 'respect', and 'diversity'. The words '# COMPANY VALUES' are prominently written at the top.
A person with long hair wearing a red sweater is observing a whiteboard. On the whiteboard, there is an arrangement of multi-colored sticky notes with various words and phrases related to company values, such as 'equity', 'teamwork', 'respect', and 'diversity'. The words '# COMPANY VALUES' are prominently written at the top.
Model Evaluation

We evaluate GPT-4 and GPT-3.5 using a customized value alignment index.

A computer screen displaying a webpage about ChatGPT, focusing on optimizing language models for dialogue. The webpage has text describing the model and includes the OpenAI logo. The background is green with some purple graphical elements on the side.
A computer screen displaying a webpage about ChatGPT, focusing on optimizing language models for dialogue. The webpage has text describing the model and includes the OpenAI logo. The background is green with some purple graphical elements on the side.
Fine-Tuning

Selected models are fine-tuned to optimize alignment with specific value systems.

Aligning Values Together

Exploring ethical dilemmas through innovative research and multidisciplinary collaboration for responsible AI development.

A 3D rendering of a microchip with the letters 'AI' prominently displayed on its surface, set on a dark, circular platform.
A 3D rendering of a microchip with the letters 'AI' prominently displayed on its surface, set on a dark, circular platform.
Two people are standing in front of a whiteboard containing various colorful sticky notes. The notes are arranged in a grid and each one has a word or phrase related to company values, such as 'Culture', 'Vision', 'Diversity', and 'Respect'. One person is gesturing towards the board, wearing a blue sweater, while the other is observing attentively, dressed in a red jacket.
Two people are standing in front of a whiteboard containing various colorful sticky notes. The notes are arranged in a grid and each one has a word or phrase related to company values, such as 'Culture', 'Vision', 'Diversity', and 'Respect'. One person is gesturing towards the board, wearing a blue sweater, while the other is observing attentively, dressed in a red jacket.