In recent years, there has been a boom in various applications implementing artificial intelligence systems. Nowadays, the most striking representatives of AI are chatbots. The most popular of them is ChatGPT, developed by the Microsoft company groups. Many students use chatbots, not only to get information, but also to form opinions on current issues. Chatbots have spread rapidly all over the world; the leading IT corporations have created their own versions. Similar developments have appeared in the U.S., China, Israel, Russia, India, etc. These countries differ in culture, education. and politics. That`s why we were interested in the issue of the ideology component of the answers provided by chatbots from various countries. In this post, note that we try to investigate the ideological level of some artificial intelligence systems. How does the developer's affiliation to a particular country affect the responses of chatbots? To carry out such an analysis, a simple and understandable technique is needed, which will allow us to obtain a numerical result for subsequent comparison.
The U.S. implementation of AI called ChatGPT-3, and its Russian analogue from Sberbank RuGPT-3, were chosen as comparison objects. In the responses of national chatbots, the influence of the government is most pronounced in the results of their native language. It's this feature that forms the basis of this rating, which evaluates the presence of an alternative opinion in AI responses.
Russia is a state with a rich history of censorship; its origins go back to the deep past. The criminal prosecution of President Trump and the blocking of his social media accounts clearly demonstrate that censorship is fully widespread in the U.S. The Elon Musk publications of documents on Twitter censorship is confirmation of this fact.
Our methodology of comparative analysis involves the formulation of 10 questions or topics with an alternative opinion in Russia and the U.S. The wording of these questions is identical in Russian and English. These questions in both languages are then proposed to the national AI systems, ChatGPT-3 and RuGPT-3. The chatbots' answers to these questions are then analysed.
Rating is performed for each response. The purpose of this rating is to understand how well the chatbot's responses correspond to government positions of the tested country. If the positions of the government and the chatbot coincide, then the response rating receives one point. If the chatbot's position is neutral, zero is awarded. If the positions are opposite, then this response is assigned a minus-one point.
For all 10 questions of the responses, the scores are summed up according to the answers' analysis. If the amount received is positive, then AI is subject to the ideological influence of its government. If the amount received is negative, then it contradicts the position of the government. Zero means that there is no ideology in the responses of these chatbots at all.
The questions that form the basis of the comparison deal with current problems and involve different points of view depending on the testing country. A list of tested questions is given below:
All the questions are numbered, and the rating of answers to them is included in the following Table.
Table 1. Chatbot Response rating
Question number |
Answer evaluation |
|||
ChatGPT-3 (Rus) |
ChatGPT-3 (Eng) |
RuGPT-3 (Rus) |
RuGPT-3 (Eng) |
|
1 |
1 |
1 |
-1 |
-1 |
2 |
1 |
1 |
0 |
0 |
3 |
1 |
1 |
-1 |
-1 |
4 |
1 |
1 |
1 |
0 |
5 |
1 |
1 |
-1 |
-1 |
6 |
1 |
1 |
-1 |
-1 |
7 |
1 |
1 |
1 |
1 |
8 |
1 |
1 |
-1 |
-1 |
9 |
1 |
1 |
0 |
0 |
10 |
-1 |
-1 |
1 |
1 |
Total |
8 |
8 |
-2 |
-3 |
Testing data shows that Microsoft's AI (ChatGPT-3) almost completely coincides with the position of the U.S. government on the most burning global problems. Perhaps this is due to the position of the dominant media.
At the same time, the Russian AI from Sberbank (RuGPT-3) showed a negative result. Its absolute value is not as large as the U.S. AI. A small part of the answers demonstrates a coincidence with the point of view of the Russia government. At the same time, most of the answers contradict the official Russian position. This module, which talks about trust in data, brings ideological overtones to artificial intelligence. Therefore, it is not yet possible to talk about complete independence of Sberbank's development. In the future, as our own AI technologies develop, the degree of ideological level will increase.
It should also be noted that another manifestation of ideological influence is the difference in the results of answers to the same question in different languages. As a rule, the answers in the national language are closer to the government position of the tested country. Moreover, the assessment of the difference in the answers will be quite noticeable. We first established this fact by studying censorship on the Internet. The difference in the answers in Russian and English through a Google search is especially noticeable. The list of questions for testing remained unchanged.
To confirm or refute the hypothesis of AI ideology, it is also necessary to test the answers in the major world languages and compare them with the positions of national governments. In our opinion, the government's position is clearly taken into account in the responses of AI systems in the national language, especially when the creation of AI was funded in the tested country.
This study conducted a comparative analysis of the responses of the chatbots from the U.S. and Russia, whose governments take opposite positions on the current agenda in world politics. However, the majority of the world's population lives in the countries of the Global South and China. The positions of the governments of these countries have become more independent, so the responses of AI developed in their territories may differ significantly from these of ChatGPT and RuGPT. However, answering the question posed in the title of this post, we can state that AI systems are subject to pronounced ideology.
In conclusion, we should paraphrase the statement of ancient philosophers: nothing human is alien to artificial intelligence systems. Artificial intelligence systems copy human behavior, and intelligence is transferred to these systems from developers.
Antony Chayka ([email protected]) is a postgraduate student of Samara University, Samara, Russia. Andrei Sukhov ([email protected]) is a Senior Member of ACM, a professor at Joint HPC laboratory of Sevastopol State University and Samara University, Samara, Russia.
Dear authors, thank you for bringing this interesting topicto thebroader attention.
If you don't mind, I have a few follow-up questions:
1. Are the full results of this study available somewhere? For example, can we see the actual results of each question in Table 1 and an explanation of how the decision to assign the mentioned ratings was made?
2. Can you explain the process of deciding the "government position" for each question? At least for the US side, it looks like a very complicated task, so it would be very helpful if you could share your insights.
3. At least some questions seem not opinion-based but have an objective answer (for example, inflation in question 6). Don't you think that including such questions in this study is misleading?
Dear Grisha,
As for the answers to the questions, you can ask them yourself to the chat bots. But it does not make sense to provide answers in Russian, as most readers do not know it. And without these answers any data will be incomplete
Now my master has compiled an up-to-date list of questions in Arabic. Tests were conducted additionally on Chinese chat bots.
As for the position of the American government, it is quite easy to determine it. You have to type in google any question from the post and add the words, press secretary of the US president, secretary of state, etc. This is the official position.
The question about inflation has an exact answer, but the mass media rarely cite the data and distort it. But users themselves do not check this data. Back in 2015, Barack Obama claimed that Russia's economy was torn to shreds. Since then, this claim has been repeated many times, including by President Biden. The bottom line is that real data doesn't matter to propagandists.
I'll also note that the full discussion of this post is available here
https://cacm.acm.org/magazines/2023/12/278142-comparing-chatbots-trained-in-different-languages/comments?page=1
Displaying all 2 comments