Evaluating ChatGPT-4’s historical accuracy: a case study on the origins of SWOT analysis
Peer reviewed, Journal article
Published version
Permanent lenke
https://hdl.handle.net/11250/3150350Utgivelsesdato
2024Metadata
Vis full innførselSamlinger
Originalversjon
Puyt, R. W., & Madsen, D. Ø. (2024). Evaluating ChatGPT-4’s historical accuracy: a case study on the origins of SWOT analysis. Frontiers in Artificial Intelligence, 7, Artikkel 1402047. https://doi.org/10.3389/frai.2024.1402047Sammendrag
In this study we test ChatGPT-4’s ability to provide accurate information about the origins and evolution of SWOT analysis, perhaps the most widely used strategy tool in practice worldwide. ChatGPT-4 is tested for historical accuracy and hallucinations. The API is prompted using a Python script with a series of structured questions from an Excel file and the results are recorded in another Excel file and rated on a binary scale. Our findings present a nuanced view of ChatGPT-4’s capabilities. We observe that while ChatGPT-4 demonstrates a high level of proficiency in describing and outlining the general concept of SWOT analysis, there are notable discrepancies when it comes to detailing its origins and evolution. These inaccuracies range from minor factual errors to more serious hallucinations that deviate from evidence in scholarly publications. However, we also find that ChatGPT-4 comes up with spontaneous historically accurate facts. Our interpretation of the result is that ChatGPT is largely trained on easily available websites and to a very limited extent has been trained on scholarly publications on SWOT analysis, especially when these are behind a paywall. We conclude with four propositions for future research.