论文标题
计算机科学教育中的基本费率忽视
Base rate neglect in computer science education
论文作者
论文摘要
在许多学术和工业应用中,机器学习(ML)算法变得越来越重要,因此,这种算法已成为计算机科学课程中的常见组成部分。学习ML不仅是由于其复杂的数学和算法方面的挑战,而且还因为a)在现实生活中正确使用这些算法的复杂性以及b)对相关社会和道德问题的理解。认知偏见是人脑的现象,可能导致错误的看法和非理性的决策过程。因此,在认知心理学和决策的背景下,对它们进行了彻底的研究。但是,他们对计算机科学教育也具有重要意义。 Kahneman和Tversky最初描述的一种众所周知的认知偏见是基本率忽略偏见,根据该偏差,人类在评估条件概率时未能考虑基本现象的基本速率。在本文中,我们探讨了ML教育中基本利率忽略偏见的表达。具体来说,我们表明,在不同背景(计算机科学专业的学生和教师,数据科学,工程,社会科学和数字人文科学)中,大约三分之一的学生在ML课程介绍中未能正确评估由于基本费率忽略偏见,因此无法正确评估ML算法的性能。该失败率应提醒教育者并促进开发新的教学方法,以教授ML算法性能。
Machine learning (ML) algorithms are gaining increased importance in many academic and industrial applications, and such algorithms are, accordingly, becoming common components in computer science curricula. Learning ML is challenging not only due to its complex mathematical and algorithmic aspects, but also due to a) the complexity of using correctly these algorithms in the context of real-life situations and b) the understanding of related social and ethical issues. Cognitive biases are phenomena of the human brain that may cause erroneous perceptions and irrational decision-making processes. As such, they have been researched thoroughly in the context of cognitive psychology and decision making; they do, however, have important implications for computer science education as well. One well-known cognitive bias, first described by Kahneman and Tversky, is the base rate neglect bias, according to which humans fail to consider the base rate of the underlying phenomena when evaluating conditional probabilities. In this paper, we explore the expression of the base rate neglect bias in ML education. Specifically, we show that about one third of students in an Introduction to ML course, from varied backgrounds (computer science students and teachers, data science, engineering, social science and digital humanities), fail to correctly evaluate ML algorithm performance due to the base rate neglect bias. This failure rate should alert educators and promote the development of new pedagogical methods for teaching ML algorithm performance.