人工智能课程教学评价量规设计模型与实践案例研究-数字化转型网

摘要

研究主要围绕人工智能课程教学评价展开，从人工智能课程教学评价现状、人工智能素养培养目标、传统评价理念与新型课程的矛盾冲突出发分析人工智能课程教学评价研究的必要性和迫切性。接着就评价量规应用于人工智能课程教学评价的适切性展开探讨。针对评价量规设计与落实，研究基于经典的评价系统三要素模型设计出指向人工智能课程教学评价的评价量规设计模型，具体为目标——期望成就、认知——学习进阶、观察——任务情境、诠释——评价量规四个步骤。根据模型以“智能分类器的特征提取”教学内容为例，展示完整的评价量规设计流程。最后论述了评价量规的应用有望让课程评价实现学生主体性、客观性、个性化和智能化。

关键词：学习进阶；评价设计；量规；人工智能课程；人工智能素养

引言

国务院发布的《新一代人工智能发展规划》和教育部印发的《教育信息化2.0行动计划》中都同时强调，应逐步向全民普及人工智能教育，特别是做好中小学阶段人工智能课程的建设和推广[1-2]。可以说在中小学开展基础性的人工智能课程是必然趋势。那么，人工智能课程到底能够在多大程度上完成正确价值观、基本知识与能力的培养是面临的关键问题之一。要回答这个问题，绕不过教学评价。从系统性角度出发，人工智能课程的教学评价在设计上既要“看见”新型课程的特点，又要“看见”学生学习的规律。

一、研究现状

我国中小学人工智能教育发展进入快车道，各省市区和各级各类学校积极开展人工智能课程。[3]其中，教学评价是完整教学过程的重要一环，成功的评价设计能给学生学习、教师教学及利益相关者的教育行动带来真实而有价值的反馈。但目前国内学界对中小学人工智能课程评价的关注度不高[4]，对具体教学评价方案的研究内容较少[5]。基于此，本文回答了人工智能课程应该“怎么评”的问题，并展示了相关教学案例。以期弥补我国人工智能课程评价研究之薄弱，希望为人工智能教学实践提供些许借鉴。

对于人工智能课程应该“怎么评”的问题，楼又嘉等人[6]提倡开展实证取向的课程评价，认为应关注形成性评价和总结性评价的过程，关注学生学习全过程和最后学习成果及案例的呈现，并构建人工智能课程考核评价量规。詹泽慧等人[7]认为应在人工智能课程中开展表现性评价，可以从学生的行为、作品、项目工作日志、试题或量表五个方面进行评价。可见，针对人工智能课程的教学评价总体趋势是基于学习证据和学习者表现的评价，但目前也存在一些误区和不足。一是直接套用信息技术课程目标，或把计算思维和编程能力作为主要教学目标。虽然人工智能是信息技术的分支，但人工智能课程有其特殊的教育使命，不能混为一谈。二是评价不是指向素养能力达成情况，而是指向具体任务的完成情况，如作品出现或未出现的具体元素、作品外观等，实际上只是项目清单，这种笼统的评价指向甚至会束缚创造性和元认知的发展。三是各研究者只是发现了课程中缺乏“评价量规”等工具的使用等问题，但并未给出具体解决措施。出现这些误区与不足的原因，主要在于人工智能课程评价设计缺乏系统性。所谓系统性，意味着评价设计不是独立存在的，而是在课程设计中与其他课程要素相联系的，共同服务于课程实践。其中，学术概念“课程联合”正是反映了这种系统视角下对评价设计的认识，即不同类型的学习结果和学习内容需要匹配不同的评价技术[8]。完整的教学评价设计应是自上而下的，即要从人工智能课程的评价目标、评价内容、评价标准、评价方式、评价工具来层层推演和精心设计。那么，与人工智能课程目标、内容、结果相匹配的评价应是什么样的呢？

二、评什么：适应智能社会的人工智能素养

若要回答人工智能课程教学“怎么评”，首先应明确评价目标，即“评什么”。我国研究者早已意识到人工智能素养的重要意义，并从不同角度对其进行研究和解构。侯贺中等人[9]从智能意识、智能态度、智能伦理、智能知识、智能技能、智能思维、智能创新等7个维度构建了智能素养金字塔模型，并把智能素养划分为初级（感受）、中级（体验）、高级（创新）三个递进的阶段。郑勤华等人[10]从人机协同的融合智能观出发，从智能知识、智能能力、智能思维、智能应用、智能态度五个维度阐述了智能素养的构成。张银荣等人[11]以“人工智能五大理念”为理论基础，建构了人工智能素养结构模型，其中AI知识、AI能力、AI伦理是人工智能素养的3个重要维度。无论人工智能素养的构成维度如何发生变化，实际上可以看出人工智能素养并非指某一种特定的知识或简单的能力，而是指在智能时代基于全新的人与技术的关系视角重建人的自我认知，学会与智能技术共处的一整套知识、技能、价值与信念。中央电化教育馆发布的《中小学人工智能技术与工程素养框架》中，明确指出以适应未来智能社会的必备品格和关键能力培养作为教学目标，本质上也是指向以人为本的人工智能素养的达成[12]。

当然，人工智能素养能力的内涵并非静态而是发展的。联合国教科文组织的人工智能素养能力框架的设计立足于以学习者为中心的教育理念，强调学习者（特别是非技术学习者）如何理解人工智能及如何与人工智能交互和合作。故而，从数字人文主义视角出发，以联合国教科文组织的人工智能素养能力框架为基础，结合我国《中小学人工智能课程开发标准（试行）》和《中小学人工智能技术与工程素养框架》，重构中小学人工智能素养能力框架。研究通过梳理人工智能素养各要素，围绕对人工智能的五个认知问题重构了素养能力框架（见图1）：什么是AI，AI能做什么，AI如何工作，如何设计AI系统，如何应用AI。在联合国教科文组织的素养能力框架下，增加“如何设计AI系统”这一问题框架，包含算法、编程、系统工程、系统设计与开发、系统评估与维护5个能力维度。在“AI如何工作”问题下，增加“人机协同”这一维度，突出人机协同工作能力培养的重要性。在“如何应用AI”问题下，增加价值观、智能社会2个维度，形成从思想观念到社会生活实践全面的素养提升目标。通过学习“什么是AI”和“AI能做什么”，发展学生的人工智能意识；通过学习“AI如何工作”和“如何设计AI系统”，发展学生的技术应用能力；学习“如何应用AI”，发展学生的智能社会责任；而实践创新思维的发展需贯穿整体课程学习当中。

三、怎么评：基于学习进阶的评价量规

（一）量规应用于人工智能课程评价的适切性

研究者提出了有效反馈模型，为了缩短目前表现与理想目标之间的差距，有效的反馈要回答“我要达到什么水平”“我如何达到目标水平”和“下一个目标是什么”三个问题。教学评价实际上就是一种反馈，而量规就是实现有效反馈的工具之一。它支持重申学习期望，支持监控学习和自我调节学习。量规是针对学生学习制定的，它包含一组清晰连贯的标准，以及这组标准下各层级的表现质量描述。[13]量规主要用于对两类学习表现进行评估：一是可观察到的学习过程，二是学习生成的实体成果。下面将从4个维度阐明量规应用于人工智能课程评价的适切性。①评价目的适切。发挥评价的导向作用，使其支持到学习发生和教学开展，是先进评价理念所推崇的评价目的。评价量规能为学习和教学提供有意义且具体的反馈和线索，特别是在教师难以照顾到所有小组的学习时，小组成员可以通过对照量规标准反思自身学习水平和改进学习。②评价目标适切。培养“数字公民”是人工智能课程的教育使命，因此课程目标指向适应智能社会的核心素养能力。素养能力的养成是一个由知识技能习得到能力发展，再到思维模式和信念建立的连续递进过程。而量规具有连续清晰的标准和各层级表现描述，能够真实反映学生学习过程思维的变化，引导学生朝教育目标发展。③评价方式适切。在课标中，多次强调要落实以项目式学习方式开展人工智能课程。在实践中，可以观察到机器人教育、创客教育、编程教育等也一般依托小组进行协作项目式学习。这种学习方式赋予了人工智能课程实践性、发展性、开放性和学生主体性的特点。而过程性评价倾向于“过程”与“发展”的价值取向，强调目标与过程并重，是一种主张内外结合的、开放的、教学评融合的评价方式[14]，能与项目式学习特点相契合。实际上，作为评价工具的量规与过程性评价的理念天然具有高度的一致性。一是考虑到量规主要的评价范围；二是量规既能支持“量化”的外部测量，又能实现“质性”的内部评估；三是量规支持评价主体多元化，突出协作和共同参与；四是量规的评价信息和学习线索回流促进学生自我调节的发展、学习项目的发展以及教师教学的发展。

④评价内容适切。人工智能属于应用科学，其课程内容往往与生活和生产实际联系紧密，多为建构不良的学习专题，例如人工智能伦理问题、算法与程序设计、设计新技术解决方案等。评价量规能超越局限于有标准答案的知识评价，能够很好地对建构不良的学习内容及较抽象的能力素养进行评价，设计合理的量规，甚至还能发挥学生的创造性和元认知。

（二）评价量规设计的依据和模型

评价量规的设计要基于对人是如何学习该学科领域的知识的研究和理论进行分析，量规中各层级的表现质量描述要依据“学生的理解如何发展”这一认识展开，实际上就是学生认知模型。有效的评价系统必须包含3个关键要素——认知、观察、诠释，且这3个要素必须加以连接形成协同的整体。考虑到学科核心素养和课程目标的落实在教学评价设计中发挥着指引性的作用，因此我们认为目标也是评价系统的关键要素之一。在经典的评价系统3要素模型基础上，研究设计出指向人工智能课程教学评价的评价量规设计模型（见图2）。该模型从目标出发，连接认知、观察、诠释。四个部分构成循环协同的整体。

在这里，目标指通过课程学习，学生在此学科领域内要达到的期望成就。期望成就是人工智能素养能力在相应课程内容中的具体体现。认知是指学生在此领域获取的认识和知识，它是关于学生如何表征知识，以及如何从新手发展成为专家的过程的认识。其中，学习进阶（也称构念地图）是描述学生随着学习时间推移，其某种知识或能力发展所展示出来的学习路径，它为创建有意义的评价提供清晰的指南，是后续观察和诠释两个环节的重要基础。实际上，随着学习进阶研究的影响力不断拓展，许多研究机构和学者以此为基础，进行测评开发和设计。例如伯克利评估与测量研究中心的BEAR测评系统就是较有代表性的产品[15]。观察是指借助任务情境学生提供可观察的知识习得与能力发展的有效证据。教师或评估设计者应该了解什么可以用来揭示学生知道什么和可以做什么，什么特定的任务可能引发的知识和技能的类型。诠释是指对观察中收集到的证据加以理解和判断的过程。指向目标期望成就的、包含各层级学习表现描述的评价量规，是理解证据并给出定量或定性判断的有效途径。基于此，学生的学习得到客观的评价，评价的反馈信息反过来对课程目标达成情况进行检验，教师利用这些信息可以进一步改进学习进阶模型，修改任务情境设计方案。该模型重点为人工智能课程应该“怎么评”提供一种思路，同时也为课程的项目、任务和情境设计提供相关指引。

（三）实现：评价量规设计的案例

本部分主要以“智能分类器的特征提取”为案例，展示如何根据上文提出的设计模型实现针对具体教学内容的评价量规设计。该教学内容选自《人工智能基础（高中版）》（华东师范大学出版社出版）教材第二章第2小节“提取特征”，主要让学生认识智能分类器及其运作机制，掌握其中重要的步骤“提取特征”。内容主要由理解分类过程、识别事物的特征和有效特征、量化有效特征并转化成特征向量、利用特征向量构建特征空间并理解如何利用特征空间进行分类等组成，笔者将本课命名为“智能分类器的特征提取”。

步骤一：根据上述中小学人工智能素养能力重构框架和本节教学内容确定教学最终需要达成的期望成就（见表1）。

步骤二：学习进阶的开发并不是一蹴而就的，而是一个假设与验证、理论与实践不断交替、逐步完善的过程，最终确认学生理解和发展的真实轨迹。本案例采用自上而下的方法预测学生思维的发展，开发学生对“智能分类器的特征提取”学习内容理解情况的学习进阶（见表2）。该学习进阶还需后续改进、完善。

步骤三：任务情境的设计是为了推动学习的展开和充分展现学生的知识技能和情感态度价值观，帮助学生和教师了解学习进度。针对需要观察的学习成就设计对应的任务情境（见图3），任务总体上层层递进，由易到难，以期在学生完成任务过程和提交的任务成果中透视出学生的成就水平。任务情境可以是开放式的，如案例中任务三的方案设计，没有给出固定的问题，充分发挥学生的想象力和观察力；当然也可以是半开放式的，例如蕨类植物的智能分类系统设计，这样有利于减轻教师课堂教学的难度和工作量。

步骤四：使用评价量规可以测量和诠释通过任务情境收集到的学习成就证据，最终获得对学生学习的客观评价。评价量规编制过程中需要注意：关注学习成就而非任务，任务只是成就水平展示的媒介，量规内容应指向任务所预示学习成就的特征，而不应沦为任务清单或作品要求；关注学习过程而非分数等级，量规不应充当简单的打分工具，而是能指明“我现在在哪里”和“我将要去哪里”，作为促进自我调节和学习反思的有效手段。在本案例中，评价量规针对每个学习任务，下设一级指标和二级指标，并对每个指标的含义进行描述。等级划分与学习进阶呼应，共设计了4个等级，尽量涵盖学生表现的各类情况（见表3）。

四、总结与展望

本文就人工智能课程应该如何进行教学评价展开研究，提出了基于学习进阶的评价量规设计模型，并展示了完整的评价量规设计流程。通过对“提取特征”学习内容进行了评价量规的设计，可以看到在人工智能课程教学中，能够实现对每个学习者学习表现的精准评价与指导；更重要的是，通过此类型的学习评价，为学习者的分步骤、阶段性学习进阶提供了可观察、诠释的学习依据。评价量规与人工智能课程的结合，有望实现教学评价的学生主体性、客观性、个性化和智能化。学生主体性意味着学生不仅仅是被动的评价对象，更是对学习效能起决定性作用的主体，学生应当参与学习评价的全过程。崔允漷指出学习评价的新范式——“促进学习的评价”已逐渐成为主流[16]。评价量规有利于促进自我调节的学习，调动学生的元认知，学生使用评价量规不但能够检验自身的学习成果，更重要的是能够获得对学习的掌控和学会学习。为教师、学生、同伴、家长提供简明且操作性强的评价量规，有利于多元化主体共同参与评价，使评价可以基于不同视角和不同维度展开，最终获得比较公平客观的评价结果。评价量规对每个层级的表现都有细致描述，能照顾到学生学习的个性化差异。学习是一个发展性的过程，个体经验和社会文化的差异让每个学生处在不同的理解阶段，利用评价量规，每个学生都可以得到个性化评估。同时，评价量规有可能为学习评价和人工智能技术相结合提供机会，实现评价的智能化。通过应用智能技术收集证据和基于评价量规进行推理评估，为每个学生生成个性化的能力雷达图，立体展示其人工智能素养整个能力圈层中各目标的完成情况。

关于人工智能课程教学评价的相关问题，在未来研究中还可以从以下几个方面深入探讨。一是完善学生在人工智能学科领域学习的学习进阶，从理论和实践研究中明确学生理解和发展的真实轨迹。二是检验评价量规的实际应用效果，如收集评价量规促进学习的证据、了解不同主体视角下评价量规的作用和影响，为人工智能教育收集循证依据。三是技术赋能教学评价，开发基于评价量规的智能评价工具，人机协同促进评价与决策。四是运用评价量规进一步跟踪学生的人工智能价值观的转变轨迹。五是基于评价量规探索符合人工智能教育规律的教学新路径。

翻译：

Abstract

The research mainly focuses on the teaching evaluation of artificial intelligence curriculum, and analyzes the necessity and urgency of the research on the teaching evaluation of artificial intelligence curriculum from the perspectives of the present situation of the teaching evaluation of artificial intelligence curriculum, the objectives of the cultivation of artificial intelligence literacy, and the contradiction and conflict between the traditional evaluation concept and the new curriculum. Then it discusses the appropriateness of applying the evaluation gauge to the teaching evaluation of artificial intelligence course.

Aiming at the design and implementation of evaluation gauge, based on the three elements model of classical evaluation system, an evaluation gauge design model pointing to the evaluation of artificial intelligence course teaching is designed, which includes four steps: goal – expected achievement, cognition – learning advancement, observation – task situation, interpretation – evaluation gauge. According to the model, the teaching content of “Feature extraction of intelligent classifier” is taken as an example to show the complete design process of evaluation gauge. Finally, the paper discusses that the application of evaluation gauge is expected to make the curriculum evaluation realize the subjectivity, objectivity, individuation and intelligence of students.

Key words: learning progression; Evaluation design; A gauge; Artificial intelligence course; Ai literacy

Introduction

The Development Plan of the New Generation of Artificial Intelligence issued by The State Council and the Action Plan of Education Informatization 2.0 issued by the Ministry of Education both emphasize that AI education should be gradually popularized to the whole people, especially the construction and promotion of AI courses in primary and secondary schools [1-2]. It can be said that it is an inevitable trend to carry out basic AI courses in primary and secondary schools. Therefore, to what extent artificial intelligence curriculum can complete the cultivation of correct values, basic knowledge and ability is one of the key issues facing us. To answer this question, teaching evaluation is the only way. From a systematic point of view, the teaching evaluation of artificial intelligence courses should not only “see” the characteristics of new courses, but also “see” the rules of students’ learning.

Research status

The development of artificial intelligence education in Chinese primary and secondary schools has entered the fast lane. All provinces, municipalities and all kinds of schools at all levels are actively carrying out artificial intelligence courses. [3] Teaching evaluation is an important part of the complete teaching process. Successful evaluation design can bring real and valuable feedback to students’ learning, teachers’ teaching and stakeholders’ educational actions. However, at present, domestic academic circles do not pay much attention to the curriculum evaluation of artificial intelligence in primary and secondary schools [4], and there is little research content on specific teaching evaluation programs [5]. Based on this, this paper answers the question of “how to evaluate” artificial intelligence courses and shows relevant teaching cases. In order to make up for the weakness of curriculum evaluation of artificial intelligence research, hoping to provide some reference for the teaching practice of artificial intelligence.

With regard to the question of “how to evaluate” artificial intelligence courses, Lou Youjia et al. [6] advocated the course evaluation based on evidence. And believed that the process of formative evaluation and summative evaluation should be paid attention to. As well as the presentation of students’ whole learning process and final learning results as well as cases. And the evaluation criteria of artificial intelligence courses should be constructed. Zhan Zehui et al. [7] believe that performance evaluation should be carried out in artificial intelligence courses. Which can be evaluated from the five aspects of students’ behavior, works, project work log, test questions or scales. It can be seen that the general trend of teaching evaluation for artificial intelligence courses is based on learning evidence and learner performance evaluation. But there are some misunderstandings and deficiencies.

There are also some mistakes and deficiencies in the course of artificial intelligence

One is to directly apply the curriculum objectives of information technology. Or take computational thinking and programming ability as the main teaching objectives. Although AI is a branch of information technology, AI curriculum has its special educational mission and should not be confused. Second, the evaluation does not point to the achievement of literacy ability. But to the completion of specific tasks, such as the presence or absence of specific elements of the work. The appearance of the work, and so on. In fact, it is just a list of items. Such general evaluation direction will even restrict the development of creativity and metacognition. Third, the researchers only found the lack of the use of “evaluation gauge” and other tools in the curriculum, but did not give specific solutions.

The main reason for these errors and deficiencies lies in the lack of systematic design of artificial intelligence curriculum evaluation. The so-called systematic means that evaluation design does not exist independently. But is linked with other curriculum elements in curriculum design and serves curriculum practice together. Among them, the academic concept of “curriculum association” reflects the understanding of evaluation design from this systematic perspective. That is, different types of learning outcomes and learning content need to match different evaluation techniques [8]. A complete instructional evaluation design should be top-down, that is. It should be developed and carefully designed from the evaluation objectives, content, standards, methods and tools of artificial intelligence courses. Then, what kind of evaluation should match the objectives, content and results of the AI course?

What to evaluate: AI literacy to adapt to the intelligent society

In order to answer the question of “how to evaluate” the teaching of artificial intelligence courses. We should first clarify the evaluation objective, namely “what to evaluate”. Chinese researchers have already realized the significance of artificial intelligence literacy and have studied and deconstructed it from different angles. Hou Hezhong et al. [9] built a pyramid model of intelligent literacy from seven dimensions, including intelligent consciousness, intelligent attitude, intelligent ethics, intelligent knowledge, intelligent skills, intelligent thinking and intelligent innovation, and divided intelligent literacy into three progressive stages: primary (feeling), intermediate (experience) and advanced (innovation). Zheng Qinhua et al. [10] expounded the composition of intelligent literacy from five dimensions of intelligent knowledge, intelligent ability, intelligent thinking, intelligent application and intelligent attitude, starting from the view of integrated intelligence of man-machine collaboration.

Zhang Yinrong et al. [11] built a structural model of AI literacy based on “Five concepts of AI”. In which AI knowledge, AI ability and AI ethics are three important dimensions of AI literacy. No matter how the composition dimension of AI literacy changes, in fact. It can be seen that AI literacy does not refer to a certain kind of knowledge or simple ability. But refers to a set of knowledge, skills, values and beliefs to rebuild people’s self-cognition based on a new perspective of human-technology relationship in the era of intelligence, and learn to coexist with intelligent technology.

In the Framework of Artificial Intelligence Technology and Engineering Literacy for primary and secondary schools released by the Central Audio-Visual Education Museum, it is clearly pointed out that the cultivation of essential character and key ability to adapt to the future intelligent society is the teaching goal, which is essentially directed to the achievement of human-oriented artificial intelligence literacy [12].

Of course, the connotation of AI literacy competence is not static but developmental.

The design of UNESCO’s AI Literacy Capacity Framework is based on a learner-centred approach to education, emphasizing how learners. Especially non-technical learners, understand AI and how they interact and collaborate with it. Therefore, from the perspective of digital humanism, on the basis of UNESCO’s artificial intelligence literacy competence framework, combined with Chinese Curriculum Development Standards for Artificial Intelligence in primary and secondary Schools (Trial) and Framework of Artificial Intelligence Technology and Engineering Literacy in primary and secondary schools, the framework of artificial intelligence literacy in primary and secondary schools is reconstructed. By sorting out various elements of AI literacy. The research reconstructs the literacy competence framework around five cognitive questions about AI (see Figure 1) :. What is AI, what can AI do, how AI works, how to design AI systems, and how to apply AI.

Under the UNESCO Literacy competence framework, the question framework of “How to design AI systems” is added. Including five dimensions of competence: algorithm, programming, systems engineering, system design and development, and system evaluation and maintenance. Under the question “How AI works”, the dimension of “man-machine collaboration” is added to highlight the importance of man-machine collaborative work ability cultivation. Under the question of “How to apply AI”. Two dimensions of values and intelligent society are added to form a comprehensive goal of improving literacy from ideology to social life practice. Develop students’ awareness of AI by learning “what AI is” and “what AI can do”. Develop students’ ability to apply technology by learning how AI works and how to design AI systems. Learn “how to apply AI” to develop students’ intelligent social responsibility. The development of practical and innovative thinking should run through the whole course learning.

How to evaluate: Evaluation gauge based on learning progression

(1) The appropriateness of the gauge applied to the evaluation of artificial intelligence courses

The researchers proposed an effective feedback model. In order to bridge the gap between current performance and desired goals, effective feedback should answer three questions:. “Where do I want to get to?”, “How do I get to the goal?” and “What is the next goal?” Teaching evaluation is actually a kind of feedback. And the gauge is one of the tools to achieve effective feedback. It supports reaffirmation of learning expectations, monitoring learning and self-regulating learning. Gauges are developed for student learning and contain a clear and coherent set of standards and a description of the quality of performance at each level under this set of standards. [13] Gauges are mainly used to assess two types of learning performance:. The observable learning process and the physical outcomes of learning. The following will clarify the appropriateness of the gauge applied to the evaluation of artificial intelligence courses from four dimensions.

The purpose of evaluation is appropriate.

Giving full play to the guiding role of evaluation and making it support to the occurrence of learning and the development of teaching is the evaluation purpose advocated by the advanced evaluation concept. Evaluation gauges can provide meaningful and concrete feedback and clues for learning and teaching, especially when it is difficult for teachers to take care of all group learning, group members can reflect on their own learning level and improve learning by comparing gauge standards.

The objective of evaluation is appropriate.

Cultivating “digital citizen” is the educational mission of artificial intelligence course. So the course target is to adapt to the core literacy ability of intelligent society. The cultivation of literacy ability is a continuous progressive process from the acquisition of knowledge and skills to the development of ability and then to the establishment of thinking patterns and beliefs. The gauge has continuous and clear standards and performance description at each level. Which can truly reflect the change of students’ thinking in the learning process and guide students to develop towards the educational goal.

Appropriate evaluation methods.

In the curriculum standard. It has been emphasized for many times to carry out AI courses in the way of project-based learning. In practice, it can be observed that robot education, maker education, programming education and so on generally rely on group collaborative project-type learning. This way of learning gives the AI course the characteristics of practicality, development, openness and students’ subjectivity. However, procedural evaluation tends to be the value orientation of “process” and “development” and emphasizes equal emphasis on goal and process. It is a kind of evaluation method that advocates internal and external combination, openness and integration of teaching evaluation [14], which can fit with the characteristics of project-based learning.

In fact, as an evaluation tool, the gauge and the concept of procedural evaluation naturally have a high degree of consistency. One is to take into account the main evaluation range of the gauge. Second, the gauge can not only support the “quantitative” external measurement, but also realize the “qualitative” internal evaluation. Third, the gauge supports the diversification of evaluation subject, highlighting cooperation and joint participation. Fourth, the evaluation information and learning clues of the gauge can promote the development of students’ self-regulation. The development of learning projects and the development of teachers’ teaching.

The evaluation content is appropriate.

Artificial intelligence is an applied science. And its course content is often closely related to the reality of life and production. Most of its courses are ill-constructed learning topics. Such as the ethical issues of artificial intelligence, algorithm and programming, and the design of new technological solutions. The evaluation gauge can go beyond the knowledge evaluation with standard answers, can well evaluate the poorly constructed learning content and the more abstract ability and accomplishment, design a reasonable gauge, and even give play to students’ creativity and metacognition.

(2) The basis and model for evaluating gauge design

The design of the evaluation gauge should be based on the research and theoretical analysis of how people learn knowledge in the subject area. And the description of performance quality at each level of the gauge should be based on the understanding of “how students’ understanding develops”. Which is actually a student cognitive model. An effective evaluation system must contain three key elements — cognition, observation and interpretation. And these three elements must be connected to form a collaborative whole.

Considering that the implementation of subject core literacy and curriculum objectives plays a guiding role in the design of teaching evaluation. We believe that objectives are also one of the key elements of the evaluation system. On the basis of the classic three-element model of evaluation system, an evaluation gauge design model pointing to the evaluation of artificial intelligence course teaching is designed (see Figure 2). This model connects cognition, observation and interpretation from the goal. The four parts form a circular, synergistic whole.

In this context, the goal refers to the desired achievement of the student in the subject area through the course of study.

Expected achievement is the concrete embodiment of artificial intelligence literacy ability in the corresponding course content. Cognition refers to the knowledge and knowledge students acquire in this field. It is about how students represent knowledge and how they develop from novices to experts. Among them, learning progression (also known as construct map) describes the learning path shown by the development of certain knowledge or ability of students over the passage of learning time. It provides a clear guide for creating meaningful evaluation, and serves as an important basis for follow-up observation and interpretation. In fact, as the influence of learning progression research continues to expand. Many research institutions and scholars use it as a basis for assessment development and design. For example, the BEAR evaluation system of Berkeley Evaluation and Measurement Research Center is a representative product [15].

Observation refers to the effective evidence of observable knowledge acquisition and ability development provided by students in task situations.

The teacher or assessment designer should understand what can be used to reveal what the student knows and can do. What particular tasks may trigger the types of knowledge and skills. Interpretation is the process of understanding and judging evidence gathered from observation. An evaluation gauge that includes descriptions of learning performance at all levels and is directed towards goals and expected achievements is an effective way to understand evidence and give quantitative or qualitative judgments. Based on this, students’ learning is objectively evaluated. And the feedback information from the evaluation in turn tests the achievement of curriculum objectives. Teachers can use this information to further improve the learning progression model and modify the task situation design scheme. The model focuses on providing an idea of “how to evaluate” the AI course. And also provides relevant guidance for the project, task and situation design of the course.

(3) Implementation: evaluation gauge design cases

This part mainly takes “Feature extraction of intelligent classifier” as a case to show how to realize the evaluation gauge design for specific teaching content according to the design model proposed above. This teaching content is selected from the Basis of Artificial Intelligence (High School Edition) textbook (East China Normal University Press) Chapter 2 section 2 “Feature extraction”, mainly let students understand the intelligent classifier and its operation mechanism, master the important step “feature extraction”. The content is mainly composed of understanding classification process, identifying features and effective features of things, quantifying effective features and converting them into feature vectors, using feature vectors to build feature Spaces and understanding how to use feature Spaces for classification, etc. The author named this course “Feature Extraction of Intelligent Classifier”.

The steps are as follows：

Step 1: According to the above framework of artificial intelligence literacy and ability reconstruction in primary and secondary schools and the teaching content in this section. The final expected achievement of teaching needs to be determined (see Table 1).

Step 2: The development of learning progression is not an overnight process. But a process of constant alternations of hypothesis and verification, theory and practice. And gradual improvement, and finally confirm the true track of students’ understanding and development. In this case, a top-down approach is adopted to predict the development of students’ thinking and develop the learning progression of students’ understanding of the learning content of “Feature extraction of intelligent classifier” (see Table 2). The learning level still needs to be improved and perfected.

Step 3: The task situation is designed to promote the development of learning and fully demonstrate students’ knowledge, skills, emotions, attitudes and values. So as to help students and teachers understand the learning progress. The corresponding task situation is designed for the learning achievements to be observed (see Figure 3). In general, the tasks progress from easy to difficult. So as to reflect the achievement level of students in the process of completing tasks and the submitted task results. The task situation can be open-ended. For example, the scheme design of task 3 in the case does not give a fixed problem, giving full play to the imagination and observation of students. Of course, it can also be semi-open. Such as the intelligent classification system design of ferns. Which is conducive to reducing the difficulty and workload of teachers’ classroom teaching.

The steps are as follows

Step 4: The use of evaluation gauges can measure and interpret the evidence of learning achievement collected through task situations, and ultimately obtain an objective evaluation of student learning. Attention should be paid to learning achievements rather than tasks in the process of compiling evaluation gauges. Tasks are only the medium of achievement level display. The contents of the gauges should point to the characteristics of learning achievements predicted by tasks. Rather than being reduced to task lists or work requirements.

Focusing on the learning process rather than grades. The gauge should not serve as a simple scoring tool. But can indicate “where I am now” and “where I am going” as an effective means of promoting self-regulation and reflection of learning. In this case, for each learning task. The evaluation gauge sets first-level indicators and second-level indicators, and describes the meaning of each indicator. The grade division echoes the learning progression. And a total of 4 grades are designed to cover various situations of students’ performance as far as possible (see Table 3).

Summary and Outlook

This paper studies how to conduct teaching evaluation in artificial intelligence courses, proposes an evaluation gauge design model based on learning progression. And shows the complete evaluation gauge design process. Through the design of the evaluation gauge for the learning content of “feature extraction”. It can be seen that the accurate evaluation and guidance of each learner’s learning performance can be realized in the teaching of artificial intelligence course. More importantly, this type of learning evaluation provides an observable and interpretable learning basis for the step-by-step and phased learning progression of learners. The combination of evaluation gauge and artificial intelligence course is expected to achieve student subjectivity, objectivity, individuation and intelligence of teaching evaluation. Student subjectivity means that students are not only the passive evaluation object. But also the subject that plays a decisive role in learning effectiveness. Students should participate in the whole process of learning evaluation.

Choi Yunrong pointed out that “Evaluation to promote learning”. A new paradigm of learning evaluation, has gradually become the mainstream [16]. Evaluation gauges are conducive to promoting self-regulated learning and mobilizing students’ metacognition. Students can use evaluation gauges not only to test their own learning results. But more importantly, to gain control of learning and learn to learn. Providing concise and operable evaluation gauges for teachers, students, peers and parents is conducive to the joint participation of diverse subjects in the evaluation. So that the evaluation can be developed based on different perspectives and different dimensions. And finally obtain relatively fair and objective evaluation results.

Evaluation gauges provide detailed descriptions of performance at each level, allowing for individual differences in student learning.

Learning is a developmental process. Individual experience and social and cultural differences make each student at a different stage of understanding. With the use of evaluation gauges, each student can get personalized assessment. At the same time, the evaluation gauge may provide opportunities for the combination of learning evaluation and artificial intelligence technology to realize the intelligence of evaluation. Through the application of intelligent technology to collect evidence and reasoning assessment based on evaluation gauges. A personalized ability radar map is generated for each student to display the achievement of each goal in the whole ability circle of artificial intelligence literacy.

In the future, the problems related to the teaching evaluation of artificial intelligence course can be further discussed from the following aspects. The first is to improve students’ learning progression in the field of artificial intelligence. And to clarify the real track of students’ understanding and development from theoretical and practical research. Third, technology enabling teaching evaluation, the development of intelligent evaluation tools based on evaluation gauges, man-machine collaboration to promote evaluation and decision-making. Fourth, the evaluation gauge is used to further track the transformation of students’ artificial intelligence values. The fifth is to explore a new teaching path in line with the law of artificial intelligence education based on the evaluation gauge.

本文由数字化转型网（www.szhzxw.cn）转载而成，来源于数字教育，作者：李睿、何敏怡；编辑/翻译：数字化转型网宁檬树。

免责声明: 本网站(https://www.szhzxw.cn/)内容主要来自原创、合作媒体供稿和第三方投稿，凡在本网站出现的信息，均仅供参考。本网站将尽力确保所提供信息的准确性及可靠性，但不保证有关资料的准确性及可靠性，读者在使用前请进一步核实，并对任何自主决定的行为负责。本网站对有关资料所引致的错误、不确或遗漏，概不负任何法律责任。

本网站刊载的所有内容(包括但不仅限文字、图片、LOGO、音频、视频、软件、程序等) 版权归原作者所有。任何单位或个人认为本网站中的内容可能涉嫌侵犯其知识产权或存在不实内容时，请及时通知本站，予以删除。

人工智能课程教学评价量规设计模型与实践案例研究

摘要

引言

一、研究现状

二、评什么：适应智能社会的人工智能素养