%0 Journal Article %T 测验链接中的锚题代表性研究 %A 叶萌 %A 辛涛 %J 心理科学 %D 2015 %X 摘要: 本文旨在以“锚题代表性”这一研究命题切入,探索在非等组锚测验设计下,作为实现测验链接的重要载体,锚题和相关的测验试卷/水平之间究竟应该有什么关系。本文首先指出锚题代表性这一概念在等值和垂直量尺化领域中具有不同的含义,并给出其在垂直量尺化中的含义。通过考察测验链接中有关锚题代表性的既有研究,系统总结相关研究成果,本文概括出了当前锚题构建实践的可能优化方案,分析了锚题代表性研究的未来方向。</br>Abstract: Practice in psychological and educational measurement often needs to use the technique of test linking to analyze the difference or trend of the examinees’ traits. When implementing test linking, we need to choose a data collection design. Among several designs, the Non-Equivalent Groups with Anchor Test (NEAT) design is the most frequently used one. With this design, the anchor items, the set of items shared by all the examinee groups, are unique carrier to achieve test linking. Given the importance of the anchor items, except for their parameters, what we must pay attention to is the relationship pattern of the anchor items and their related test forms/levels. This paper was aimed to explore the relationship pattern based upon a concept clarification and a literature review about the representativeness of the anchor items. First, it indicated that the meaning of the representativeness of the anchor items is different between the equating research field and the vertical scaling field, the two most important sub-areas of linking. In equating, it involves the statement that the anchor test be a “mini version” of the tests to be equated. In present, it is widely believed that the anchor used in equating should be constructed according to the same specification with the total test, including the content and statistic characteristics, to accurately reflect the group difference. In vertical scaling, with the grade-to-grade definition of growth which corresponds to the NEAT design, achievement growth is defined on the content in the test level(s) appropriate for the specific grade level(s). According to this growth definition, the anchor which serves as the way to quantify the growth should also come from the test level(s) appropriate for the specific grade level(s). Therefore, the representativeness of the anchor items in vertical scaling can be defined as the representativeness of the anchor items to test level(s) they are chosen from. It is worthy to note that the definitions of the representativeness of the anchor items do not necessarily mean that the anchor items should be a “mini version” of the test levels they are chosen from. That is why the representativeness of the anchor items is called a topic here rather than a property or nature in test linking. Based on the above statement, this paper examined the existing research on the %K test linking %K equating %K vertical scaling %K Non-Equivalent Groups with Anchor Test design %K the representativeness of the anchor items %U http://www.psysci.org/CN/abstract/abstract9382.shtml