Volume 8· Issue 1 · Feb 2026
Teaching Evaluation and Measurement
Exploration of Classroom Teaching Transformation Practice for the "Three-Dimensional Dynamic Assessment Framework" in Junior High School Chinese Language Education
He Yunmei [China]
Abstract
To address prevalent issues in junior high school Chinese language assessment—such as overemphasis on test scores at the expense of literacy development and delayed feedback—the study proposes a "Three-Dimensional Dynamic Assessment Framework" integrating three key dimensions: formative assessment, context-based tasks, and culturally responsive evaluation. Empirical research conducted across 12 middle schools demonstrates that this model significantly enhances students 'text interpretation skills (+31.2%) and writing transfer abilities (+26.8%), while also improving teachers' instructional decision-making precision. This research provides actionable assessment pathways for implementing core Chinese language competencies, advancing the integrated "teaching-learning-assessment" reform.
Keywords: educational measurement; assessment paradigm; core literacy of Chinese; dynamic assessment; junior high school Chinese
1. Introduction: The Realistic Difficulties of Chinese Language Assessment in Junior High School
The Compulsory Education Chinese Curriculum Standard (2022 Edition) emphasizes the establishment of an evaluation system that promotes students' all-round development, yet the current assessment of junior high school Chinese still faces three major contradictions:
Tool Over reliance: The 2023 "Compulsory Education Quality Monitoring Report" by the Ministry of Education's Basic Education Quality Monitoring Center reveals that over 75% of junior high schools nationwide still use standardized test papers as the primary assessment tool for Chinese language education. This evaluation system, predominantly composed of multiple-choice and fill-in-the-blank questions, struggles to effectively measure higher-order competencies such as aesthetic appreciation and cultural understanding. For instance, in teaching evaluations of "The Record of Yueyang Tower," only 12% of questions assess students' deep comprehension of the philosophical sentiment encapsulated in the phrase "Worry before the world worries, and enjoy after the world enjoys." Most questions remain at the level of mechanical testing of word definitions and paragraph summaries.
Process-oriented evaluation: Assessment activities predominantly focus on summative tests at the end of each semester, overlooking the value of formative evidence such as classroom dialogues, writing drafts, and collaborative group projects. A 2022 survey by a provincial education research institute revealed that 83% of junior high school Chinese teachers admitted their evaluation practices were "result-oriented rather than process-oriented," with only 9% of schools implementing regular process evaluation portfolios. This approach deprives students of opportunities to refine their writing and fails to effectively capture or quantify the intellectual sparks generated during classroom discussions.
Insufficient cultural adaptation: The assessment design failed to account for the cultural background differences between urban and rural students, putting non-urban students at a disadvantage in interpreting dialect texts. Taking a junior high school in the Cantonese-speaking region as an example, in the 2023 evaluation of the "Rural China" selected reading unit, questions completed in Mandarin written language resulted in rural students scoring 18.6 points lower on average than urban students, with the main gap lying in their understanding of dialect proverbs and folk allusions. Another study pointed out that urban cultural elements account for 72% of the current assessment materials, while rural and ethnic minority cultural content accounts for less than 15%. This cultural imbalance leads to significant deviations in the assessment performance of students from different backgrounds.
Therefore, it is urgent to construct a new evaluation paradigm with both disciplinary characteristics and cultural inclusiveness.
2. Theoretical Framework: From Static Scoring to Dynamic Empowerment
2.1 Design Principles of 3D Dynamic Evaluation Model
Based on the four dimensions of Chinese core literacy (language construction and application, thinking development and promotion, aesthetic appreciation and creation, cultural inheritance and understanding), the evaluation structure is constructed:
Process-oriented diagnosis (learning trajectory tracking) Contextualized tasks Culturally responsive evaluation
Process diagnosis
The growth portfolio documents students' revision history from initial drafts to final versions (e.g., three revisions of the essay "My Hometown" with adjustments to landscape descriptions), classroom recordings (e.g., debate responses on "the value of online literature"), and reading annotations (e.g., marginal notes and side notes during character analysis of "Dream of the Red Chamber"), forming a "learning evidence chain" 7. The error attribution table categorizes classical Chinese vocabulary errors, including "ancient vs. modern meanings" (e.g., "zou" meaning "run" in ancient texts), "homophone confusion" (e.g., "shuo" interchangeable with "yue"), and "word class conversion" (e.g., "nouns functioning as verbs"), generating personalized error correction plans. For instance, it designs specialized practice sets for students who frequently confuse homophones.
Situational task
Design authentic cultural scenarios: For instance, compiling oral histories for local intangible cultural heritage inheritors (integrating interviewing techniques, narrative writing skills, and understanding of local culture with respect); Adapt textbook tasks: Transform the post-reading exercise "Analyze Character Traits" from the text "Kong Yiji" into "Designing a Character Relationship Map for Xianheng Hotel." Students are required to annotate character traits and relationships based on textual details, assessing both language application and logical thinking.
Cultural responsiveness evaluation
Dialect Resource Utilization: In teaching "She Xi" (a traditional folk opera), compare the emotional nuances between Mandarin and dialect readings (e.g., how the intonation variations in the Shaoxing dialect's "Gui Hang" section convey joy). Urban-Rural Differences: Urban students evaluate "subway slogan optimization" (e.g., replacing "Please do not crowd" with "Civilized courtesy, safe travel" to assess linguistic appropriateness and social awareness), while rural students evaluate "agricultural tool instruction revisions" (e.g., explaining the structure and operation of a "plow" in plain language to test language adaptability and practicality).
2.2 Breakthroughs in Innovation
Diversified Evaluation Framework: The "double-blind peer review system" enables anonymous student evaluations, followed by teachers' secondary review focusing on logical flaws (e.g., weak reasoning, disconnect between evidence and arguments) with targeted improvement suggestions. Real-time Feedback: Classroom handheld feedback boards (red/yellow/green) track reading comprehension progress in real time. For example, after analyzing the "Back Shadow" text, teachers can instantly identify 80% correct understanding, 15% partial errors, and 5% complete misunderstandings through color distribution analysis, allowing immediate adjustments to teaching priorities.
3. Practice Case: Classroom Implementation of the Three-Dimensional Assessment Framework
3.1 Diagnostic Assessment: Precision Profiling of Learning Status and Stratified Intervention
Development of Multidimensional Learning Situation Diagnostic Tool
The Classical Chinese Decoding Scale: A "Comparative Matrix of Ancient and Modern Meanings" was designed for "Record of Yueyang Tower", requiring students to annotate the ancient and modern meanings of "wei" in "Wei Si Ren", and to conduct intertextual analysis by correlating it with "Tui Ran" in "Record of Drunken Old Man's Pavilion".
The pre-test revealed that 42% of students confused word class usage, based on which personalized classical Chinese learning cards were generated.
Dyslexia radar chart: Diagnosing weaknesses through the "Narrative Elements Recognition Test". For instance, when studying "The Back View", 30% of students failed to distinguish between "detailed description" and "scene rendering". Based on this, the teacher designed a "Five-Sense Observation Training": reconstructing the textual scene using tactile (texture of the father's cotton robe) and auditory (vending cries on the platform) senses.
Hierarchical task chain design:
A[Basic Layer] --> B (Organize the character relationship network in "The Village Opera")
C[Progress to the next level] --> D (Analyze the leverage effect of "double happiness" in the plot)
E[Challenge Layer] --> F (Create a "Contemporary Rural Youth" by imitating Lu Xun's writing style)
Example: In a school's experimental class, students were grouped based on diagnostic results. South Asian students used the' Cantonese-Chinese Classical Chinese Comparison Table 'to understand the local semantic transfer of the character' dang' (dang).
3.2 Process Evaluation: Embedded Feedback and Dynamic Adjustment
The Three-Stage Feedback Method of Composition Guidance
Instant annotations: Use the "symbolic coding system" to mark questions (e.g., ▲ for example of hollowed-out spaces), and students can scan the QR code to access a sample library (e.g., concrete descriptions from Zhu Ziqing's "Spring").
Peer Evaluation Guidelines: Design a "3+2 Evaluation Scale" (3 strengths + 2 improvement suggestions), with a focus on training the "Argument Support" evaluation. Case studies show that after peer evaluation, the clarity rate of argumentative essay viewpoints increased by 37%8.
Cross-temporal revision: Students were tasked with selecting the tavern scene from 'Kong Yiji' and supplementing the environmental description with historical photographs of 'Xianheng Hotel'. The AI platform automatically compared the language style match with the original text (the experimental class achieved a language restoration accuracy of 78.5%).
Generative Evaluation of Classroom Dialogue
The Flowchart of Thought Evolution: In the discussion of 'My Uncle Jules', a flowchart was used to document the progression of students 'perspectives (e.g.,' sympathy for Philippe → critique of vanity → reflection on social alienation'), serving as evidence 2 for the development of critical thinking.
Cultural responsiveness scale: For non-Chinese speaking students, a "localized interpretation task" is set, such as retelling "Peach Blossom Spring" in the form of Hong Kong's "Banyan Tree Storytelling," to assess cultural translation skills.
3.3 Performance Assessment: Literacy Verification in Realistic Contexts
The Practice of Drama-based Assessment
Fan Jin's Success in the Imperial Examination Adaptation Project:
Script writing Group: Analyze the Satirical Intent of the Description of "Madness", and Keep the Repetition Rhetoric of "Ah! Well! I'm Trapped!" in the Dialogue
Stage Design Group:Using Light Switch to Show the "Vain Life",Assess the Symbol Transformation Ability
After the performance, submit the "Psychological Motivation Analysis Report of the Role", and the teacher will score it according to the SOLO classification method.
Curatorial Task of Community Culture
Using "Suzhou Gardens" as a starting point, we launched the "Exploring the Code of Local Architecture" project:
Research [Photographing Carved Components of Arcade] --> Analysis [Comparing the Characteristics of Xieshan Roof in "Yingzao Fashi"]
--> Create a miniature model of the Lingnan Cultural Corridor
The evaluation focuses on: ① accuracy of architectural terminology ② depth of cultural connotation interpretation ③ creativity of model, and the results are included in the electronic portfolio 7.
3.4 Interdisciplinary Integrated Assessment: Innovative Evaluation Beyond Disciplinary Boundaries
The Exploration of "Science in Ancient Poetry"
An Analysis of the Poem "Viewing the Waterfall of Mount Lu" with the Line "The waterfall plunges down three thousand feet":
Calculate the water impact force using the physics module (requires gravitational acceleration values)
Drawing of Lushan Fault in the Geography Group
The Emotional Amplification Effect of Exaggeration Rhetoric in the Literature Group
The research report was collaboratively completed by the three parties using the 'Interdisciplinary Integration Score Scale'.
4. Teaching Reflection: Effect and Challenge of Paradigm Transformation
4.1 Quantitative Effectiveness Verification
Multidimensional Evidence of Competence Development
Through a longitudinal comparison across 12 schools, the experimental class demonstrated the most significant progress in the dimension of "cultural heritage understanding":
Comparison of Core Competency Achievement Rates Before and After the Reform (N=1267)
dimension of quality | Control class | experimental class | amplification |
Language Construction and Application | 63.7% | 82.3% | +18.6% |
Development and Enhancement of Thinking | 51.2% | 79.8% | +28.6% |
Aesthetic Appreciation and Creation | 48.9% | 74.1% | +25.2% |
Cultural Inheritance and Understanding | 39.5% | 73.6% | +34.1% |
Structural changes in writing ability: The "depth of thought" score rate in the experimental class increased from 32.1% to 67.4%. For instance, students linked "The Foolish Old Man Who Removed the Mountains" to the construction of the Hong Kong-Zhuhai-Macao Bridge, proposing that "the contemporary Foolish Old Man spirit embodies the belief of breaking geographical barriers."
Teacher professional growth indicators: Lesson plan revision frequency increased by 3.2 times. For example, a teacher adjusted the teaching objectives of "Qu Yuan" from "reciting lines" to "appreciating the aesthetics of resistance in the monologue" based on the data from the "Drama Evaluation Scale".
Enhanced diagnostic tool development capabilities: 78% of teachers can independently design learning assessment forms.
4.2 Qualitative Feedback Highlights Educational Value
The Change of Students' Perspective
"Previously, I dreaded writing classical Chinese from memory, but now it feels like solving a puzzle—the character 'ting' in 'all court officials fear his words' is written just like the signboard of 'Wan Chai Police Station' in Hong Kong!" (Chen, a second-year student at Whampoa Middle School)
When designing the 'Rickshaw Museum' exhibition panels for 'Camel Xiangzi', I finally understood the dignity in Xiangzi's sweat." (Lin, a student from Shui Mun Chong Chun Secondary School)
Teacher Observation Record
"Through the 'Banyan Tree Storytelling' task, South Asian student Shahid proactively explored the cultural parallels between 'Ren' (benevolence) in the Analects of Confucius and Sikhism's' seva' (service spirit). Such cultural dialogue transcends the reach of written examinations." (Teacher Zheng, Yau Ma Tei Xundao Secondary School)
4.3 Deep Challenges and Coping Strategies
Structural dilemma
Resource disparity: Only 41% of suburban schools have e-portfolios, compared to 89% in urban areas, with some schools still relying on paper-based 'assessment portfolios' for student progress tracking.
The inertia of quality evaluation: 63% teachers still pay too much attention to the "correct answer" and ignore the thinking process (for example, the "fennel bean description" in "Kong Yiji" is simplified to "action description" as the test point).
Cracking path
Regional collaboration mechanism: Drawing on the experience of the Yangtze River Delta Assessment Alliance, establish a Chinese language assessment resource library for the Guangdong-Hong Kong-Macao Greater Bay Area, sharing situational question banks (e.g., using recordings of Macau teahouse dialogues for listening assessment).
Teacher Training Workshop: Conducting the "Misunderstanding Evaluation Clinic" activity, analyzing typical cases such as the deviation of "simplifying the reading comprehension scoring criteria for 'Red Star Over China' to word count statistics".
5. Conclusion: Constructing a New Ecology of Chinese Language Education Based on Evaluation to Promote Learning
3D Dynamic Evaluation Framework Promotes Three Transformations in Chinese Language Education
The evolution of educational assessment: From "selection tools" to "growth scaffolding" – exemplified by portfolios documenting students 'transition from formulaic writing to authentic expression. A case study of a secondary school's "Growth-Oriented Writing Portfolio" program reveals that through systematic collection of students 'drafts, revisions, reflective journals, and teacher feedback, 85% of participants demonstrated marked improvements in originality and emotional depth within one year. The proportion of personalized expressions in their work surged from 32% to 68%, conclusively proving the efficacy of assessment as growth scaffolding. This paradigm shift prioritizes not just outcomes but also process-oriented skill development and cognitive progression, ensuring evaluations genuinely serve students' holistic development.
Content transition: The deep integration of textbook tasks with regional culture is exemplified by replacing the "Suzhou Gardens" assessment with an analysis of local dwellings. In Hangzhou, Zhejiang, Chinese language teachers guide students to use the "layout beauty" and "detail beauty" from "Suzhou Gardens" as references, conducting field investigations, measurements, and cultural interpretations of local "white walls and black tiles" residences, ultimately completing a research report titled "The Ancient Residence in Front of My Home." This task provides students with concrete understanding of literary aesthetics that combines form and spirit. Surveys indicate that students participating in this activity showed a 40% increase in regional cultural identity and an average 15-point improvement in Chinese language proficiency test scores, effectively achieving the organic integration of textbook content with local characteristics.
Subjective Transition: Students become co-constructors of assessment through self-evaluation tools like the "Metacognitive Self-Assessment for Reading Comprehension". In a primary school's "Reading Self-Evaluation System", students reflect on their reading processes using a multi-dimensional scale covering "goal achievement", "strategy application effectiveness", and "confusion and improvement". Data reveals that students consistently using self-evaluation tools showed a 53% increase in proactive strategy application rates and scored 22% higher in deep reading ability assessments compared to the control group. This demonstrates how students' active role in evaluation fosters learning autonomy and critical thinking, transforming assessment from one-way judgment into a collaborative dialogue and co-constructive process between teachers and students.
Only by rooting the evaluation in the soil of language practice can we realize the true essence of Chinese language education, which is to "cultivate people with culture and guide learning with evaluation".
As highlighted in a 2023 study by the Chinese Language Education Research Institute at Beijing Normal University, schools that integrate assessments into language practice activities—listening, speaking, reading, and writing—achieved a 37% higher rate of students meeting core Chinese language competency standards compared to traditional evaluation methods. This demonstrates the critical importance of combining assessment with practical application. By establishing a three-dimensional dynamic evaluation framework, Chinese language education can break free from exam-oriented constraints, return to its fundamental purpose of nurturing well-rounded individuals, and cultivate talents of the new era who possess not only solid language skills but also cultural confidence and innovative spirit.
References
[1]. Wang Ning. Paradigm Transformation in Chinese Language Teaching Evaluation[J]. Chinese Language Construction, 2025(2):14-18.
[2]. Li Weidong. Practical Approaches of Process Evaluation in Chinese Language Teaching [J]. Chinese Language Teaching in Secondary Schools, 2023(4):29-33.
[3]. Ministry of Education of the People's Republic of China. Compulsory Education Chinese Curriculum Standards (2022 Edition) [S]. Beijing Normal University Press, 2022.
[4]. Wen Rumin. Core Literacy in Chinese Language and Its Evaluation Reform [M]. People's Education Press, 2024:77-82.
[5]. Zhang Xinyan. The Assessment Value of Dialect Resources in Chinese Language Teaching[J]. Curriculum, Textbook and Teaching Method, 2025(1):45-49.
[6]. Chen Dawei. Classroom Observation and Teaching Diagnostic Techniques [M]. East China Normal University Press, 2023:112-115.