Observed Quality of Formative Peer and Self-Assessment in Everyday Mathematics Teaching and its Effects on Student Performance
The positive effect of peer assessment and self-assessment strategies on learners' performance has been widely confirmed in experimental or quasi-.
- Pub. date: April 15, 2022
- Pages: 663-680
- 868 Downloads
- 1342 Views
- 4 Citations
The positive effect of peer assessment and self-assessment strategies on learners' performance has been widely confirmed in experimental or quasi-experimental studies. However, whether peer and self-assessment within everyday mathematics teaching affect student learning and achievement, has rarely been studied. This study aimed to determine with what quality peer and self-assessment occur in everyday mathematics instruction and whether and which students benefit from it in terms of achievement and the learning process. Two lessons on division were video-recorded and rated to determine the quality of peer and self-assessment. Six hundred thirty-four students of fourth-grade primary school classes in German-speaking Switzerland participated in the study and completed a performance test on division. Multilevel analyses showed no general effect of the quality of peer or self-assessment on performance. However, high-quality self-assessment was beneficial for lower-performing students, who used a larger repertoire of calculation strategies, which helped them perform better. In conclusion, peer and self-assessment in real-life settings only have a small effect on the student performance in this Swiss study.
everyday mathematics teaching formative assessment learning process peer assessment self assessment
Keywords: Everyday mathematics teaching, formative assessment, learning process, peer assessment, self-assessment.
References
Alqassab, M. (2016). Peer feedback provision and mathematical proofs: Role of domain knowledge, beliefs, perceptions, epistemic emotions, and peer feedback content [Doctoral thesis, Ludwig-Maximilians University]. Ludwig-Maximilians University. https://bit.ly/3zaTyMK
Altmann, P. C., Fleming, P. B., & Heyburn, S. L. (2010). Understanding and using formative assessments: A mixed methods study of assessment for learning adoption. Vanderbilt University. https://bit.ly/3HocNVZ
Andrade, H. L. (2010). Students as the definitive source of formative assessment: Academic self-assessment and the self-regulation of learning. In H. L. Andrade & G. J. Cizek (Eds.), Handbook of formative assessment (pp. 90–105). Routledge. https://doi.org/10.4324/9780203874851
Andrade, H. L. (2019). A critical review of research on student self-assessment. Frontiers in Education, (4), Article 87. https://doi.org/10.3389/feduc.2019.00087
Andrade, H. L., & Valtcheva, A. (2009). Promoting learning and achievement through self-assessment. Theory into Practice, 48(1), 12–19. https://doi.org/10.1080/00405840802577544
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1-48. https://doi.org/10.18637/jss.v067.i01
Beck, E., Baer, M., Guldimann, T., Bischoff, S., Brühwiler, C., Müller, P., Niedermann, R., Rogalla, M., & Vogt, F. (2008). Adaptive Lehrkompetenz: Analyse und Struktur, Veränderbarkeit und Wirkung handlungssteuernden Lehrerwissens [Adaptive teaching competence: analysis and structure, changeability and effect of action-controlling teacher knowledge]. Waxmann.
Black, P., Harrison, C., Lee, C., Marshall, B., & Wiliam, D. (2004). Working inside the black box: Assessment for learning in the classroom. Phi Delta Kappan, 86(1), 8–21. https://doi.org/10.1177/003172170408600105
Black, P., & Wiliam, D. (2009). Developing the theory of formative assessment. Educational Assessment, Evaluation and Accountability, 21(1), 5–31. https://doi.org/10.1007/s11092-008-9068-5
Bot, T. D. (2020). On categories of mathematics teachers' classroom characteristics and perceived influence on effective mathematics teaching in secondary schools in Plateau state, Nigeria. European Journal of Mathematics and Science Education, 1(2), 121-130. https://doi.org/10.12973/ejmse.1.2.121
Boud, D., Lawson, R., & Thompson, D. G. (2013). Does student engagement in self-assessment calibrate their judgement over time? Assessment & Evaluation in Higher Education, 38(8), 941–956. https://doi.org/10.1080/02602938.2013.769198
Brookhart, S. M., Moss, C. M., & Long, B. A. (2010). Teacher inquiry into formative assessment practices in remedial reading classrooms. Assessment in Education: Principles, Policy & Practice, 17(1), 41–58. https://doi.org/10.1080/09695940903565545
Brown, G. L., & Harris, L. R. (2013). Student self-assessment. In J. H. McMillan (Ed.), SAGE handbook of research on classroom assessment (pp. 367–393). SAGE. https://doi.org/10.4135/9781452218649.n21
Buholzer, A., Baer, M., Zulliger, S., Torchetti, L., Ruelmann, M., Häfliger, A., & Lötscher, H. (2020). Formatives Assessment im alltäglichen Mathematikunterricht von Primarlehrpersonen: Häufigkeit, Dauer und Qualität [Formative assessment in the everyday mathematics teaching of primary teachers: Frequency, duration and quality]. Unterrichtswissenschaft, 48(4), 629–661. https://doi.org/10.1007/s42010-020-00083-7
Bürgermeister, A. (2014). Leistungsbeurteilung im Mathematikunterricht: Bedingungen und Effekte von Beurteilungspraxis und Beurteilungsgenauigkeit [Performance assessment in mathematics education: conditions and effects of assessment practice and assessment accuracy]. Waxmann.
Cardelle-Elawar, M. (1995). Effects of metacognitive instruction on low achievers in mathematics problems. Teaching and Teacher Education, 11(1), 81–95. https://doi.org/10.1016/0742-051X(94)00019-3
Chen, Z., & Siegler, R. S. (2000). II. Overlapping waves theory. Monographs of the Society for Research in Child Development, 65(2), 7–11. https://doi.org/10.1111/1540-5834.00075
Cheng, L., & Wang, X. (2007). Grading, feedback, and reporting in ESL/EFL classrooms. Language Assessment Quarterly, 4(1), 85–107. https://doi.org/10.1080/15434300701348409
Cizek, G. J. (2010). An introduction to formative assessment: History, characteristics, and challenges. In H. L. Andrade & G. J. Cizek (Eds.), Handbook of formative assessment (pp. 3–17). Routledge. https://doi.org/10.4324/9780203874851
Decristan, J., Klieme, E., Kunter, M., Hochweber, J., Büttner, G., Fauth, B., Hondrich, A. L., Rieser, S., Hertel, S., & Hardy, I. (2015). Embedded formative assessment and classroom process quality. American Educational Research Journal, 52(6), 1133–1159. https://doi.org/10.3102/0002831215596412
Deiglmayr, A. (2018). Instructional scaffolds for learning from formative peer assessment: Effects of core task, peer feedback, and dialogue. European Journal of Psychology of Education, 33(1), 185–198. https://doi.org/10.1007/s10212-017-0355-8
Double, K. S., McGrane, J. A., & Hopfenbeck, T. N. (2019). The impact of peer assessment on academic performance: A meta-analysis of control group studies. Educational Psychology Review, 32(2), 481–509. https://doi.org/10.1007/s10648-019-09510-3
Fagginger Auer, M. F., Hickendorff, M., & van Putten, C. M. (2016). Solution strategies and adaptivity in multidigit division in a choice/no-choice experiment: Student and instructional factors. Learning and Instruction, 41, 52–59. https://doi.org/10.1016/j.learninstruc.2015.09.008
Fauth, B., Decristan, J., Rieser, S., Klieme, E., & Büttner, G. (2014). Grundschulunterricht aus Schüler-, Lehrer- und Beobachterperspektive: Zusammenhänge und Vorhersage von Lernerfolg [Primary school teaching from pupil, teacher and observer perspectives: Interrelationships and prediction of learning success]. Zeitschrift für Pädagogische Psychologie, 28(3), 127–137. https://doi.org/10.1024/1010-0652/a000129
Federal Statistical Office. (2021). Lehrkräfte nach Bildungsstufe (öffentliche Schulen) [Teachers by education level (public schools)]. Federal Statistical Office. https://bit.ly/3sUpTq1
Funder, D. C., & Ozer, D. J. (2019). Evaluating effect size in psychological research: Sense and nonsense. Advances in Methods and Practices in Psychological Science, 2(2), 156–168. https://doi.org/10.1177/2515245919847202
Geary, D. C., & Hoard, M. K. (2005). Learning disabilities in arithmetic and mathematics. In J. I. D. Campbell (Ed.), Handbook of mathematical cognition (pp. 253–267). Psychology Press.
Gersten, R., Chard, D. J., Jayanthi, M., Baker, S. K., Morphy, P., & Flojo, J. (2009). Mathematics instruction for students with learning disabilities: A meta-analysis of instructional components. Review of Educational Research, 79(3), 1202–1242. https://doi.org/10.3102/0034654309334431
Geurten, M., & Lemaire, P. (2019). Metacognition for strategy selection during arithmetic problem-solving in young and older adults. Neuropsychology, Development, and Cognition. Section B, Aging, Neuropsychology and Cognition, 26(3), 424–446. https://doi.org/10.1080/13825585.2018.1464114
Gielen, S., Peeters, E., Dochy, F., Onghena, P., & Struyven, K. (2010). Improving the effectiveness of peer feedback for learning. Learning and Instruction, 20(4), 304–315. https://doi.org/10.1016/j.learninstruc.2009.08.007
Gotwals, A. W., Philhower, J., Cisterna, D., & Bennett, S. (2015). Using video to examine formative assessment practices as measures of expertise for mathematics and science teachers. International Journal of Science and Mathematics Education, 13(2), 405–423. https://doi.org/10.1007/s10763-015-9623-8
Graham, S., Hebert, M., & Harris, K. R. (2015). Formative assessment and writing. The Elementary School Journal, 115(4), 523–547. https://doi.org/10.1086/681947
Grob, R., Holmeier, M., & Labudde, P. (2019). Analysing formal formative assessment activities in the context of inquiry at primary and upper secondary school in Switzerland. International Journal of Science Education, 43(3), 407-427. https://doi.org/10.1080/09500693.2019.1663453
Groeben, N., Wahl, D., Schlee, J., & Scheele, B. (1988). Das Forschungsprogramm subjektive Theorien: eine Einführung in die Psychologie des reflexiven Subjekts [The research programme subjective theories: an introduction to the psychology of the reflexive subject]. Francke.
Harris, L. R., & Brown, G. T. (2013). Opportunities and obstacles to consider when using peer- and self-assessment to improve student learning: Case studies into teachers' implementation. Teaching and Teacher Education, 36, 101–111. https://doi.org/10.1016/j.tate.2013.07.008
Hartnett, J. (2007). Categorisation of mental computation strategies to support teaching and to encourage classroom dialogue. In J. Watson & K. Beswick (Eds.), Mathematics: Essential research, essential practice (pp. 345–352). MERGA.
Heinze, A., Star, J. R., & Verschaffel, L. (2009). Flexible and adaptive use of strategies and representations in mathematics education. ZDM, 41(5), 535–540. https://doi.org/10.1007/s11858-009-0214-4
Heritage, M. (2020). Getting the emphasis right: Formative assessment through professional learning. Educational Assessment, 25(4), 355–358. https://doi.org/10.1080/10627197.2020.1766959
Hickendorff, M., Torbeyns, J., & Verschaffel, L. (2019). Multi-digit addition, subtraction, multiplication, and division strategies. In A. Fritz, V. G. Haase, & P. Räsänen (Eds.), International handbook of mathematical learning difficulties (pp. 543–560). Springer. https://doi.org/10.1007/978-3-319-97148-3_32
Hill, T. (2016). Do accounting students believe in self-assessment? Accounting Education, 25(4), 291–305. https://doi.org/10.1080/09639284.2016.1191271
Hox, J. J. (1998). Multilevel modeling: When and why. In I. Balderjahn, R. Mathar & M. Schader (Eds.), Classification, data analysis, and data highways (pp. 147–154). Springer. https://doi.org/10.1007/978-3-642-72087-1_17
Hugener, I., Pauli, C., & Reusser, K. (2006). Dokumentation der Erhebungs- und Auswertungsinstrumente zur schweizerisch-deutschen Videostudie "Unterrichtsqualität, Lernverhalten und mathematisches Verständnis": 3. Videoanalysen [Documentation of the survey and evaluation instruments for the Swiss-German video study "Teaching quality, learning behaviour and mathematical understanding": 3. video analyses.]. Gesellschaft z. Förd. Päd. Forsch. https://bit.ly/32QO1if
Kingston, N., & Nash, B. (2011). Formative assessment: A meta-analysis and a call for research. Educational Measurement: Issues and Practice, 30(4), 28–37. https://doi.org/10.1111/j.1745-3992.2011.00220.x
Kollar, I., & Fischer, F. (2010). Peer assessment as collaborative learning: A cognitive perspective. Learning and Instruction, 20(4), 344–348. https://doi.org/10.1016/j.learninstruc.2009.08.005
Kraft, M. A. (2020). Interpreting effect sizes of education interventions. Educational Researcher, 49(4), 241–253. https://doi.org/10.3102/0013189X20912798
Krammer, K., & Hugener, I. (2014). Förderung der Analysekompetenz angehender Lehrpersonen anhand von eigenen und fremden Unterrichtsvideos [Promoting the analytical competence of prospective teachers by means of their own and other people's teaching videos]. Journal für LehrerInnenbildung, 14(1), 25–32. https://bit.ly/3HJT1Vp
Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2017). lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software, 82(13), 1-26. https://doi.org/10.18637/jss.v082.i13
Lindberg, S., Linkersdörfer, J., Lehmann, M., Hasselhorn, M., & Lonnemann, J. (2013). Individual differences in children's early strategy behavior in arithmetic tasks. Journal of Educational and Developmental Psychology, 3(1), 192-200. https://doi.org/10.5539/jedp.v3n1p192
Lyon, C. J., Nabors Oláh, L., & Wylie, E. C. (2019). Working toward integrated practice: Understanding the interaction among formative assessment strategies. The Journal of Educational Research, 112(3), 301–314. https://doi.org/10.1080/00220671.2018.1514359
Maier, U. (2011). Formative Leistungsdiagnostik in der Sekundarstufe I - Befunde einer quantitativen Lehrerbefragung zu Nutzung und Korrelaten verschiedener Typen formativer Diagnosemethoden in Gymnasien [Formative performance diagnostics in lower secondary schools - Findings from a quantitative teacher survey on the use and correlates of different types of formative diagnostic methods in grammar schools]. Empirische Pädagogik, 25(1), 25–46.
Miller, T. M., & Geraci, L. (2011). Unskilled but aware: Reinterpreting overconfidence in low-performing students. Journal of Experimental Psychology. Learning, Memory, and Cognition, 37(2), 502–506. https://doi.org/10.1037/a0021802
Ohadugha, R. O., Chukwuemeka, E. J., & Babatunde, A. E. (2020). Impact of peer-mediated learning on achievement and motivation in computer science among senior secondary school students in Minna Metropolis, Niger State. Contemporary Educational Technology, 12(1), ep263. https://doi.org/10.30935/cedtech/7629
Oswalt, S. G. (2013). Identifying formative assessment in classroom instruction [Doctoral thesis, Boise State University]. Boise State University Scholar Works. https://bit.ly/3qF256K
Panadero, E. (2016). Is it safe? Social, interpersonal, and human effects of peer assessment: A review and future directions. In G. T. L. Brown & L. R. Harris (Eds.), Handbook of social and human conditions in assessment (pp. 247–266). Routledge.
Panadero, E., Brown, G. L., & Strijbos, J. ‑W. (2016). The future of student self-assessment: A review of known unknowns and potential directions. Educational Psychology Review, 28(4), 803–830. https://doi.org/10.1007/s10648-015-9350-2
Pantiwati, Y., & Husamah, H. (2017). Self and peer Assessments in active learning model to increase metacognitive awareness and cognitive abilities. International Journal of Instruction, 10(4), 185–202. https://doi.org/10.12973/iji.2017.10411a
Pauli, C. (2012). Kodierende Beobachtung [Coding observation]. In H. de Boer & S. Reh (Eds.), Beobachtung in der Schule - Beobachten lernen [Observation at school - learn to observe] (pp. 45–63). Springer. https://doi.org/10.1007/978-3-531-18938-3_3
Pennequin, V., Sorel, O., Nanty, I., & Fontaine, R. (2010). Metacognition and low achievement in mathematics: The effect of training in the use of metacognitive skills to solve mathematical word problems. Thinking & Reasoning, 16(3), 198–220. https://doi.org/10.1080/13546783.2010.509052
Pianta, R. C., & Hamre, B. K. (2009). Classroom processes and positive youth development: Conceptualizing, measuring, and improving the capacity of interactions between teachers and students. New Directions for Youth Development, 2009(121), 33–46. https://doi.org/10.1002/yd.295
Ploegh, K., Tillema, H. H., & Segers, M. S. (2009). In search of quality criteria in peer assessment practices. Studies in Educational Evaluation, 35(2-3), 102–109. https://doi.org/10.1016/j.stueduc.2009.05.001
Praetorius, A. ‑K. (2014). Messung von Unterrichtsqualität durch Ratings [Measuring teaching quality through ratings]. Waxmann.
Praetorius, A. ‑K., Pauli, C., Reusser, K., Rakoczy, K., & Klieme, E. (2014). One lesson is all you need? Stability of instructional quality across lessons. Learning and Instruction, 31, 2–12. https://doi.org/10.1016/j.learninstruc.2013.12.002
R Core Team. (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
Reusser, K., & Pauli, C. (2021). Unterrichtsqualität ist immer generisch und fachspezifisch. Ein Kommentar aus kognitions- und lehr-lerntheoretischer Sicht [Teaching quality is always generic and subject-specific. A commentary from a cognitive and learning theory perspective.]. Unterrichtswissenschaft, 49(2), 189–202. https://doi.org/10.1007/s42010-021-00117-8
Ross, J. A., Rolheiser, C., & Hogaboam-Gray, A. (1999). Effect of self-evaluation on narrative writing. Assessing Writing, 6(1), 107–132. https://doi.org/10.1016/S1075-2935(99)00003-3
Ruiz-Primo, M. A., & Furtak, E. M. (2006). Informal formative assessment and scientific inquiry: Exploring teachers' practices and student learning. Educational Assessment, 11(3-4), 237–263. https://doi.org/10.1080/10627197.2006.9652991
Sadler, P., & Good, E. (2006). The impact of self- and peer-grading on student learning. Educational Assessment, 11(1), 1–31. https://doi.org/10.1207/s15326977ea1101_1
Sanchez, C. E., Atkinson, K. M., Koenka, A. C., Moshontz, H., & Cooper, H. (2017). Self-grading and peer-grading for formative and summative assessments in 3rd through 12th grade classrooms: A meta-analysis. Journal of Educational Psychology, 109(8), 1049-1066. https://doi.org/10.1037/edu0000190
Schmidt, C. A. (2020). Formatives Assessment in der Grundschule: Konzept, Einschätzungen der Lehrkräfte und Zusammenhänge [Formative assessment in primary school: Concept, teachers' assessments and contexts] (1st ed.). Springer. https://doi.org/10.1007/978-3-658-26921-0
Schnell, R., Hill, P. B., & Esser, E. (2013). Methoden der empirischen Sozialforschung [Methods of empirical social research] (10th revised ed.). Oldenbourg Verlag.
Schulz, A. (2015). Wie lösen Viertklässler Rechenaufgaben zur Multiplikation und Division? [How do fourth graders solve multiplication and division problems?] In F. Caluori, H. Linneweber-Lammerskitten & C. Streit (Eds.), Beiträge zum Mathematikunterricht 2015 (pp. 844–847). WTM. https://doi.org/10.17877/DE290R-16783
Schulz, A., & Leuders, T. (2018). Learning trajectories towards strategy proficiency in multi-digit division – A latent transition analysis of strategy and error profiles. Learning and Individual Differences, 66, 54–69. https://doi.org/10.1016/j.lindif.2018.04.014
Schütze, B., Souvignier, E., & Hasselhorn, M. (2018). Stichwort – Formatives Assessment [Keyword - Formative assessment]. Zeitschrift für Erziehungswissenschaft, 21(4), 697–715. https://doi.org/10.1007/s11618-018-0838-7
Siegler, R. S. (1996). A grand theory of development. Monographs of the Society for Research in Child Development, 61(1-2), 266–275. https://doi.org/10.1111/j.1540-5834.1996.tb00550.x
Siegler, R. S. (2002). Microgenetic studies of self-explanation. In N. Granott & J. Parziale (Eds.), Microdevelopment (pp. 31–58). Cambridge University Press. https://doi.org/10.1017/CBO9780511489709.002
Siegler, R. S. (2007). Cognitive variability. Developmental Science, 10(1), 104–109. https://doi.org/10.1111/j.1467-7687.2007.00571.x
Strijbos, J. ‑W., Narciss, S., & Dünnebier, K. (2010). Peer feedback content and sender's competence level in academic writing revision tasks: Are they critical for feedback perceptions and efficiency? Learning and Instruction, 20(4), 291–303. https://doi.org/10.1016/j.learninstruc.2009.08.008
Strijbos, J. ‑W., & Wichmann, A. (2017). Promoting learning by leveraging the collaborative nature of formative peer assessment with instructional scaffolds. European Journal of Psychology of Education, 33(1), 1–9. https://doi.org/10.1007/s10212-017-0353-x
Tingley, D., Yamamoto, T., Hirose, K., Keele, L., & Imai, K. (2014). Mediation: R package for causal mediation analysis. Journal of Statistical Software, 59(5), 1-38. https://doi.org/10.18637/jss.v059.i05
Topping, K. J. (2009). Peer assessment. Theory into Practice, 48(1), 20–27. https://doi.org/10.1080/00405840802577569
Waldis, M., Grob, U., Pauli, C., & Reusser, K. (2010). Der schweizerische Mathematikunterricht aus der Sicht von Schülerinnen und Schülern und in der Perspektive hochinferenter Beobachterurteile [Swiss mathematics teaching from the perspective of students and in the perspective of highly-inferental ratings]. In K. Reusser, C. Pauli & M. Waldis (Eds.), Unterrichtsgestaltung und Unterrichtsqualität: Ergebnisse einer internationalen und schweizerischen Videostudie zum Mathematikunterricht [Lesson design and teaching quality: Results of an international and Swiss video study on mathematics teaching] (pp. 171–208). Waxmann.
Wylie, C., & Lyon, C. (2013). Using the formative assessment rubrics, reflection and observation tools to support professional reflection on practice. Council of Chief State School Officers (CCSSO). https://bit.ly/3Jub5Em