Observed Quality of Formative Peer and Self-Assessment in Everyday Mathematics Teaching and its Effects on Student Performance

Sandra Zulliger; Alois Buholzer; Merle Ruelmann

doi:10.12973/eu-jer.11.2.663

Research Article

Observed Quality of Formative Peer and Self-Assessment in Everyday Mathematics Teaching and its Effects on Student Performance

Sandra Zulliger , Alois Buholzer , Merle Ruelmann

https://doi.org/10.12973/eu-jer.11.2.663

Pub. date: April 15, 2022
Pages: 663-680
925 Downloads
1862 Views
5 Citations

The positive effect of peer assessment and self-assessment strategies on learners' performance has been widely confirmed in experimental or quasi-experimental studies. However, whether peer and self-assessment within everyday mathematics teaching affect student learning and achievement, has rarely been studied. This study aimed to determine with what quality peer and self-assessment occur in everyday mathematics instruction and whether and which students benefit from it in terms of achievement and the learning process. Two lessons on division were video-recorded and rated to determine the quality of peer and self-assessment. Six hundred thirty-four students of fourth-grade primary school classes in German-speaking Switzerland participated in the study and completed a performance test on division. Multilevel analyses showed no general effect of the quality of peer or self-assessment on performance. However, high-quality self-assessment was beneficial for lower-performing students, who used a larger repertoire of calculation strategies, which helped them perform better. In conclusion, peer and self-assessment in real-life settings only have a small effect on the student performance in this Swiss study.

Keywords:
everyday mathematics teaching formative assessment learning process peer assessment self assessment

PDF

Metrics

Cite

Citations

Article Metrics

Views

925

Download

1862

Citations

Crossref
5

Scopus
4

References

Alqassab, M. (2016). Peer feedback provision and mathematical proofs: Role of domain knowledge, beliefs, perceptions, epistemic emotions, and peer feedback content [Doctoral thesis, Ludwig-Maximilians University]. Ludwig-Maximilians University. https://bit.ly/3zaTyMK

Altmann, P. C., Fleming, P. B., & Heyburn, S. L. (2010). Understanding and using formative assessments: A mixed methods study of assessment for learning adoption. Vanderbilt University. https://bit.ly/3HocNVZ

Andrade, H. L. (2010). Students as the definitive source of formative assessment: Academic self-assessment and the self-regulation of learning. In H. L. Andrade & G. J. Cizek (Eds.), Handbook of formative assessment (pp. 90–105). Routledge. https://doi.org/10.4324/9780203874851

Andrade, H. L. (2019). A critical review of research on student self-assessment. Frontiers in Education, (4), Article 87. https://doi.org/10.3389/feduc.2019.00087

Andrade, H. L., & Valtcheva, A. (2009). Promoting learning and achievement through self-assessment. Theory into Practice, 48(1), 12–19. https://doi.org/10.1080/00405840802577544

Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1-48. https://doi.org/10.18637/jss.v067.i01

Beck, E., Baer, M., Guldimann, T., Bischoff, S., Brühwiler, C., Müller, P., Niedermann, R., Rogalla, M., & Vogt, F. (2008). Adaptive Lehrkompetenz: Analyse und Struktur, Veränderbarkeit und Wirkung handlungssteuernden Lehrerwissens [Adaptive teaching competence: analysis and structure, changeability and effect of action-controlling teacher knowledge]. Waxmann.

Black, P., Harrison, C., Lee, C., Marshall, B., & Wiliam, D. (2004). Working inside the black box: Assessment for learning in the classroom. Phi Delta Kappan, 86(1), 8–21. https://doi.org/10.1177/003172170408600105

Black, P., & Wiliam, D. (2009). Developing the theory of formative assessment. Educational Assessment, Evaluation and Accountability, 21(1), 5–31. https://doi.org/10.1007/s11092-008-9068-5

Bot, T. D. (2020). On categories of mathematics teachers' classroom characteristics and perceived influence on effective mathematics teaching in secondary schools in Plateau state, Nigeria. European Journal of Mathematics and Science Education, 1(2), 121-130. https://doi.org/10.12973/ejmse.1.2.121

Boud, D., Lawson, R., & Thompson, D. G. (2013). Does student engagement in self-assessment calibrate their judgement over time? Assessment & Evaluation in Higher Education, 38(8), 941–956. https://doi.org/10.1080/02602938.2013.769198

Brookhart, S. M., Moss, C. M., & Long, B. A. (2010). Teacher inquiry into formative assessment practices in remedial reading classrooms. Assessment in Education: Principles, Policy & Practice, 17(1), 41–58. https://doi.org/10.1080/09695940903565545

Brown, G. L., & Harris, L. R. (2013). Student self-assessment. In J. H. McMillan (Ed.), SAGE handbook of research on classroom assessment (pp. 367–393). SAGE. https://doi.org/10.4135/9781452218649.n21

Buholzer, A., Baer, M., Zulliger, S., Torchetti, L., Ruelmann, M., Häfliger, A., & Lötscher, H. (2020). Formatives Assessment im alltäglichen Mathematikunterricht von Primarlehrpersonen: Häufigkeit, Dauer und Qualität [Formative assessment in the everyday mathematics teaching of primary teachers: Frequency, duration and quality]. Unterrichtswissenschaft, 48(4), 629–661. https://doi.org/10.1007/s42010-020-00083-7

Bürgermeister, A. (2014). Leistungsbeurteilung im Mathematikunterricht: Bedingungen und Effekte von Beurteilungspraxis und Beurteilungsgenauigkeit [Performance assessment in mathematics education: conditions and effects of assessment practice and assessment accuracy]. Waxmann.

Cardelle-Elawar, M. (1995). Effects of metacognitive instruction on low achievers in mathematics problems. Teaching and Teacher Education, 11(1), 81–95. https://doi.org/10.1016/0742-051X(94)00019-3

Chen, Z., & Siegler, R. S. (2000). II. Overlapping waves theory. Monographs of the Society for Research in Child Development, 65(2), 7–11. https://doi.org/10.1111/1540-5834.00075

Cheng, L., & Wang, X. (2007). Grading, feedback, and reporting in ESL/EFL classrooms. Language Assessment Quarterly, 4(1), 85–107. https://doi.org/10.1080/15434300701348409

Cizek, G. J. (2010). An introduction to formative assessment: History, characteristics, and challenges. In H. L. Andrade & G. J. Cizek (Eds.), Handbook of formative assessment (pp. 3–17). Routledge. https://doi.org/10.4324/9780203874851

Decristan, J., Klieme, E., Kunter, M., Hochweber, J., Büttner, G., Fauth, B., Hondrich, A. L., Rieser, S., Hertel, S., & Hardy, I. (2015). Embedded formative assessment and classroom process quality. American Educational Research Journal, 52(6), 1133–1159. https://doi.org/10.3102/0002831215596412

Deiglmayr, A. (2018). Instructional scaffolds for learning from formative peer assessment: Effects of core task, peer feedback, and dialogue. European Journal of Psychology of Education, 33(1), 185–198. https://doi.org/10.1007/s10212-017-0355-8

Double, K. S., McGrane, J. A., & Hopfenbeck, T. N. (2019). The impact of peer assessment on academic performance: A meta-analysis of control group studies. Educational Psychology Review, 32(2), 481–509. https://doi.org/10.1007/s10648-019-09510-3

Fagginger Auer, M. F., Hickendorff, M., & van Putten, C. M. (2016). Solution strategies and adaptivity in multidigit division in a choice/no-choice experiment: Student and instructional factors. Learning and Instruction, 41, 52–59. https://doi.org/10.1016/j.learninstruc.2015.09.008

Fauth, B., Decristan, J., Rieser, S., Klieme, E., & Büttner, G. (2014). Grundschulunterricht aus Schüler-, Lehrer- und Beobachterperspektive: Zusammenhänge und Vorhersage von Lernerfolg [Primary school teaching from pupil, teacher and observer perspectives: Interrelationships and prediction of learning success]. Zeitschrift für Pädagogische Psychologie, 28(3), 127–137. https://doi.org/10.1024/1010-0652/a000129

Federal Statistical Office. (2021). Lehrkräfte nach Bildungsstufe (öffentliche Schulen) [Teachers by education level (public schools)]. Federal Statistical Office. https://bit.ly/3sUpTq1

Funder, D. C., & Ozer, D. J. (2019). Evaluating effect size in psychological research: Sense and nonsense. Advances in Methods and Practices in Psychological Science, 2(2), 156–168. https://doi.org/10.1177/2515245919847202

Geary, D. C., & Hoard, M. K. (2005). Learning disabilities in arithmetic and mathematics. In J. I. D. Campbell (Ed.), Handbook of mathematical cognition (pp. 253–267). Psychology Press.

Gersten, R., Chard, D. J., Jayanthi, M., Baker, S. K., Morphy, P., & Flojo, J. (2009). Mathematics instruction for students with learning disabilities: A meta-analysis of instructional components. Review of Educational Research, 79(3), 1202–1242. https://doi.org/10.3102/0034654309334431

Geurten, M., & Lemaire, P. (2019). Metacognition for strategy selection during arithmetic problem-solving in young and older adults. Neuropsychology, Development, and Cognition. Section B, Aging, Neuropsychology and Cognition, 26(3), 424–446. https://doi.org/10.1080/13825585.2018.1464114

Gielen, S., Peeters, E., Dochy, F., Onghena, P., & Struyven, K. (2010). Improving the effectiveness of peer feedback for learning. Learning and Instruction, 20(4), 304–315. https://doi.org/10.1016/j.learninstruc.2009.08.007

Gotwals, A. W., Philhower, J., Cisterna, D., & Bennett, S. (2015). Using video to examine formative assessment practices as measures of expertise for mathematics and science teachers. International Journal of Science and Mathematics Education, 13(2), 405–423. https://doi.org/10.1007/s10763-015-9623-8

Graham, S., Hebert, M., & Harris, K. R. (2015). Formative assessment and writing. The Elementary School Journal, 115(4), 523–547. https://doi.org/10.1086/681947

Grob, R., Holmeier, M., & Labudde, P. (2019). Analysing formal formative assessment activities in the context of inquiry at primary and upper secondary school in Switzerland. International Journal of Science Education, 43(3), 407-427. https://doi.org/10.1080/09500693.2019.1663453

Groeben, N., Wahl, D., Schlee, J., & Scheele, B. (1988). Das Forschungsprogramm subjektive Theorien: eine Einführung in die Psychologie des reflexiven Subjekts [The research programme subjective theories: an introduction to the psychology of the reflexive subject]. Francke.

Harris, L. R., & Brown, G. T. (2013). Opportunities and obstacles to consider when using peer- and self-assessment to improve student learning: Case studies into teachers' implementation. Teaching and Teacher Education, 36, 101–111. https://doi.org/10.1016/j.tate.2013.07.008

Hartnett, J. (2007). Categorisation of mental computation strategies to support teaching and to encourage classroom dialogue. In J. Watson & K. Beswick (Eds.), Mathematics: Essential research, essential practice (pp. 345–352). MERGA.

Heinze, A., Star, J. R., & Verschaffel, L. (2009). Flexible and adaptive use of strategies and representations in mathematics education. ZDM, 41(5), 535–540. https://doi.org/10.1007/s11858-009-0214-4

Heritage, M. (2020). Getting the emphasis right: Formative assessment through professional learning. Educational Assessment, 25(4), 355–358. https://doi.org/10.1080/10627197.2020.1766959

Hickendorff, M., Torbeyns, J., & Verschaffel, L. (2019). Multi-digit addition, subtraction, multiplication, and division strategies. In A. Fritz, V. G. Haase, & P. Räsänen (Eds.), International handbook of mathematical learning difficulties (pp. 543–560). Springer. https://doi.org/10.1007/978-3-319-97148-3_32

Hill, T. (2016). Do accounting students believe in self-assessment? Accounting Education, 25(4), 291–305. https://doi.org/10.1080/09639284.2016.1191271

Hox, J. J. (1998). Multilevel modeling: When and why. In I. Balderjahn, R. Mathar & M. Schader (Eds.), Classification, data analysis, and data highways (pp. 147–154). Springer. https://doi.org/10.1007/978-3-642-72087-1_17

Hugener, I., Pauli, C., & Reusser, K. (2006). Dokumentation der Erhebungs- und Auswertungsinstrumente zur schweizerisch-deutschen Videostudie "Unterrichtsqualität, Lernverhalten und mathematisches Verständnis": 3. Videoanalysen [Documentation of the survey and evaluation instruments for the Swiss-German video study "Teaching quality, learning behaviour and mathematical understanding": 3. video analyses.]. Gesellschaft z. Förd. Päd. Forsch. https://bit.ly/32QO1if

Kingston, N., & Nash, B. (2011). Formative assessment: A meta-analysis and a call for research. Educational Measurement: Issues and Practice, 30(4), 28–37. https://doi.org/10.1111/j.1745-3992.2011.00220.x

Kollar, I., & Fischer, F. (2010). Peer assessment as collaborative learning: A cognitive perspective. Learning and Instruction, 20(4), 344–348. https://doi.org/10.1016/j.learninstruc.2009.08.005

Kraft, M. A. (2020). Interpreting effect sizes of education interventions. Educational Researcher, 49(4), 241–253. https://doi.org/10.3102/0013189X20912798

Krammer, K., & Hugener, I. (2014). Förderung der Analysekompetenz angehender Lehrpersonen anhand von eigenen und fremden Unterrichtsvideos [Promoting the analytical competence of prospective teachers by means of their own and other people's teaching videos]. Journal für LehrerInnenbildung, 14(1), 25–32. https://bit.ly/3HJT1Vp

Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2017). lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software, 82(13), 1-26. https://doi.org/10.18637/jss.v082.i13

Lindberg, S., Linkersdörfer, J., Lehmann, M., Hasselhorn, M., & Lonnemann, J. (2013). Individual differences in children's early strategy behavior in arithmetic tasks. Journal of Educational and Developmental Psychology, 3(1), 192-200. https://doi.org/10.5539/jedp.v3n1p192

Lyon, C. J., Nabors Oláh, L., & Wylie, E. C. (2019). Working toward integrated practice: Understanding the interaction among formative assessment strategies. The Journal of Educational Research, 112(3), 301–314. https://doi.org/10.1080/00220671.2018.1514359

Maier, U. (2011). Formative Leistungsdiagnostik in der Sekundarstufe I - Befunde einer quantitativen Lehrerbefragung zu Nutzung und Korrelaten verschiedener Typen formativer Diagnosemethoden in Gymnasien [Formative performance diagnostics in lower secondary schools - Findings from a quantitative teacher survey on the use and correlates of different types of formative diagnostic methods in grammar schools]. Empirische Pädagogik, 25(1), 25–46.

Miller, T. M., & Geraci, L. (2011). Unskilled but aware: Reinterpreting overconfidence in low-performing students. Journal of Experimental Psychology. Learning, Memory, and Cognition, 37(2), 502–506. https://doi.org/10.1037/a0021802

Ohadugha, R. O., Chukwuemeka, E. J., & Babatunde, A. E. (2020). Impact of peer-mediated learning on achievement and motivation in computer science among senior secondary school students in Minna Metropolis, Niger State. Contemporary Educational Technology, 12(1), ep263. https://doi.org/10.30935/cedtech/7629

Oswalt, S. G. (2013). Identifying formative assessment in classroom instruction [Doctoral thesis, Boise State University]. Boise State University Scholar Works. https://bit.ly/3qF256K

Panadero, E. (2016). Is it safe? Social, interpersonal, and human effects of peer assessment: A review and future directions. In G. T. L. Brown & L. R. Harris (Eds.), Handbook of social and human conditions in assessment (pp. 247–266). Routledge.

Panadero, E., Brown, G. L., & Strijbos, J. ‑W. (2016). The future of student self-assessment: A review of known unknowns and potential directions. Educational Psychology Review, 28(4), 803–830. https://doi.org/10.1007/s10648-015-9350-2

Pantiwati, Y., & Husamah, H. (2017). Self and peer Assessments in active learning model to increase metacognitive awareness and cognitive abilities. International Journal of Instruction, 10(4), 185–202. https://doi.org/10.12973/iji.2017.10411a

Pauli, C. (2012). Kodierende Beobachtung [Coding observation]. In H. de Boer & S. Reh (Eds.), Beobachtung in der Schule - Beobachten lernen [Observation at school - learn to observe] (pp. 45–63). Springer. https://doi.org/10.1007/978-3-531-18938-3_3

Pennequin, V., Sorel, O., Nanty, I., & Fontaine, R. (2010). Metacognition and low achievement in mathematics: The effect of training in the use of metacognitive skills to solve mathematical word problems. Thinking & Reasoning, 16(3), 198–220. https://doi.org/10.1080/13546783.2010.509052

Pianta, R. C., & Hamre, B. K. (2009). Classroom processes and positive youth development: Conceptualizing, measuring, and improving the capacity of interactions between teachers and students. New Directions for Youth Development, 2009(121), 33–46. https://doi.org/10.1002/yd.295

Ploegh, K., Tillema, H. H., & Segers, M. S. (2009). In search of quality criteria in peer assessment practices. Studies in Educational Evaluation, 35(2-3), 102–109. https://doi.org/10.1016/j.stueduc.2009.05.001

Praetorius, A. ‑K. (2014). Messung von Unterrichtsqualität durch Ratings [Measuring teaching quality through ratings]. Waxmann.

Praetorius, A. ‑K., Pauli, C., Reusser, K., Rakoczy, K., & Klieme, E. (2014). One lesson is all you need? Stability of instructional quality across lessons. Learning and Instruction, 31, 2–12. https://doi.org/10.1016/j.learninstruc.2013.12.002

R Core Team. (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/

Reusser, K., & Pauli, C. (2021). Unterrichtsqualität ist immer generisch und fachspezifisch. Ein Kommentar aus kognitions- und lehr-lerntheoretischer Sicht [Teaching quality is always generic and subject-specific. A commentary from a cognitive and learning theory perspective.]. Unterrichtswissenschaft, 49(2), 189–202. https://doi.org/10.1007/s42010-021-00117-8

Ross, J. A., Rolheiser, C., & Hogaboam-Gray, A. (1999). Effect of self-evaluation on narrative writing. Assessing Writing, 6(1), 107–132. https://doi.org/10.1016/S1075-2935(99)00003-3

Ruiz-Primo, M. A., & Furtak, E. M. (2006). Informal formative assessment and scientific inquiry: Exploring teachers' practices and student learning. Educational Assessment, 11(3-4), 237–263. https://doi.org/10.1080/10627197.2006.9652991

Sadler, P., & Good, E. (2006). The impact of self- and peer-grading on student learning. Educational Assessment, 11(1), 1–31. https://doi.org/10.1207/s15326977ea1101_1

Sanchez, C. E., Atkinson, K. M., Koenka, A. C., Moshontz, H., & Cooper, H. (2017). Self-grading and peer-grading for formative and summative assessments in 3rd through 12th grade classrooms: A meta-analysis. Journal of Educational Psychology, 109(8), 1049-1066. https://doi.org/10.1037/edu0000190

Schmidt, C. A. (2020). Formatives Assessment in der Grundschule: Konzept, Einschätzungen der Lehrkräfte und Zusammenhänge [Formative assessment in primary school: Concept, teachers' assessments and contexts] (1st ed.). Springer. https://doi.org/10.1007/978-3-658-26921-0

Schnell, R., Hill, P. B., & Esser, E. (2013). Methoden der empirischen Sozialforschung [Methods of empirical social research] (10th revised ed.). Oldenbourg Verlag.

Schulz, A. (2015). Wie lösen Viertklässler Rechenaufgaben zur Multiplikation und Division? [How do fourth graders solve multiplication and division problems?] In F. Caluori, H. Linneweber-Lammerskitten & C. Streit (Eds.), Beiträge zum Mathematikunterricht 2015 (pp. 844–847). WTM. https://doi.org/10.17877/DE290R-16783

Schulz, A., & Leuders, T. (2018). Learning trajectories towards strategy proficiency in multi-digit division – A latent transition analysis of strategy and error profiles. Learning and Individual Differences, 66, 54–69. https://doi.org/10.1016/j.lindif.2018.04.014

Schütze, B., Souvignier, E., & Hasselhorn, M. (2018). Stichwort – Formatives Assessment [Keyword - Formative assessment]. Zeitschrift für Erziehungswissenschaft, 21(4), 697–715. https://doi.org/10.1007/s11618-018-0838-7

Siegler, R. S. (1996). A grand theory of development. Monographs of the Society for Research in Child Development, 61(1-2), 266–275. https://doi.org/10.1111/j.1540-5834.1996.tb00550.x

Siegler, R. S. (2002). Microgenetic studies of self-explanation. In N. Granott & J. Parziale (Eds.), Microdevelopment (pp. 31–58). Cambridge University Press. https://doi.org/10.1017/CBO9780511489709.002

Siegler, R. S. (2007). Cognitive variability. Developmental Science, 10(1), 104–109. https://doi.org/10.1111/j.1467-7687.2007.00571.x

Strijbos, J. ‑W., Narciss, S., & Dünnebier, K. (2010). Peer feedback content and sender's competence level in academic writing revision tasks: Are they critical for feedback perceptions and efficiency? Learning and Instruction, 20(4), 291–303. https://doi.org/10.1016/j.learninstruc.2009.08.008

Strijbos, J. ‑W., & Wichmann, A. (2017). Promoting learning by leveraging the collaborative nature of formative peer assessment with instructional scaffolds. European Journal of Psychology of Education, 33(1), 1–9. https://doi.org/10.1007/s10212-017-0353-x

Tingley, D., Yamamoto, T., Hirose, K., Keele, L., & Imai, K. (2014). Mediation: R package for causal mediation analysis. Journal of Statistical Software, 59(5), 1-38. https://doi.org/10.18637/jss.v059.i05

Topping, K. J. (2009). Peer assessment. Theory into Practice, 48(1), 20–27. https://doi.org/10.1080/00405840802577569

Waldis, M., Grob, U., Pauli, C., & Reusser, K. (2010). Der schweizerische Mathematikunterricht aus der Sicht von Schülerinnen und Schülern und in der Perspektive hochinferenter Beobachterurteile [Swiss mathematics teaching from the perspective of students and in the perspective of highly-inferental ratings]. In K. Reusser, C. Pauli & M. Waldis (Eds.), Unterrichtsgestaltung und Unterrichtsqualität: Ergebnisse einer internationalen und schweizerischen Videostudie zum Mathematikunterricht [Lesson design and teaching quality: Results of an international and Swiss video study on mathematics teaching] (pp. 171–208). Waxmann.

Wylie, C., & Lyon, C. (2013). Using the formative assessment rubrics, reflection and observation tools to support professional reflection on practice. Council of Chief State School Officers (CCSSO). https://bit.ly/3Jub5Em

Moderating the Neuropsychological Impact of Online Learning on Psychology Students

Copy
Please select the style of the citation for coping to clipboard:

Zulliger, S., Buholzer, A., & Ruelmann, M. (2022). Observed quality of formative peer and self-assessment in everyday mathematics teaching and its effects on student performance. European Journal of Educational Research, 11(2), 663-680. https://doi.org/10.12973/eu-jer.11.2.663

Receive Email Alerts

Eurasian Society of Educational Research

Eurasian Society of Educational Research

Observed Quality of Formative Peer and Self-Assessment in Everyday Mathematics Teaching and its Effects on Student Performance

References

The Language Teaching Methods Scale: Reliability and Validity Studies

Moderating the Neuropsychological Impact of Online Learning on Psychology Students

Moderating the Neuropsychological Impact of Online Learning on Psychology Students

Receive Email Alerts

Eurasian Society of Educational Research

Eurasian Society of Educational Research

Observed Quality of Formative Peer and Self-Assessment in Everyday Mathematics Teaching and its Effects on Student Performance

References

The Language Teaching Methods Scale: Reliability and Validity Studies

Moderating the Neuropsychological Impact of Online Learning on Psychology Students

Moderating the Neuropsychological Impact of Online Learning on Psychology Students

HOW TO CITE THIS ARTICLE

Modal title