Formula
Distillation Loss for Relation-Based Knowledge
The distillation loss for relation-based knowledge transfer, based on the relations of feature maps, is calculated as:
$$L_{RelD}(f_t, f_s) = L_{R^1}\big(\psi_t(\acute{f}_t, \check{f}_t),\ \psi_s(\acute{f}_s, \check{f}_s)\big)$$
Where:
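- $f_t$ and $f_s$ are the feature maps of the teacher and student models, respectively.
- $\acute{f}_t$ and $\check{f}_t$ are a pair of feature maps chosen from the teacher.
- $\acute{f}_s$ and $\check{f}_s$ are a pair of feature maps chosen from the student.
- $\psi_t$ and $\psi_s$ are the similarity functions applied to a pair of feature maps from the teacher and from the student, respectively.
- $L_{R^1}$ is the correlation function between the teacher and student feature maps; here it compares the teacher's pairwise similarity with the student's.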
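The structure of the formula can be illustrated with a minimal Python sketch. The source does not fix the similarity functions or the correlation function, so the choices here are assumptions made only for illustration: cosine similarity stands in for both psi_t and psi_s, and a squared difference stands in for L_{R^1}. All function names are hypothetical, and feature maps are represented as flattened lists of floats.

```python
import math


def cosine_similarity(a, b):
    """Assumed stand-in for psi: cosine similarity of two flattened feature maps."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def squared_error(r_t, r_s):
    """Assumed stand-in for L_{R^1}: squared difference of the two relation values."""
    return (r_t - r_s) ** 2


def relation_distillation_loss(teacher_pair, student_pair,
                               psi_t=cosine_similarity,
                               psi_s=cosine_similarity,
                               correlation=squared_error):
    """Compute L_RelD = L_{R^1}(psi_t(pair from teacher), psi_s(pair from student))."""
    f_t_a, f_t_b = teacher_pair  # the two feature maps chosen from the teacher
    f_s_a, f_s_b = student_pair  # the two feature maps chosen from the student
    return correlation(psi_t(f_t_a, f_t_b), psi_s(f_s_a, f_s_b))


# Teacher maps are orthogonal (similarity 0); student maps are identical (similarity 1).
teacher_pair = ([1.0, 0.0], [0.0, 1.0])
student_pair = ([1.0, 0.0], [1.0, 0.0])
loss = relation_distillation_loss(teacher_pair, student_pair)
```

Because each similarity is computed within a single model, the teacher and student feature maps may have different sizes in this sketch; only the two maps inside one pair need to share a length.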
Updated 2026-05-10
Tags
Deep Learning (in Machine learning)
Data Science
Computing Sciences