in the code, bn.append(m1.weight.data.abs() / (m1.weight.data.abs() + m2.weight.data.abs())) i don't understand why the weight better than enotrpy?
in the code,
bn.append(m1.weight.data.abs() / (m1.weight.data.abs() + m2.weight.data.abs()))
i don't understand why the weight better than enotrpy?