Knowledge Distillation (KD): A PyTorch Implementation

A simple implementation, mainly to illustrate the principle.
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.nn import CrossEntropyLoss
from torch.utils.data import TensorDataset, DataLoader, SequentialSampler

class Model(nn.Module):
    def __init__(self, input_dim, hidden_dim, output_dim):
        super().__init__()
        # nn.LSTM's third positional argument is num_layers, so only the
        # input and hidden sizes are passed positionally here
        self.layer1 = nn.LSTM(input_dim, hidden_dim, batch_first=True)
        self.layer2 = nn.Linear(hidden_dim, output_dim)

    def forward(self, inputs):
        layer1_output, layer1_hidden = self.layer1(inputs)
        layer2_output = self.layer2(layer1_output)
        # Take the output vector of the last token of each sentence in the
        # batch as that sentence's semantic vector
        return layer2_output[:, -1, :]

# Build the small (student) model
model_student = Model(input_dim=2, hidden_dim=8, output_dim=4)
# Build the large (teacher) model; an LSTM stands in here, but it could be a
# trained complex model such as BERT
model_teacher = Model(input_dim=2, hidden_dim=16, output_dim=4)
model_teacher.eval()  # the teacher is not being trained

# Input data; randomly generated here as a stand-in
inputs = torch.randn(4, 6, 2)          # (batch, seq_len, input_dim)
true_label = torch.tensor([0, 1, 0, 0])

# Build the dataset and dataloader
dataset = TensorDataset(inputs, true_label)
sampler = SequentialSampler(dataset)
dataloader = DataLoader(dataset=dataset, sampler=sampler, batch_size=2)

loss_fun = CrossEntropyLoss()
# KL divergence; 'batchmean' matches the mathematical definition of KL.
# Note that KLDivLoss expects log-probabilities as input and probabilities
# as target, not raw logits.
criterion = nn.KLDivLoss(reduction='batchmean')
# Only the student's parameters are passed to the optimizer, so only the
# student is updated, which is exactly the behavior distillation needs:
# the teacher's parameters stay fixed
optimizer = torch.optim.SGD(model_student.parameters(), lr=0.1, momentum=0.9)

for step, batch in enumerate(dataloader):
    inputs, labels = batch
    # Run both the student and the teacher on the same batch
    output_student = model_student(inputs)
    with torch.no_grad():               # no gradients flow through the teacher
        output_teacher = model_teacher(inputs)
    # Soft loss: KL divergence between the student's and the teacher's
    # predicted distributions
    loss_soft = criterion(F.log_softmax(output_student, dim=-1),
                          F.softmax(output_teacher, dim=-1))
    # Hard loss: cross entropy between the student's logits and the true labels
    loss_hard = loss_fun(output_student, labels)
    loss = 0.9 * loss_soft + 0.1 * loss_hard
    print(loss)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
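The code above compares the two models' distributions at temperature 1. The classic formulation of Hinton et al. (2015) additionally softens both distributions with a temperature T before the KL term, and scales the soft loss by T^2 so its gradient magnitude stays comparable to the hard loss: L = alpha * T^2 * KL(softmax(z_s/T) || softmax(z_t/T)) + (1 - alpha) * CE(z_s, y). Below is a minimal sketch of that variant; the helper name distillation_loss and the values T=2.0 and alpha=0.9 are illustrative choices, not part of the original post.

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.9):
    # Hypothetical helper (not from the original post): the standard
    # temperature-scaled KD loss. Both distributions are softened by T;
    # F.kl_div wants log-probabilities as input and probabilities as target.
    soft = F.kl_div(F.log_softmax(student_logits / T, dim=-1),
                    F.softmax(teacher_logits / T, dim=-1),
                    reduction='batchmean') * T * T  # T^2 restores gradient scale
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

Inside the training loop above, this would replace the two-line loss computation with loss = distillation_loss(output_student, output_teacher, labels).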
Reposted from blog.csdn.net/hxxjxw/article/details/115294112