#Week7 Neural Networks : Learning - 代码天地

#Week7 Neural Networks : Learning

其他 2020-01-01 23:34:12 阅读次数: 0

一、Cost Function and Backpropagation

神经网络的损失函数：
\[J(\Theta) = - \frac{1}{m} \sum_{i=1}^m \sum_{k=1}^K \left[y^{(i)}_k \log ((h_\Theta (x^{(i)}))_k) + (1 - y^{(i)}_k)\log (1 - (h_\Theta(x^{(i)}))_k)\right] + \frac{\lambda}{2m}\sum_{l=1}^{L-1} \sum_{i=1}^{s_l} \sum_{j=1}^{s_{l+1}} ( \Theta_{j,i}^{(l)})^2\]
在这里插入图片描述
这个cost function是在logistic regression基础上演变而来，只是神经网络有很多输出结点，而logistic regression只有一个输出结点，所以这个cost function只是把所有的K个输出结点的损失函数进行累加。

得到cost function后，为了寻找使得\(J(\theta)\)最小的那组参数\(\theta\)，我们需要知道\(J(\theta)\)关于每个\(\theta\)的偏导数，而后向传播算法可以帮助我们计算偏导数：
在这里插入图片描述
对于每个训练样本，先利用forward propagation计算每一层的\(a\)：

接着利用样本真实标签\(y^{(t)}\)计算最后一层的误差值；

之后从右向左计算每一层（输入层除外）的误差：
在这里插入图片描述
这样每个样本一次正向、一次反向来更新误差矩阵：

向量化表示：

最后，就可以得到偏导数：

二、Backpropagation in Pratice

为了使用fminunc等高级的优化方法来求得cost function的最小值，所以将\(\theta\)这个矩阵展成向量传入fminunc，完成后可以通过reshape从向量中提取\(\theta^{(1)}、\theta^{(2)}\)等：
在这里插入图片描述

为了确保我们使用Backpropagation求得的偏导数的正确性，可以使用Gradient Checking（很慢）来检验：
根据偏导数定义：
\[\dfrac{\partial}{\partial\Theta_j}J(\Theta) \approx \dfrac{J(\Theta_1, \dots, \Theta_j + \epsilon, \dots, \Theta_n) - J(\Theta_1, \dots, \Theta_j - \epsilon, \dots, \Theta_n)}{2\epsilon}\]
\[一般\epsilon=10^{-4}\]
通过将这种方式计算的偏导数与之前Backpropagation求得的偏导数比较，即可得知Backpropagation的正确性。

之前在Linear Regression和Logistic Regression，我们可以用全0来初始化\(\theta\)，但在神经网络中，这样做会有问题，所以采用随机初始化：
在这里插入图片描述
最后，从整体捋一遍流程：
1、选择网络结构：

2、训练神经网络：

对每一个训练样本：
在这里插入图片描述

猜你喜欢

转载自www.cnblogs.com/EIMadrigal/p/12130910.html

#Week7 Neural Networks : Learning

Neural Networks and Deep Learning (Week 2)——Neural Networks Basics

Neural Networks and Deep Learning (Week 3)——Shallow neural networks

Neural Networks and Deep Learning (Week 1)——Introduction to deep learning

Coursera, Deep Learning 1, Neural Networks and Deep Learning - week4, Deep Neural Networks

Neural Networks for Machine Learning

Neural Networks and Deep Learning

Neural Networks and Deep Learning编程作业 (Week 2)

Neural Networks and Deep Learning编程作业 (Week 4)

Neural Networks and Deep Learning (Week 4)——Deep Nural Network

Neural Networks and Deep Learning编程作业 (Week 3)

《Neural networks and deep learning》概览

Neural Networks and Deep Learning(1)

Neural networks and deep learning 概览

Sequence to Sequence Learning with Neural Networks

Neural Networks and Deep Learning 整理

Neural Networks and Deep Learning 笔记

Deep learning - Introduction to Neural Networks

Neural Networks and Deep Learning--Course1week4--Building your Deep Neural Network -Step by Step

Neural Networks and Deep Learning--Course1week2--Logistic Regression with a Neural Network

Neural Networks and Deep Learning（week4）Deep Neural Network - Application（图像分类）

Neural Networks and Deep Learning--Course1week4--Deep Neural Network - Application v8

Neural Networks and Deep Learning（week4）Building your Deep Neural Network: Step by Step

Machine Learning - Neural Networks Learning: Backpropagation in Practice

【Deep Learning】Sequence to Sequence Learning with Neural Networks

【DeepLearning学习笔记】Coursera课程《Neural Networks and Deep Learning》——Week2 Neural Networks Basics课堂笔记

Introduction to deep learning--Week 1-Neural Networks and Deep Learning

NEURAL NETWORKS（neural networks and deep learning by Charu C. Aggarwa）

Neural Networks and Deep Learning-引论

Sequence to Sequence Learning with Neural Networks阅读笔记

今日推荐

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

周排行

浏览器对同一域名进行请求的最大并发连接数

React Hook之自定义Hook

【转】MyBatis缓存机制

-Java-泛型

自动化测试常用脚本-发送邮件

LeetCode#859: Buddy Strings

java、Python处理字符串

第二篇の博客

Hadoop伪分布式环境安装

SQL Server进阶（十一）临时表、表变量

每日归档

更多

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)