
Neural Network Study Notes - Definition of the Loss Function and Proof of Its Derivative
Published: 2021-05-09 06:52:33
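Loss function (cross-entropy loss)
The loss function, backpropagation, and gradient computation together make up the training process of a recurrent neural network.
The softmax activation function is used together with the loss function.
The activation function takes an input vector (one score per class, indicating how likely that class is) and computes a probability in (0, 1) for each class.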
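As a rough illustration of the softmax step described above, here is a minimal sketch in plain Python. The function name `softmax` and the input values in `scores` are assumptions made for this example; they do not come from the original notes. Subtracting the maximum score before exponentiating is a common numerical-stability trick and does not change the result.

```python
import math

def softmax(z):
    """Map a list of real-valued scores to probabilities that sum to 1."""
    # Shifting by the maximum leaves the result unchanged but avoids overflow in exp.
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]

scores = [2.0, 1.0, 0.1]  # hypothetical class scores
probs = softmax(scores)
print(probs)              # one probability per class
print(sum(probs))         # sums to 1 up to rounding
```

Each output lies strictly between 0 and 1, and a larger score always maps to a larger probability.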
The loss function takes the softmax output \(\hat{y}\) and the expected result \(y\), and computes the loss \(L\) using cross-entropy.
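To make the role of the loss concrete, the sketch below computes the cross-entropy for one target and two candidate predictions. The helper `cross_entropy` and all numeric values are hypothetical, chosen only for illustration, and the target is assumed to be one-hot.

```python
import math

def cross_entropy(y, y_hat):
    """Cross-entropy between a target distribution y and a prediction y_hat."""
    # Terms with y_k == 0 contribute nothing, so skip them to avoid log(0).
    return -sum(yk * math.log(pk) for yk, pk in zip(y, y_hat) if yk > 0)

y = [0.0, 1.0, 0.0]          # hypothetical one-hot target: class 1 is correct
confident = [0.1, 0.8, 0.1]  # prediction that favours the correct class
mistaken = [0.6, 0.3, 0.1]   # prediction that favours a wrong class

loss_confident = cross_entropy(y, confident)
loss_mistaken = cross_entropy(y, mistaken)
print(loss_confident, loss_mistaken)
```

With a one-hot target the loss reduces to the negative log of the probability assigned to the correct class, so the prediction that puts more mass on that class gets the smaller loss.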
The cross-entropy loss function
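\[\begin{aligned}
L_t(y_t, \hat{y_t}) &= - y_t \log \hat{y_t} \\
L(y, \hat{y}) &= - \sum_{t} y_t \log \hat{y_t} \\
\frac{ \partial L_t } { \partial z_t } &= \hat{y_t} - y_t
\end{aligned}\]
where
\[\begin{aligned}
z_t &= s_t V \\
\hat{y_t} &= \operatorname{softmax}(z_t)
\end{aligned}\]
and \(y_t\) is the expected result at time \(t\) for the training input \(x\), taken from the training data. Since \(y_t\) and \(\hat{y_t}\) are vectors, \(y_t \log \hat{y_t}\) stands for the sum over classes of each target component times the log of the matching predicted probability.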
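The definitions above can be sketched for a short sequence as follows. This is only an illustration under stated assumptions: the matrix `V`, the states `states`, the one-hot `targets`, and the helper `vec_mat` are all made up for the example, with \(s_t\) treated as a row vector so that \(z_t = s_t V\).

```python
import math

def softmax(z):
    m = max(z)  # shift by the maximum for numerical stability
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]

def vec_mat(s, V):
    """Row vector s (length H) times matrix V (H x C) gives a vector of length C."""
    return [sum(s[i] * V[i][j] for i in range(len(s))) for j in range(len(V[0]))]

# Hypothetical sizes: 2 hidden units, 3 classes, 2 time steps.
V = [[0.5, -0.2, 0.1],
     [0.3, 0.8, -0.5]]
states = [[1.0, 0.0], [0.5, 0.5]]             # s_t for each time step
targets = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]  # one-hot y_t for each time step

total_loss = 0.0
grads = []  # dL_t/dz_t = y_hat_t - y_t for each time step
for s_t, y_t in zip(states, targets):
    z_t = vec_mat(s_t, V)
    y_hat_t = softmax(z_t)
    # L_t = -sum_k y_k log(y_hat_k); zero-target terms are skipped
    total_loss += -sum(y * math.log(p) for y, p in zip(y_t, y_hat_t) if y > 0)
    grads.append([p - y for p, y in zip(y_hat_t, y_t)])

print(total_loss)
print(grads)
```

Because both \(\hat{y_t}\) and \(y_t\) sum to 1, the components of each gradient vector sum to zero, and the component for the correct class is negative.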
Proof
In this proof the subscripts \(k\) and \(t\) index the components (classes) of the vectors \(y\), \(\hat{y}\) and \(z\) at a single time step, so the loss at that step is written out as a sum over \(k\). The softmax differentiation formula used below is \(\frac{ \partial \hat{y_t} } { \partial z_t } = ( 1 - \hat{y_t} ) \hat{y_t}\) and, for \(k \ne t\), \(\frac{ \partial \hat{y_k} } { \partial z_t } = -\hat{y_t} \hat{y_k}\).
\[\begin{aligned}
\frac{ \partial L_t } { \partial z_t } & = \frac{ \partial \left ( - \sum_{k} y_k \log \hat{y_k} \right ) } { \partial z_t } \\
 & = - \sum_{k} y_k \frac{ \partial \log \hat{y_k} } { \partial z_t } \\
 & = - \sum_{k} y_k \frac {1} {\hat{y_k}} \cdot \frac{ \partial \hat{y_k} } { \partial z_t } \\
 & = - \left ( y_t \frac {1} {\hat{y_t}} \cdot \frac{ \partial \hat{y_t} } { \partial z_t } \right ) - \left ( \sum_{k \ne t} y_k \frac {1} {\hat{y_k}} \cdot \frac{ \partial \hat{y_k} } { \partial z_t } \right ) \\
 & \because \text{softmax differentiation formula} \\
 & = - \left ( y_t \frac {1} {\hat{y_t}} \cdot ( 1 - \hat{y_t} ) \hat{y_t} \right ) - \left ( \sum_{k \ne t} y_k \frac {1} {\hat{y_k}} \cdot (-\hat{y_t} \hat{y_k}) \right ) \\
 & = - \left ( y_t \cdot ( 1 - \hat{y_t} ) \right ) - \left ( \sum_{k \ne t} y_k \cdot (-\hat{y_t}) \right ) \\
 & = - y_t + y_t \hat{y_t} + \left ( \sum_{k \ne t} y_k \hat{y_t} \right ) \\
 & = - y_t + \hat{y_t} \left ( \sum_{k} y_k \right ) \\
 & \because \sum_{k} y_k = 1 \\
 & = \hat{y_t} - y_t
\end{aligned}\]
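One way to sanity-check the result of the proof is to compare \(\hat{y} - y\) against a finite-difference estimate of the gradient. The sketch below does this in plain Python; the scores `z`, the one-hot target `y`, and the step size `eps` are arbitrary values assumed for the example.

```python
import math

def softmax(z):
    m = max(z)  # shift by the maximum for numerical stability
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]

def loss(z, y):
    """Cross-entropy of softmax(z) against the target y."""
    y_hat = softmax(z)
    return -sum(yk * math.log(pk) for yk, pk in zip(y, y_hat))

z = [0.3, -1.2, 2.0]  # hypothetical scores
y = [0.0, 0.0, 1.0]   # hypothetical one-hot target

# Gradient claimed by the proof: y_hat - y
analytic = [p - t for p, t in zip(softmax(z), y)]

# Central finite differences, perturbing one component of z at a time
eps = 1e-6
numeric = []
for i in range(len(z)):
    z_plus = list(z)
    z_plus[i] += eps
    z_minus = list(z)
    z_minus[i] -= eps
    numeric.append((loss(z_plus, y) - loss(z_minus, y)) / (2 * eps))

max_diff = max(abs(a - n) for a, n in zip(analytic, numeric))
print(analytic)
print(numeric)
print(max_diff)
```

The two gradients should agree up to the small error inherent in finite differences, which is what the derivation predicts.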