Chapter6 - Githubissues

KevinLee1110 commented 7 years ago

关于第6章的各种建议，请在这里回复

fairmiracle commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L179 $g(z)=\max{0, x}$ ==> $g(z)=\max{0, z}$

KevinLee1110 commented 7 years ago

@fairmiracle 多谢~我尽量避免这种小错误

HC-2016 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L671 原文“If we use a diagonal matrix, or a scalar times the diagonal matrix,” 翻译“如果我们使用对角矩阵，或者是一个常数乘以单位矩阵” 应该是“......乘以对角阵"

KevinLee1110 commented 7 years ago

@HC-2016 非常感谢你的建议~原文确实是”常数乘以对角矩阵“，但是这和对角矩阵有什么区别。。。个人认为这里是作者把单位阵打成了对角阵（理由如下：https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter3/probability_and_information_theory.tex#L418 ，这也是我为什么会把对角阵不假思索地打成单位阵的原因...），我还是在原文中翻译成”对角阵“，在脚注中加以修正吧。

HC-2016 commented 7 years ago

@KevinLee1110 赞同你的理解，不过建议还是按照原文翻译，并用其他方式注释说明

bengordai commented 7 years ago

关于until的翻译直到深度学习的出现，这都是主流的方法。在深度学习出现前，这都是主流的方法。我认为until直译为“直到”读起来有歧义。。
关于convexity的翻译放弃训练问题的凸性我感觉凸性离开函数这个限定会很诡异。。“放弃问题的凸性”这个描述有点诡异。。 3.关于address的翻译 address这里解释为强调吗？我觉得是处理、解决。。这句话我会翻译为我们先详细解释配置（或者部署或者训练）一个前馈网络的每步设计。。

bengordai commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L627 soft的双引号缺左边

KevinLee1110 commented 7 years ago

@bengordai 感谢你的每条建议~第6章我还在抓紧修正。当时翻译的有点急，所以读起来会怪怪的。接受你的1、3建议，我确实疏忽了。关于第2点，convexity的翻译，我认为凸性不止针对函数，也是对优化问题的一种描述，例如“凸优化”（参考wiki中的第二句话）。但我不确定这里的convexity是否是我理解的那种。

HC-2016 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L997 这一段翻译看的很蛋疼。

HC-2016 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L1099 response “相应”->“响应”

HC-2016 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L1207 |||c||| 是什么？占位符？

KevinLee1110 commented 7 years ago

@HC-2016 这两天有点事，没怎么弄第6章。。。抱歉

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L997 原文是这样

Figure 6.5 illustrates how a network with absolute value rectification creates mirror images of the function computed on top of some hidden unit, with respect to the input of that hidden unit.

这句话翻译为"图\ref{fig:chap6_space_folding}解释了带有绝对值修正的网络是如何创建函数的镜像图像的，这些函数在某些隐藏单元的顶部计算，作用于这些隐藏单元的输入。"如何？

感谢是指正
|||c|||是引用，当时我们第一遍翻译的时候还没添加算法，所以有关算法的引用就先用|||c|||代替，后面我们会很快弄完的。

ghost commented 7 years ago

第六章题目及第一段落，将“网络”写成了“网路”

imwebson commented 7 years ago

6.3中的术语“winner-take-all”似乎更多的学术资料中译为“赢者通吃”~源于美国总统大选制度

SwordYork commented 7 years ago

@imwebson 很有道理，我也纳闷怎么翻。多谢！

SwordYork commented 7 years ago

@Elvinczp 多谢！ @KevinLee1110 靠你了。

KevinLee1110 commented 7 years ago

@Elvinczp 感谢指正
@imwebson 嗯嗯，"赢者通吃"这种译法确实非常好~

zdx3578 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L604

有 --> 由

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L608

难么 --> 那么

zdx3578 commented 7 years ago

“argmax’’ 函数的结果表示为一个独热向量(只有一个元素为 1,其余元素都为 0 的向量) 保留 one-hot 更好吧；独热怪怪的。

SwordYork commented 7 years ago

@KevinLee1110 我改了one hot，@zdx3578 早上我grep了独热码，没想到没有改全，这下应该没有了。多谢！

KevinLee1110 commented 7 years ago

@zdx3578 @SwordYork 第6章错别字有点多...我争取尽快再完整读一遍...最近太忙了...（感觉这句话我说了很多遍...）

另外，我觉得one hot翻译成"独热"没问题啊（理由是通信中确实有这种翻译方法：Google），你确定直接从英文理解one hot就不怪了吗？（可能是我英文太菜，见谅...我觉得正文中出现"one-hot向量"的说法反而会让读者更加摸不着头脑...）...其实one hot这种，大家更容易接受它像标准基，而不是一种编码方式，所以翻译成啥都会怪怪的。但我认为翻译成"独热"会更好理解一些（个人观点），但要在第一次出现时给出说明，例如"只有一个元素为1，其余元素都为0的向量"，这样大家可以根据索引找到第一次出现的地方，就可以理解了，不用每次都解释一遍。

zdx3578 commented 7 years ago

感谢各位！ ,普遍近似定理 (universal approximation theorem 万能逼近定理是不是可以？

SwordYork commented 7 years ago

@KevinLee1110 我用 \gls{one_hot} 代替了一下。第5章其他简单的监督学习算法中具体解释了one-hot，而且确实是一种编码方式。再考虑考虑要不要翻成中文，反正目前应该都在gls里了。

SwordYork commented 7 years ago

@zdx3578 普遍近似定理（universal approximation theorem）万能逼近定理，我们没有统一好。我也不清楚哪个好，后面还出现了 universal approximator，普遍近似器，万能逼近器，万能近似器……我觉得普遍近似可能好一点。

zdx3578 commented 7 years ago

one hot 在开始几次出现时保留对照英文，会方便一些。

zdx3578 commented 7 years ago

逼近是个动词，表示可以逼近能力。个人觉得好，其他地方看到过。普遍近似的普遍觉得什么都是普遍，个人感觉哈。或者开始使用两个中文名词都说明一下

minoriwww commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L44

loosely 松散的->粗略地

KevinLee1110 commented 7 years ago

@zdx3578 @SwordYork approximate翻译成"逼近"或者"近似"都是有的，形成词组就是函数逼近、函数近似。但我觉得"逼近"更像是数值上迭代，最终收敛到一个局部极小值点，有一种极限的感觉；而"近似"意义更加广泛一些。另外就是"普遍近似器、普遍逼近器"我觉得有点难听（纯粹个人感觉），我倾向于翻译成"通用近似器、通用逼近器"或者"万能近似器、万能逼近器"，后面的"万能"纯粹是我受到中学时三角函数万能公式这一说法的影响..

KevinLee1110 commented 7 years ago

@minoriwww 感谢你的建议~loosely确实可以翻译成松散地、零散地、粗略地。这里我们看一下它前面那一句话

Finally, these networks are called neural because they are loosely inspired by neuroscience.

我在这里翻译为"最后，这些网络之所以被称为神经网络是因为它们零散地受到神经科学的启发。"这里如果翻译为"粗略地"当然也可以，但却没有那种“从这里受到一点启发，又从那里受到一点启发”的感觉（不知道我理解得对不对...）

后面一句话的原文是

The choice of the functions $f^{(i)}(\bm{x})$ used to compute these representations is also loosely guided by neuroscientific observations about the functions that biological neurons compute.

这里"松散地"翻译得确实不好，我打算把它改成"零散地"，你觉得怎么样呢？然后整句话翻译为"用于计算这些表示的函数$f^{(i)}(\bm{x})$的选择，也零散地受到神经科学观测的指引，这些观测是关于生物神经元计算的函数的"，这样如何？

minoriwww commented 7 years ago

搜索引擎的[*地受到*的启发]的结果也没有好主意: 常见的量词还是 或多或少 这种(意译)，"零散地"应该是直译里面较贴切的了、 粗细 的话的确不太合适（好难喔

后面一句话，第二个functions大概是功能的意思？

用于计算这些表示的函数$f^{(i)}(\bm{x})$的选择，也[零散地/或多或少地]受到神经科学中，关于对生物神经元计算功能的观测的指引

希望能有帮助 :P

KevinLee1110 commented 7 years ago

@minoriwww 嗯！你说得有道理~ 多谢~"或多或少"确实是很棒的翻译~第二个function也确实应该翻译成功能，我们观测的是细胞的功能，不是函数XD

zdx3578 commented 7 years ago

@KevinLee1110 深度学习梯度下降就是数值计算迭代，个人觉得动词逼近比近似更传神。个人感觉。

KevinLee1110 commented 7 years ago

@zdx3578 这里所谓的approximator是一个静态的表示，并不是一个过程。对于分类问题，我们的目标是得到输入到标签的一个映射f；对于回归问题，我们的目标是得到输入到数值的一个映射f。这里的approximator就是用f'来近似这样一种映射f，是一个静态的表示f‘，并不一定涉及数值迭代的过程。例如说"多项式可以近似任意的连续函数"，更加强调说多项式这种结构形成的函数可以近似任意连续函数的行为；但是如果说"多项式可以逼近任意的连续函数"，更加强调说任意的连续函数都可以表示成某种类型的多项式的极限。所以这里我其实个人更倾向于翻译成"近似"。

但是其实，我觉得"逼近"和"近似"没有必要区分得那么细，换着翻也没问题，因为大家都知道它的意思呀。还是不要纠结这个问题了...

huangpingchun commented 7 years ago

17行： Feedforward networks are a conceptual stepping stone on the path to recurrent networks, stepping stone 在此处翻译成 “奠基石/进身之阶” 比较好吧，“垫脚石”有点贬义。如：前馈网络是通往循环网络概念的进身之阶

KevinLee1110 commented 7 years ago

@huangpingchun 嗯嗯，多谢~"垫脚石"确实有点贬义，但"进身之阶"不太好理解，我改成"奠基石"吧~

HC-2016 commented 7 years ago

歪个楼，关于矩阵微分，这本书区分“gradient”和“Jacobian”么？有的话在哪说明的？

SwordYork commented 7 years ago

@HC-2016 区分，详见 https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter4/numerical_computation.tex#L209 对应英文书82左右。但书中一般说成是hessian，目前暂时译成海森。

HC-2016 commented 7 years ago

@SwordYork 谢了！刚看完第四章。我是指“gradient”和“Jacobian”是否互为转置（列向量/行向量），还是不区分？第四章好像并没有明确说明。Hessian是二阶导。

SwordYork commented 7 years ago

@HC-2016 我貌似还是没有理解你的意思，在我的理解中 Jacobian 和 Hessian 才可以不区分。以下是原书4.3.1节中的话，

Sometimes we need to find all the partial derivatives of a function whose input and output are both vectors. The matrix containing all such partial derivatives is as a Jacobian matrix.

When our function has multiple input dimensions, there are many second derivatives. These derivatives can be collected together into a matrix called the Hessian matrix.

Equivalently, the Hessian is the Jacobian of the gradient.

你有没有兴趣帮忙校对第4章？

HC-2016 commented 7 years ago

@SwordYork Jacobian是一阶导数排列成矩阵，Hessian是二阶导排列成矩阵。因此，“Equivalently, the Hessian is the Jacobian of the gradient.”

正在看第6章.

SwordYork commented 7 years ago

你说的是对的，Jacobian是一阶偏导排列成矩阵，但前提是这个函数输入输出都是向量。包含这样偏导的矩阵称为Jacobian矩阵，这里偏导是指对输出是向量输入也是向量的函数求导。对导数（输入是向量输出标量的函数的导数）求导就是其中的一种情况，也就是所谓的 Jacobian of the gradient，这句话的意思是梯度的Jacobian矩阵（也就是二阶导）。 梯度的Jacobian矩阵 特称为 Hessian矩阵。

The Jacobian of the gradient of a scalar function of several variables has a special name: the Hessian matrix, which in a sense is the "second derivative" of the function in question.

from wikipedia.

HC-2016 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L680

closed form expression “闭合的表达式” -> "闭式表达"/“闭式解”？

HC-2016 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L733 “那” -> "哪"

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L734 太绕口了，然而并没有好的建议。

HC-2016 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L737

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L738

As when ... ：和/如同...的时候一样

HC-2016 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L1153 $\nabla{\bm{\theta}J(\bm{\theta})}$ -> $\nabla{\bm{\theta}}{J(\bm{\theta})}$

这章中 “Jacobian matrix”翻译是"Jacobi 矩阵"，和其他章节不一致。术语一章用的是 “Jacobian matrix”

HC-2016 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L1361 $u^{(1)},\ldots,u^{(n)}$ -> $u^{(1)},\ldots,u^{(n_i)}$

HC-2016 commented 7 years ago

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L1353 “it performs on the order of one Jacobian product per node in the graph”, "on the order of "翻译错了吧。

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L1354 $\frac{\partial u^{(i)}}{\partial u^{(j)}}$ 翻译是对的，书是错的，补充说明。

https://github.com/exacity/deeplearningbook-chinese/blob/master/Chapter6/deep_feedforward_networks.tex#L1744 同上

KevinLee1110 commented 7 years ago

@HC-2016 非常感谢你的建议！！兄台有意愿认领一下第6章的中英对比吗？我们会在纸质版正式出版的时候，在书中致谢，非常感谢~

closed form expression "闭合的表达式"确实没有"闭式表达"/"闭式解"好。我感觉意译的话，结合语境，用"解析表达式"可能会更好一点。你觉得怎么样？
原文是这样的

The expression for the negative log-likelihood naturally weights each example’s contribution to the loss for each component by the probability that the component produced the example.

我觉得作者如果用公式直接写反而会更加简洁明了...现在翻译成"负对数似然表达式将每个样例对每个组件的贡献进行赋权，权重的大小由相应的组件产生这个样例的概率来决定。"还是很拗口...我原本想翻译成"对每个组件的损失函数的贡献"，但是这样就更奇怪了...这个问题先留着，待我再想一想。

你是对的，就应该翻译成"和...一样"，这里是呼应前文。
嗯嗯，有一些术语还是没来得及统一，我去改一下。
这里是我疏漏了。
"on the order of" 应该翻译成"大约"，我当时第一反应是"order"是数量级的意思，所以翻译成了它，现在发现还是"大约"更合适。脚注确实应该加一下，毕竟我们是翻译，不是写书...XD，当时觉得作者错了直接修正就好，省得读者看着也挺麻烦...现在想想还是以脚注的形式吧: )

HC-2016 commented 7 years ago

@KevinLee1110 不好意思，可能没有足够的时间中英对照。

minoriwww commented 7 years ago

@KevinLee1110 咱可以领一个第六章的中英对照嘛？

exacity / deeplearningbook-chinese

Chapter6 #19