PatrickHuembeli / Distill_Physics_and_ML

Distill article: Physics and Machine Learning
1 stars 0 forks source link

Review #3 #4

Open distillpub-reviewers opened 3 years ago

distillpub-reviewers commented 3 years ago

The following peer review was solicited as part of the Distill review process.

The reviewer chose to keep anonymity. Distill offers reviewers a choice between anonymous review and offering reviews under their name. Non-anonymous review allows reviewers to get credit for the service they offer to the community.

Distill is grateful to the reviewer for taking the time to review this article.


Specific notes

Conclusions


Distill employs a reviewer worksheet as a help for reviewers.

The first three parts of this worksheet ask reviewers to rate a submission along certain dimensions on a scale from 1 to 5. While the scale meaning is consistently "higher is better", please read the explanations for our expectations for each score—we do not expect even exceptionally good papers to receive a perfect score in every category, and expect most papers to be around a 3 in most categories.

Any concerns or conflicts of interest that you are aware of?: No known conflicts of interest

Any concerns or conflicts of interest that you are aware of?: No known conflicts of interest What type of contributions does this article make?: Explanation of existing results

Advancing the Dialogue Score
How significant are these contributions? 2/5
Outstanding Communication Score
Article Structure 2/5
Writing Style 2/5
Diagram & Interface Style 4/5
Impact of diagrams / interfaces / tools for thought? 3/5
Readability 2/5

Comments: This article explains some connections between physics and ML in a hard-to-follow manner: by bringing in quantum mechanics, making big promises in the introduction, and using more words than necessary to describe a given idea. It would be hard for a young researcher to learn, eg, EBMs starting from this article.

Scientific Correctness & Integrity Score
Are claims in the article well supported? 3/5
Does the article critically evaluate its limitations? How easily would a lay person understand them? 2/5
How easy would it be to replicate (or falsify) the results? N/A
Does the article cite relevant work? 5/5
Does the article exhibit strong intellectual honesty and scientific hygiene? 2/5
PatrickHuembeli commented 3 years ago

Response to Reviewer #3

We would like to thank the reviewer for their in-depth comments. We implemented all their suggestions and we believe that thanks to this, the article gained substance and improved significantly.

Before answering all of the reviewer’s comments below, we summarize the main changes that we implemented:

Answers to Reviewers comments

We agree with the reviewer that “beyond” is vague and also not necessary in our title. We removed it and further shortened the title by removing “Equilibration and Machine Learning:”. The new title reads simply “The Physics of Energy-Based Models”.

It is written as an interdisciplinary article with a target audience that includes members of both disciplines.

We thank the reviewer for this comment. We have fixed the error.

We thank the reviewer for this suggestion. We have adapted this sentence to “Using physics to understand energy-based models”.

We thank the reviewer for this valuable feedback. We have substantially rewritten and shortened the introduction following these suggestions.

We thank the reviewer for spotting this typo. We have fixed it in the revised version.

We agree with the reviewer. We have adapted the phrasing throughout the article and now refer simply to “storing” or “memorizing” patterns.

We thank the referee for pointing this out. While Hamiltonians can be defined beyond canonical coordinates, particularly in the context of quantum computing, we agree that our wording is potentially misleading and therefore have removed the mention of Hamiltonians in this section.

This is a very interesting point. We agree that the Boltzmann distribution is not special to physics. The distribution was first introduced in the context of statistical mechanics, but it indeed applies more widely. One of the primary aims of this article is that we can use physical systems to understand abstract concepts like probability distributions. Our strategy is akin to providing insights on quadratic curves by studying projectile motion, which can help deepen understanding.

We have updated the Boltzmann distribution section to clarify that the Boltzmann distribution was initially uncovered in statistical physics, but have made clear that it is more widely applicable across other fields.

This is a very important point, and we thank the reviewer for providing this feedback. We have now added a definition for EBMs directly after the introduction.

We agree with the reviewer that the notion of parity required further explanation. Since it is not necessary for the understanding of this figure, we removed it entirely from the manuscript. We also added more explanation to the caption to make sure the figure can be understood independently from the article. Regarding “spin”, please see the answer below.

In the original version of the article, we indeed use the word “spin” which may be potentially associated with quantum mechanics, even though there is also a classical notion of spin systems, like the Ising model. Nevertheless, we have removed the use of the word spin to avoid confusion.

We thank the reviewer for this feedback. We have updated the text following these suggestions.

We thank the reviewer for this suggestion, which has led us to improve the figure. First, we change the interactivity such that it becomes more clear that hidden nodes indeed allow more complex models. We use a simple symmetry breaking example for this. Furthermore, we changed the color of the unused hidden nodes for the Hopfield network. We have also added the names and values of the weights and nodes. This information is now shown when hovering over them with the mouse.

We thank the reviewer for this feedback. We clarify in the caption that this is not the real energy landscape, but a simplified representation. We decided to remove the claim of generalization because the aim of the figure is to explain sampling from an energy landscape.

We thank the reviewer for this suggestion. We have added a subheading for Gibbs sampling and added another one at the beginning of the article for the Boltzmann distribution.

We corrected the typo, and thank the reviewer for catching it.

We thank the reviewer for their positive feedback on this figure.

The idea to use the same figure as a teaser is to attract the reader, while at the end of the article the figure should be fully understood by the reader. So the figures are in fact the same, but we explain more details in the caption appearing in the main text. The caption of the teaser is very general and should not require any knowledge by the reader.

Since we rewrote the whole final section, we decided to remove the “list” of possible research directions because we believe it does not add a lot of value. Therefore, we believe there are now no further citations required.

We thank the reviewer for this suggestion. We have added a sentence about the curse of dimensionality.

We corrected the typo. We thank the reviewer for spotting it.

We corrected the typo. We thank the reviewer for spotting it.

Conclusions

Following the suggestions of the reviewer, we have shortened the article and made several clarifications throughout. We believe the article is greatly improved as a consequence.

We thank the reviewer for this feedback. We have also improved Figure 2 (architectures), which was lacking substance in the last version.

Based on the reviewer’s comments, we have endeavoured to improve the clarity of the text. We have improved critical passages, especially in the introduction, which should make the article much clearer.

Following the reviewer's advice, we have shortened, clarified, and simplified the article. We note that now the only mention of quantum mechanics occurs in the last section, where we present quantum energy-models as a possible future research direction.

We have added this to the Figures in question.