HSF / documents

Repository for HSF documents (e.g. technical notes)
6 stars 29 forks source link

CWP version for CSBS journal: comments from reviewers #105

Closed jouvin closed 5 years ago

jouvin commented 5 years ago

This is the comments received by email on Sept. 5 turned into a task list for easier tracking. **Be sure to update appropriately the Google Docs-based draft answer to CSBS reviewers before marking a task done.***

Introduction

Section 3.2:

Section 3.3:

Section 3.4:

Section 3.5 (Machine Learning):

Section 3.7:

Section 3.8:

Section 3.10:

Section 3.11:

Section 3.12:

Section 3.13 (Security)

Section 4:

Section 4.1:

Section 4.2 and 4.3:

These detailed comments where preceded by the following general remarks:

This document provides to the High Energy Physics community a clear status of the current software and computing practices which have been set along the last decade in order to achieve the exploitation of the data produced by the four detectors installed on the Large Hadron Collider at CERN. It also, and this is its main objective, proposes a program of work (PoW) in various identified areas that must be done to be able to successfully process the data that will be produced by the upgraded LHC (HL-LHC) in less than 10 years from now.

Reviewing the document was made more difficult due to the poor formatting of the manuscript. Therefore, some of the line numbers mentioned below might be wrong and we encourage the authors to seek a solution for the formatting problems before resubmission.

In addition, it seems that many/most of the references (only analysing the reference names) are from within the HEP community. A broader view should be taken, i.e. references from outside the HEP community should be considered. 
 The proposed program of work appears to be exhaustive from a technology point of view; at least with the current HEP community knowledge of the evolution of hardware and software that may occur during this period. 


Some general points were mentioned in the referee reports. These might be addressed in the Introduction and/or Conclusion
 (or Section 2?):

Attention must be better paid to
 ⁃ Carefully take into account the consequences on the computing facilities (Tiers 0 to 1) of such or such solution that may be raised from the POW and that may lead to extra cost. For instance, HPC or GP/GPU facilities if they provide more efficient (in CPU time) ways to do specific processing are much more expensive traditional HTC farms.
 ⁃ extra cost that may be generated by Hybrid Computing especially if made of commercial resources
. ⁃ not to produce solutions that may lead to the set up of dedicated hardware that then may not be mutualized with other experiments yet supported by the same funding agencies as HL-LHC
.

  • Tier 1s must be part of these works at the earliest possible stage.



Even though it is well understood that this document focuses on HEP computing challenges, the proposed program does not make any reference to the European or worldwide computing for research context. It is mandatory that all these works take into account the evolution of the e-infrastructures they rely on . This may require some closed interactions with bodies like EGI, OSG, EUDAT, PRACE, NRENs… This « collaborative » work must be more clearly identified within the relevant POW if not within a dedicated one.

Finally, the work package relative to Security and Authentication and Authorization Infrastructure purposes a sum-up of the most important actions expected to be achieved. As these issues are beyond the scope of the HEP community and depend on the various national security policy it is of first importance that this work package is handled at the very stage of the proposed roadmap

davidlange6 commented 5 years ago

i started a google doc to organize thoughts - it would be too complex on a GitHub issue I think

https://docs.google.com/document/d/1y1xrJxPkJcGxpuF_7NrCfCqLiy4b4Ahg2zZoQ9SWNCk/edit

jouvin commented 5 years ago

@davidlange6 sure the issue was here to track what has been addressed. In fact I hesistate between a Google doc and the file I added in the PR #104 (answers_to_csbs_reviewers.txt) to share thoughts... Thanks for the Google Docs.

davidlange6 commented 5 years ago

off topic - how are people cloning this documents repository? The entire thing just to get one of the N documents?

jouvin commented 5 years ago

This is the easiest with Git... Do you really care? It's true that it is non optimal because of the PDF in this repo. We should move to LFS for storing them more efficiently...

graeme-a-stewart commented 5 years ago

Yes, it's only 74MB and should be mainly 'sources' for LaTeX (some PDFs, of course). I think that having one repository for every document would just be unwieldy in a different way...

davidlange6 commented 5 years ago

Eventually we will all care when there are enough papers there. Should think how to organize for the future.

On 18 Oct 2018, at 17:00, Michel Jouvin notifications@github.com<mailto:notifications@github.com> wrote:

This is the easiest with Git... Do you really care? It's true that it is non optimal because of the PDF in this repo. We should move to LFS for storing them more efficiently...

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://github.com/HSF/documents/issues/105#issuecomment-431042727, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AEzyw6H--lXRfU32gE3mz3TM8MNUBeeMks5umJc9gaJpZM4XlQLH.

jouvin commented 5 years ago

Let's stop this discussion here so that the issue remains about its original topic... I opened #106 to follow up the discussion started by @davidlange6 .

jouvin commented 5 years ago

Closed as the work has been done and followed in a different place (https://docs.google.com/document/d/1y1xrJxPkJcGxpuF_7NrCfCqLiy4b4Ahg2zZoQ9SWNCk/edit). Not worth updating the issue...