DARMA-tasking / vt

DARMA/vt => Virtual Transport
Other
35 stars 9 forks source link

Meeting Agenda [do not close] #925

Open lifflander opened 4 years ago

lifflander commented 4 years ago

This issue shall be used to log (weekly) meeting agendas and the corresponding resolution of the topics addressed (minutes). This will help us maintain a record of topics discussed and the resolution of these issues. Each comment shall contain a meeting agenda for a given week, edited later with the resolution for each point of order.

Other impromptu meetings that are relevant to the whole group can also be logged here for posterity and members of the team (or people following the project) who couldn't join.

Rendered template for each meeting:

Descriptor Information
Date [xx/xx/xxx]
Attendees [list-of-attendees]
Description [longer-description]

Agenda:

Template:

| Descriptor | Information |
| --: | --------- |
| Date | [xx/xx/xxx] |
| Attendees | [list-of-attendees] |
| Description | [longer-description] |

### Agenda:
- Item 1
- Item 2
nlslatt commented 3 years ago
Descriptor Information
Date [07/13/2021]
Attendees @PhilMiller @JacobDomagala @nmm0 @cz4rs @jstrzebonski @fnrizzi @ppebay @nlslatt
Description Weekly Meeting

Agenda:

@nlslatt:

@jstrzebonski:

@cz4rs:

@nmm0:

@JacobDomagala:

@PhilMiller:

@ppebay:

lifflander commented 3 years ago
Descriptor Information
Date [07/20/2021]
Attendees @PhilMiller @cz4rs @nlslatt @lifflander
Description Weekly Meeting

Agenda:

@lifflander:

@nlslatt:

@jstrzebonski: My update:

Update:

@PhilMiller:

lifflander commented 3 years ago
Descriptor Information
Date [07/27/2021]
Attendees @PhilMiller @cz4rs @nlslatt @lifflander @JacobDomagala @pierrepebay @fnrizzi @ppebay
Description Weekly Meeting

Agenda:

@lifflander:

@cz4rs :

@JacobDomagala:

@nlslatt:

@jstrzebonski: Out today

@PhilMiller:

lifflander commented 3 years ago
Descriptor Information
Date [08/03/2021]
Attendees @PhilMiller @cz4rs @lifflander @JacobDomagala @pierrepebay @fnrizzi @ppebay
Description Weekly Meeting

Agenda:

@lifflander:

@cz4rs :

@JacobDomagala:

@jstrzebonski: Out today

@PhilMiller:

lifflander commented 3 years ago
Descriptor Information
Date [08/24/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @pierrepebay @ppebay @nlslatt @nmm0
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski:

@PhilMiller:

lifflander commented 3 years ago
Descriptor Information
Date [09/21/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @ppebay @nlslatt @nmm0
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: I was working on:

@nlslatt: My update:

@cz4rs: Update: no DARMA work this week wrapping up Kokkos activities, I should be back full-time next week

@JacobDomagala: Update: nothing on DARMA for last week

@PhilMiller:

lifflander commented 3 years ago
Descriptor Information
Date [09/28/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @ppebay @nlslatt @nmm0
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: On DARMA, I was working on: DARMA-tasking/vt#1393 - Undefined Behavior Sanitizer DARMA-tasking/checkpoint#161 - Using reconstruction logic for std::vectors

@nlslatt:

@cz4rs: Nothing on DARMA, additional Kokkos work prioritized

@JacobDomagala: nothing on DARMA for last week

@PhilMiller:

@nmm0: Working on BVH timing issue

lifflander commented 3 years ago
Descriptor Information
Date [10/05/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @cz4rs @ppebay @nlslatt @nmm0
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update:

@nlslatt:

@cz4rs: 1.1.1 Beta v3 release candidate is ready No DARMA work planned for October (100% Kokkos Core)

@JacobDomagala: nothing on DARMA for last week

@PhilMiller:

@nmm0: My update:

@ppebay: My update:

lifflander commented 2 years ago
Descriptor Information
Date [10/12/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @cz4rs @ppebay @nlslatt
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update:

@nlslatt:

@cz4rs: 1.1.1 Beta v3 release candidate is ready No DARMA work planned for October (100% Kokkos Core)

@JacobDomagala: Update: nothing on DARMA for last week

@PhilMiller:

@nmm0: Won't be able to make the meeting due to a conflict. Here is my update: https://github.com/DARMA-tasking/vt/pull/1579 (currently draft). Still need to fix minor issues and use in CollectionManager Investigation of weird performance drop in BVH when running with longer intervals between LB runs

@ppebay: no update

lifflander commented 2 years ago
Descriptor Information
Date [11/02/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @ppebay @nlslatt
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update: I was working on DARMA-tasking/vt#1581 - adding option to choose VT compilation type (shared/static).

@nlslatt: My update: I'm trying to add reading of commications data from json files (#1589) because I need it for my GEMMA work. Although the node ID for communication between a collection element and a node passes the assertion of being a number, trying to use it as a number or cast it is giving me the error “type must be a number, but is object”. Any thoughts?

@cz4rs: Update: Prepared release candidate 1.1.1 Beta v4 No other DARMA related work this week

@JacobDomagala: Update: nothing on DARMA for last week

@PhilMiller: I was on vacation for some of the of last week, but I reviewed some LB infrastructure changes Pushed through some Kokkos changes for the EMPIRE async GPU work

@nmm0: Sorry I'm so late with my update: More focused on NimbleSM this week https://github.com/DARMA-tasking/vt/pull/1579 ready for review and merge as soon as I figure out why the license formatting is failing :stuck_out_tongue:

@ppebay: My update: 07:51

lifflander commented 2 years ago
Descriptor Information
Date [11/09/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @ppebay @nlslatt @cz4rs @nmm0
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update: I resolved DARMA-tasking/vt#1581 and DARMA-tasking/vt#1586. At the moment I'm working on DARMA-tasking/vt#1577 - most of the callbacks types are covered, 4 are left.

@nlslatt: My update: I put up PR #1603 for reading in communication data from json files.

@cz4rs: Update: Back to DARMA (25% availability) Fixed vt #1202 - added Intel oneAPI build to CI Opened vt #1602 - stop using CMAKE_CXX_FLAGS for setting compile options

@JacobDomagala: Update: nothing on DARMA for last week

@PhilMiller: Worked through a bunch of issues arising from updating vt in EMPIRE Worked on LB evolution, including my changes to the modeling bits, and reviewing Jonathan’s changes to the stats structures

@nmm0: Update: Minor changes to epoch guard due to reviews A test is failing that I'm not quite sure about Discovered a bug with usage of CLI11 though it's not really "our fault" :P

@ppebay: No update

lifflander commented 2 years ago
Descriptor Information
Date [11/16/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @ppebay @nlslatt @cz4rs
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update: DARMA-tasking/vt#1577 - ready for review - https://github.com/DARMA-tasking/vt/pull/1580 DARMA-tasking/vt#1607 - resolved another UBSAN error DARMA-tasking/vt#1609 - cleaning up workspace before compiling vt-sample-project with vt as TPL - ready for review - https://github.com/DARMA-tasking/vt/pull/1610

@nlslatt: My update: Just gemma work last week @cz4rs: Update: Back to DARMA (25% availability) Fixed vt #1202 - added Intel oneAPI build to CI Opened vt #1602 - stop using CMAKE_CXX_FLAGS for setting compile options

@JacobDomagala: Update: nothing on DARMA for last week

@PhilMiller: Worked through a bunch of issues arising from updating vt in EMPIRE Worked on LB evolution, including my changes to the modeling bits, and reviewing Jonathan’s changes to the stats structures

@nmm0: My update: Pretty much all on NimbleSM this past week Will miss some of the meeting due to a SC talk I'm attending

@ppebay: No update

lifflander commented 2 years ago
Descriptor Information
Date [11/30/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @ppebay @nlslatt @cz4rs
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: Last week I was working on: DARMA-tasking/vt#1604 - ODR violations for applications using CLI11 DARMA-tasking/comment-on-pr#7 - covering code with unit tests DARMA-tasking/checkpoint#229 - investigate if there's more efficient way of serializing std::vector

@cz4rs Update: waiting for feedback after merging PR #1613 - see here vt #1435 (printing time units) - in progress

@nlslatt: My update: Just gemma work last week @cz4rs: Update: Back to DARMA (25% availability) Fixed vt #1202 - added Intel oneAPI build to CI Opened vt #1602 - stop using CMAKE_CXX_FLAGS for setting compile options

@JacobDomagala: Update: nothing on DARMA for last week

@PhilMiller: Update: Finished (tentatively) proposed reassignment load model, with post-LB statistics computation, and isolation of LB decisions from application of reassignment: https://github.com/DARMA-tasking/vt/pull/1583

@nmm0: Update: Mostly NimbleSM stuff Wrote a small reproducer for the ODR issue #1604 https://github.com/nmm0/vt_odr_test ping me if you want access. I didn't want to put it in the main repo since it's just a 10 line test program

@ppebay: My update: worked on CFD miniapp #40 reviewed and approved LBAF PR109 completed replacement of “processor” by “rank” and redefinition and usage of corresponding functions reworked entirely “total work” model resolved LBAF #19 and finalized PR108 created total work testing issue and started parametric testing LBAF #110 addressed comments from MW on LBAF PR#108 worked on total work testing and debugging LBAF #110

lifflander commented 2 years ago
Descriptor Information
Date [12/14/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @ppebay @nlslatt @cz4rs
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update: I was working on: DARMA-tasking/vt#1605 - Potential ODR violations with fmt DARMA-tasking/vt#1606 - Potential ODR violation with nlohmann json DARMA-tasking/vt#1615 - Azure pipeline 'gcc-7, ubuntu, mpich, trace runtime, LB' failes to compile vt

@cz4rs Status: vt #1435 - pretty-print LB times (add custom fmt formatter) vt #1439 - store time in seconds consistently (follow up to above)

@nlslatt: My update: Nothing on DARMA directly I will be out Dec 18-Jan 10

@JacobDomagala: Update: nothing on DARMA for last week

@PhilMiller: My update: Proposed reassignment model merged. Plans for further refactorings to allow things like running multiple LBs, multiple trials of single LBs, and iterative refinement

@nmm0: My update: Migrating my code to use new API for the insertable collections vt #1620 for adding move assignment to ModifierToken (we should discuss in the meeting since I assume there was a reason it was left out) On vacation 12/20-1/14

@ppebay: My update:

lifflander commented 2 years ago
Descriptor Information
Date [12/21/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @ppebay @cz4rs
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update:

@cz4rs Status: vt #1435 - pretty-print LB times - integrating EngFormat-Cpp library into vt (DARMA-tasking/EngFormat-Cpp)

@nlslatt: Out this week

@JacobDomagala: Update: nothing on DARMA for last week

@PhilMiller: My update: Proposed reassignment model merged. Plans for further refactorings to allow things like running multiple LBs, multiple trials of single LBs, and iterative refinement

@nmm0: Out this week

@ppebay: My update: resolved LBAF #123, validated it with tests and created PR124 studied LBAF #122 created LBAF #126 #127 following discussion with SDR

lifflander commented 2 years ago
Descriptor Information
Date [1/04/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @ppebay @cz4rs
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update:

@cz4rs Status: vt #1435 - pretty-print LB times - integrate EngFormat-Cpp, add custom fmt formatter EngFormat-Cpp #2 - ready for review (required for the above) Misc: 1.1.1 Beta v6 RC released Some response appeared in our issue in m.css repository - WIP, workaround on our side still required.

@nlslatt: Out this week

@JacobDomagala: Update: I'm back on DARMA currently working on #1445

@PhilMiller: My update: Pushing latest code into EMPIRE

@nmm0: Out this week

@ppebay: My update: created LBAF #132 #133 following discussions with SDR resolved LBAF #126 created PR128 resolved LBAF #127 created PR129 resolved LBAF #130 created PR131 resolved LBAF #133 created PR135 completed LBAF #134, addressed review and comments by MW on PR136 created LBAF #137, implemented it, created and finalized PR #138 worked on LBAF #122 created PR140 implemented LBAF #141 created PR142 reviewed and merged LBAF #139 manually resolved conflicts between LBAF #122 and #132 continued to work on #122 (still unresolved)

lifflander commented 2 years ago
Descriptor Information
Date [1/11/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @cz4rs
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update:

@cz4rs Status: vt #1435 - LB times will now be printed using engineering notation - ready for merging vt #1652 - use upstream m.css without modification - ready for review EngFormat-Cpp #3 - make sure that the library installs correctly - ready for review vt #1439 - store times in seconds consistently across all LBs - WIP

@nlslatt:

@JacobDomagala: My update: Working on: vt - 1449 - Regression tests for send combinations vt - 1445 - Schedule message instead of doing MPI self-send for bcast root After updating my old branch with changes in develop, some of the tests started to fail - I'm looking into that

@PhilMiller: My update: -Nothing to report

@nmm0: Out this week

@ppebay: My update: manually resolved conflicts between LBAF #122 and #132 completed implementation of alternate, non-CMF based transfer strategy (LBAF #122 PR140) discussed logging with MW, created LBAF #143; reviewed and merged PR144 addressed comment by PhM on LBAF #122 and modified PR140 to include both deterministic and probabilistic transfer options

lifflander commented 2 years ago
Descriptor Information
Date [1/18/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @cz4rs @nmm0
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update:

@cz4rs Status: vt #1495 - fix Alpine build (error while generating stacktrace) - PR #1657 WIP vt #1439 - store times in seconds consistently across all LBs - WIP Done: vt #1435 - LB times will now be printed using engineering notation - merged vt #1652 - use upstream m.css without modification - merged EngFormat-Cpp #3 - fix library installation - merged

@JacobDomagala: My update: Working on: vt - 1449 - Regression tests for send combinations vt - 1445 - Schedule message instead of doing MPI self-send for bcast root After updating my old branch with changes in develop, some of the tests started to fail - I'm looking into that

@PhilMiller: Nothing for my update - all my SNL time has been EMPIRE or Kokkos

@nmm0: Update: Getting caught up to where I was back from vacation Working on some changes to make broadcasts return pendingSends Looking at an issue where VT goes into an infinite recursive loop when trying to output a certain bad unterminated epoch graph

@ppebay:

lifflander commented 2 years ago
Descriptor Information
Date [1/25/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @cz4rs @nmm0 @nlslatt
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: I'm working on: API that temporarily enables/disables debug prints - https://github.com/DARMA-tasking/vt/issues/1636 (pull request - https://github.com/DARMA-tasking/vt/pull/1658) allowing migrations to same node for testing serialization - https://github.com/DARMA-tasking/vt/issues/476

@cz4rs My update: Working on vt #1495 - got macos build working, still getting some random segfaults On a related note, I've had to reinstall my system from scratch after Ubuntu failed to upgrade, after switching to Fedora - vt builds and tests run fine (latest Fedora workstation, clang-13). I intend to wrap that into a Dockerfile and submit separate PR. Limited hours for DARMA until the end of January - I will be focusing on Kokkos.

@JacobDomagala: My update: Ready for review: vt - 1445 - Schedule message instead of doing MPI self-send for bcast root

@PhilMiller: Nothing to report on DARMA other than participation in design discussions - focus has been on EMPIRE and Kokkos issues, though mostly that touch DARMA stuff

@nmm0: My update: Working on 1662 Working on 1661 Discussed CollectionChainSet design with Jonathan and Phil, leading to 1661 and 1660

@ppebay: My update: Completed and optimized implementation of criterion-based CMF for transfer selection LBAF #122 and finalized PR148

@nlslatt: My update: focused on GEMMA

lifflander commented 2 years ago
Descriptor Information
Date [2/1/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @cz4rs @nmm0 @nlslatt @ppebay
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update: API that temporarily enables/disables debug prints - https://github.com/DARMA-tasking/vt/issues/1636 (pull request - https://github.com/DARMA-tasking/vt/pull/1658) - is ready to merge I continue working on allowing migrations to same node for testing serialization - https://github.com/DARMA-tasking/vt/issues/476

@cz4rs Update: No work on DARMA this week Misc: I will be away on vacation next week (Feb 7th - 11th)

@JacobDomagala: Update: Started working on https://github.com/DARMA-tasking/vt/issues/1544

@PhilMiller: Nothing to report on DARMA other than participation in design discussions - focus has been on EMPIRE and Kokkos issues, though mostly that touch DARMA stuff

@nmm0: Update: Draft PR for objgroup pending sends (I need to add some tests): 1666 Working through some issues on BVH Worked on some sample programs for static template registration

@ppebay: My update: implemented LBAF #150 #151 completed PR152 fixed a bug in PR152 (incorrect computation of CMF) reviewed and merged LBAF PR153

@nlslatt: My update: focused on GEMMA

lifflander commented 2 years ago
Descriptor Information
Date [2/8/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @nmm0 @nlslatt @ppebay
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update: Refactoring includes in vt/config - https://github.com/DARMA-tasking/vt/issues/1667 - ready for review (https://github.com/DARMA-tasking/vt/pull/1670) Working on allowing passing AppConfig to vt during initialization - https://github.com/DARMA-tasking/vt/issues/1550

@JacobDomagala: Update: No work on DARMA last week

@PhilMiller: Update: Some work on LB matters otherwise focused on Kokkos and applications

@nmm0: Current status: Working on #1668 Should be working but I'm still having issues in my application so I'm debugging further Still working on #1666 , still need to add tests Worked with Keita to review jacobi3d

@ppebay: My update: tested and verified post-rebase of LBAF #151 created and worked on LBAF #160 for configurability of recursiveness reviewed and merged LBAF PR158 resolved LBAF #161 created PR162 resolved LBAF #160 created PR163 reviewed and merged LBAF PR164 PR165

@nlslatt: My update: focused on GEMMA

lifflander commented 2 years ago
Descriptor Information
Date [2/15/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @nmm0 @nlslatt @ppebay
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update

@JacobDomagala: Update: Only few hours spent on Darma https://github.com/DARMA-tasking/vt/pull/1675 small PR opened for review

@PhilMiller: Update: Some work on LB matters otherwise focused on Kokkos and applications

@nmm0: Update: Not a lot on Darma, still have PRs I need to finish Working on seeing if I can add sendMsgSz support to collections, no issue set up for that yet

@ppebay: My update: investigated MoveCountsViewer runtime failure with current version of PyPI created LBAF #166 resolved LBAF #121 created PR167 tried to understand why Tempered criterion sometimes let maximum work increase in some configurations, created LBAF #168, revised it as per NS comments created LBAF #170 #171 #172 worked on LBAF #166 with MW reviewed and merged LBAF #173 #174

@nlslatt: My update: focused on empire and gemma

@cz4rs: Update: vt #1495: ready for merging - PR #1657 freshly rebased on top of develop (waiting for the CI to finish) vt #1484: investigating footprinting test failure with fcontext enabled

lifflander commented 2 years ago
Descriptor Information
Date [2/22/2021]
Attendees @PhilMiller @lifflander @JacobDomagala @jstrzebonski @nmm0 @nlslatt @ppebay
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski: My update:

@JacobDomagala: Update: Only few hours spent on Darma https://github.com/DARMA-tasking/vt/pull/1675 added few changes after reviews

@PhilMiller: Update: Application and Kokkos work Nothing directly on DARMA, besides reviews

@nmm0: Update Same tasks from last week With respect to SendMsgSz, seeing if I can embed size information into the shared pointer Fix for 1677 to prevent CMake from running multiple times

@ppebay: My update: revised LBAF PR167 it as per NS comments, resolved conflicts and merged it tried to understand why Tempered criterion sometimes let maximum work increase in some configurations, created LBAF #168 created LBAF #177 #180 #181 on LBAF #166 with MW LBAF #180 created PR182 created LBAF #178 #181 #183 to investigate sub-optimal convergence for mixed comm/load cases worked on suboptimal configurations LBAF #178, wrote a method to exhaustively explore, found explanation of the phenomenon and wrote detailed summary about it in LBAF #178 continued to work on suboptimal minima; implemented recursive method to explore all r transitions between reachable arrangements LBAF #178 worked on reviews of LBAF PR182 and finalized it; fixed #185 CI issue in same PR worked on LBAF #168 created PR190 reviewed and merged LBAF PR179 PR180 PR187

@nlslatt: My update: I’ve been updating up the stats replay branch to work with current vt. The replay code has some limitations that could be overcome with a much simpler implementation enabled by the ProposedReassignment load model. However, at least minor refactoring of the LB manager and collection manager are needed. Refactoring BaseLB as well would further simplify the replay implementation.

@cz4rs: Update: vt #1495: replace libexecinfo with libunwind - PR #1657 merged vt #1679: Create 1.1.1 beta v7 release candidate - PR #1682 ready for review vt #1484: fix memory footprint tests - work in progress vt #1659: Improve communication statistics in VT + the follow-up from LB meeting - work in progress

nlslatt commented 2 years ago
Descriptor Information
Date [03/01/2022]
Attendees @nlslatt @jstrzebonski @nmm0 @JacobDomagala @cz4rs @PhilMiller @ppebay
Description Weekly Meeting

Agenda:

@lifflander is out today.

@jstrzebonski My update:

@nmm0 My update:

@JacobDomagala Update:

@cz4rs Update:

@PhilMiller Nothing on DARMA

@nlslatt My update:

@ppebay Thinks the optimization problem is solved in https://github.com/DARMA-tasking/LB-analysis-framework/pull/190

lifflander commented 2 years ago
Descriptor Information
Date [03/15/2022]
Attendees @nlslatt @jstrzebonski @nmm0 @JacobDomagala @cz4rs @PhilMiller @ppebay @stmcgovern
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski My update: https://github.com/DARMA-tasking/vt/issues/1696 - done https://github.com/DARMA-tasking/vt/pull/1694 - waiting for approvals and merging finishing working on https://github.com/DARMA-tasking/vt/issues/1551

@nmm0 Update: Wrote tests for makemessagesz -- currently sending to a different node with collections doesn't work Jonathan recommended I use the allocation metadata to send instead because shared message ptrs are ephemeral (can be deleted and recreated arbitrarily), so working on that

@JacobDomagala Update: Back on Darma https://github.com/DARMA-tasking/vt/pull/1675 I think it's ready to be merged Continuing the work on https://github.com/DARMA-tasking/vt/issues/1544

@cz4rs My update: vt #1484 - fix memory footprint tests - ready for review vt #1672 - Implement the total work LB algorithm in TemperedLB - work in progress vt #1699 - Update .clang-format syntax - ready for review posted vt #1702 - including app_config.h crashes applications 1.1.1 Beta v7 RC is ready

@PhilMiller Nothing on DARMA

@nlslatt My update: Mostly EMPIRE and GEMMA work this week Still working on stats replay

@ppebay My update: I started to experiment and study new Tempered/Affine scheme with real data

lifflander commented 2 years ago
Descriptor Information
Date [03/29/2022]
Attendees @nlslatt @jstrzebonski @nmm0 @JacobDomagala @cz4rs @PhilMiller @ppebay @stmcgovern
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski My update:

@nmm0 My update: Not really any VT stuff this week, just working on mempool changes

@JacobDomagala Update: Almost done with https://github.com/DARMA-tasking/vt/issues/1544

@cz4rs My update: vt #1713 - fix spurious warning in startup banner - merged :white_check_mark: vt #1702 - Implement the total work LB algorithm in TemperedLB - on hold Working almost exclusively on Kokkos past week and until the end of month.

@PhilMiller Nothing on DARMA

@nlslatt My update: Put up PR https://github.com/DARMA-tasking/vt/pull/1720 for LB workload data replay; has unit tests but could use a regression test Working on https://github.com/DARMA-tasking/vt/issues/1723, adding user-defined data to the LB workload data json files

@ppebay My update: continued to experiment and study new Tempered/Affine scheme with real data discussed LBAF #156 with MJ discussed ROE issues and testing with MW reviewed LBAF PR203 and created follow-on #206 discussed packaging of LBAF/ continuous deployment via PyPI with SDR and MW, created #209 finalized review of LBAF PR203 and merged it discussed private-ification of member functions and variables in LBAF with MW and created #210 reviewed and merged LBAF PR211 PR213

@stmcgovern My update: vt #1423 remove maximimum ref-count check - draft pull request vt #1716-rename-lb-stats-files-to-data-files - almost done vt #1709 create test to checkpoint and restore for collection with a rank with zero elements -working on it

nlslatt commented 2 years ago
Descriptor Information
Date [04/05/2022]
Attendees @nlslatt @jstrzebonski @JacobDomagala @nmm0 @stmcgovern @cz4rs @PhilMiller @ppebay
Description Weekly Meeting

Agenda:

@lifflander had a conflicting milestone meeting.

@jstrzebonski My update:

@JacobDomagala Update:

@nmm0 Update:

@stmcgovern My update:

@cz4rs Update:

@ppebay My update:

@nlslatt My update:

lifflander commented 2 years ago
Descriptor Information
Date [04/19/2022]
Attendees @nlslatt @jstrzebonski @JacobDomagala @nmm0 @stmcgovern @cz4rs @PhilMiller
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski My update: I'm working on https://github.com/DARMA-tasking/vt/issues/1715

@JacobDomagala My update: https://github.com/DARMA-tasking/vt/pull/1738 Ready to be merged https://github.com/DARMA-tasking/vt/pull/1730 Waiting for #1738

@nmm0 Hi all, sorry for the late update. I've been in meetings constantly this week. Writing tests for #1662

@stmcgovern My update: VT issue #1423 remove maximimum ref-count check -- PR1712 merged VT issue #1716-rename-lb-stats-files-to-data-files -- added to PR1731 VT issue #1709 create test to checkpoint and restore for collection with a rank with zero elements -- in progress

@cz4rs My update: vt #1672 - Total Work LB - PR #1695 ready for review vt #1702 - AppConfig crash - PR #1739 posted with a fresh ASan report Misc: posted vt #1740 - upgrade MPICH version magistrate #235 can probably be closed

@ppebay

@nlslatt My update: Helping Jonathan with https://github.com/DARMA-tasking/vt/pull/1707

lifflander commented 2 years ago
Descriptor Information
Date [04/26/2022]
Attendees @jstrzebonski @JacobDomagala @nmm0 @stmcgovern @cz4rs @PhilMiller @ppebay
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski My update: I'm working on https://github.com/DARMA-tasking/vt/issues/1715

@JacobDomagala My update: nothing on Darma for last week

@nmm0 Update: Working on test failures on with my Pr for 1662: https://github.com/DARMA-tasking/vt/pull/1666

@stmcgovern My update: VT issue #1716-rename-lb-stats-files-to-data-files -- added more to PR1731 VT issue #1709 create test to checkpoint and restore for collection with a rank with zero elements -- created PR1743 (draft)

@cz4rs My update: vt #1740 - upgrade MPICH version to 4.0.2 - PR #1741 ready for review vt #1695 - Total Work LB - addressing comments in PR #1695 Misc: Ubuntu 22.04 is available and I have switched one of vt's builds to use it in #1741. Available compilers: clang 11 - 14, gcc 9 - 12. Using Ubuntu 22.04 + clang-11 + UBSan combo causes all tests to fail with: ==2386==ERROR: UndefinedBehaviorSanitizer failed to allocate 0x0 (0) bytes of SetAlternateSignalStack (error code: 22) This seems to be a known issue in libsanitizer - no action required on our side, posted a question in LBAF #233 - possibly a bug in LBAF's LoadReader @PPP @Marcin Wróbel

@ppebay My update: reviewed VT PR 1695 reviewed and merged LBAF PR223 surveyed all remaining pending LBAF issues for work planning discussed LBAF #156 with MJ created LBAF #229 for strawman balancing algorithm for comparison purposes completed LBAF #79 #229 created PR 227 created LBAF #230 to overhaul configuration parameter handling of LBAF_app that is currently both unclear and inconsistent completed LBAF #230 worked with SDR on SC tutorial and abstract

@nlslatt

@PhilMiller:

lifflander commented 2 years ago
Descriptor Information
Date [05/10/2022]
Attendees @jstrzebonski @JacobDomagala @stmcgovern @cz4rs @PhilMiller @ppebay @nlslatt
Description Weekly Meeting

Agenda:

@lifflander:

@jstrzebonski My update: I resolved https://github.com/DARMA-tasking/vt/issues/1742 by adding a command to build_cpp.sh that sets vt src directory as safe; For https://github.com/DARMA-tasking/vt/issues/1763 I ensured that all pipelines are canceled whether new changes to a PR are pushed, and I blocked building drafts. I still need to research how to provide the possibility to force a build for a draft PR; https://github.com/DARMA-tasking/vt/pull/1754 is ready for review.

@JacobDomagala My update: Started working on https://github.com/DARMA-tasking/vt/issues/1753 and https://github.com/DARMA-tasking/build-stats/issues/13

@nmm0 Update: I will miss the meeting because I have to present at the Kokkos community BoF Been busy with other projects this week so will be putting in the PR for storing message size in the allocation block later today

@stmcgovern My update: Build without Libunwind: VT #1691 --PR1781 and VT #1786 -- PR1787. Still working on VT #1382.

@cz4rs My update: vt #1765: use vt compilation flags for bundled libraries - PR #1776 ready for review vt #1782: fix dangling pointer warning - PR #1783 approved, ready for merging vt #1672: Implement the total work LB algorithm - finishing PR #1695 Misc: We currently have 21 builds posting comments in PRs (Compilation - successful or any errors that occured). We could consider posting only on error.

@ppebay My update: reviewed VT PR 1695 reviewed and merged LBAF PR223 surveyed all remaining pending LBAF issues for work planning discussed LBAF #156 with MJ created LBAF #229 for strawman balancing algorithm for comparison purposes completed LBAF #79 #229 created PR 227 created LBAF #230 to overhaul configuration parameter handling of LBAF_app that is currently both unclear and inconsistent completed LBAF #230 worked with SDR on SC tutorial and abstract

@nlslatt My update: Merged https://github.com/DARMA-tasking/vt/pull/1760 Merged https://github.com/DARMA-tasking/vt/pull/1759 Merged https://github.com/DARMA-tasking/vt/pull/1758 Merged https://github.com/DARMA-tasking/vt/pull/1762 Added a type field to vt json files to indicate which schema is to be used in validating it Helped Jonathan debug https://github.com/DARMA-tasking/vt/pull/1766 Added to wiki a description of how to run lldb non-interactively for parallel debugging on a Mac Still working on https://github.com/DARMA-tasking/vt/pull/1707 Still working on https://github.com/DARMA-tasking/vt/pull/1720

@PhilMiller:

nmm0 commented 2 years ago
Descriptor Information
Date [05/24/2022]
Attendees [@jstrzebonski @JacobDomagala @stmcgovern @cz4rs @PhilMiller @ppebay @nlslatt @nmm0]
Description [Weekly Meeting]

Agenda:

Updates:

Additional Discussion

nmm0 commented 2 years ago
Descriptor Information
Date [05/31/2022]
Attendees @PhilMiller, @cz4rs, @JacobDomagala, @stmcgovern, @nlslatt, @jstrzebonski, @ppebay, @nmm0
Description Weekly Meeting

Agenda

Discussion

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [06/07/2022]
Attendees @lifflander, @cz4rs, @JacobDomagala, @PhilMiller, @ppebay, @thearusable, @nmm0
Description Weekly Meeting

Agenda

Discussion

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [06/14/2022]
Attendees @lifflander, @nlslatt, @cz4rs, @JacobDomagala, @jstrzebonski, @stmcgovern, @thearusable, @nmm0, @ppebay
Description Weekly Meeting

Agenda

Discussion

Updates

nlslatt commented 2 years ago
Descriptor Information
Date [06/21/2022]
Attendees @cz4rs @stmcgovern @jstrzebonski @thearusable @JacobDomagala @nlslatt @lifflander @ppebay
Description Weekly Meeting

Agenda:

@cz4rs Update:

@stmcgovern My update:

@jstrzebonski My update:

@thearusable Update:

@JacobDomagala Update:

@nlslatt My update:

@ppebay My upate:

@ppebay, @lifflander, and @nlslatt agreed that we should add an interface for manually specifying object to object communication for apps that do these comms using MPI RMA instead of through vt. Without knowledge of these edges, a comm-aware LB will not succeed.

nmm0 commented 2 years ago
Descriptor Information
Date [06/28/2022]
Attendees @lifflander, @thearusable, @cz4rs, @JacobDomagala, @jstrzebonski, @nlslatt, @PhilMiller, @nmm0, @ppebay
Description Weekly Meeting

Agenda

Discussion

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [07/05/2022]
Attendees @lifflander, @thearusable, @cz4rs, @JacobDomagala, @stmcgovern, @nmm0, @jstrzebonski, @ppebay
Description Weekly Meeting

Agenda

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [07/19/2022]
Attendees @lifflander, @nlslatt, @thearusable, @cz4rs, @stmcgovern, @nmm0, @jstrzebonski, @PhilMiller
Description Weekly Meeting

Agenda

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [07/26/2022]
Attendees @lifflander, @thearusable, @cz4rs, @stmcgovern, @nmm0, @jstrzebonski, @PhilMiller, @pierrepebay, @ppebay
Description Weekly Meeting

Agenda

Discussion

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [08/02/2022]
Attendees @lifflander, @nlslatt, @cz4rs, @stmcgovern, @nmm0, @PhilMiller, @pierrepebay, @ppebay
Description Weekly Meeting

Agenda

Discussion

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [08/09/2022]
Attendees @thearusable, @cz4rs, @JacobDomagala, @lifflander, @PhilMiller, @pierrepebay, @ppebay, @stmcgovern, @nmm0
Description Weekly Meeting

Agenda

Discussion

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [08/16/2022]
Attendees @thearusable, @cz4rs, @JacobDomagala, @PhilMiller, @pierrepebay, @ppebay, @stmcgovern, @nlslatt, @nmm0
Description Weekly Meeting

Agenda

Dicussion

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [08/23/2022]
Attendees @thearusable, @cz4rs, @JacobDomagala, @lifflander, @PhilMiller, @pierrepebay, @ppebay, @stmcgovern, @nlslatt, @nmm0
Description Weekly Meeting

Agenda

Dicussion

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [08/30/2022]
Attendees @thearusable, @JacobDomagala, @lifflander, @PhilMiller, @pierrepebay, @ppebay, @stmcgovern, @nlslatt, @nmm0
Description Weekly Meeting

Agenda

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [09/13/2022]
Attendees @thearusable, @cz4rs, @JacobDomagala, @lifflander, @PhilMiller, @ppebay, @stmcgovern, @nlslatt, @nmm0
Description Weekly Meeting

Agenda

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [09/27/2022]
Attendees @thearusable, @cz4rs, @JacobDomagala, @lifflander, @ppebay, @stmcgovern, @nmm0
Description Weekly Meeting

Agenda

Discussion

Updates

nmm0 commented 2 years ago
Descriptor Information
Date [10/04/2022]
Attendees @thearusable, @cz4rs, @JacobDomagala, @lifflander, @ppebay, @stmcgovern, @nlslatt, @nmm0
Description Weekly Meeting

Agenda

Discussion

Updates

nmm0 commented 1 year ago
Descriptor Information
Date [10/11/2022]
Attendees @thearusable, @cz4rs, @JacobDomagala, @lifflander, @ppebay, @stmcgovern, @nlslatt, @nmm0
Description Weekly Meeting

Agenda

Discussion

Updates

nlslatt commented 1 year ago
Descriptor Information
Date [10/18/2022]
Attendees @nlslatt @thearusable @PhilMiller @stmcgovern @JacobDomagala @ppebay
Description Weekly Meeting

Agenda:

@nlslatt My update:

@jstrzebonski My update:

@thearusable Update:

@PhilMiller I’m just returning from a break, so nothing to report

@stmcgovern My update:

@JacobDomagala Update:

@ppebay

nmm0 commented 1 year ago
Descriptor Information
Date [10/25/2022]
Attendees @thearusable, @cz4rs, @JacobDomagala, @lifflander, @PhilMiller, @ppebay, @stmcgovern, @nlslatt, @nmm0
Description Weekly Meeting

Agenda

Discussion

Updates