SAIN-Project / Frontend

0 stars 0 forks source link

Please add the following ground-truth architectures to the SAIN datasets #15

Open jgarci40 opened 3 years ago

jgarci40 commented 3 years ago

Please add the following ground-truth architectures for individual versions of Apache Hadoop, Bash (Bourne Again Shell), Apache OODT, and ArchStudio: https://softarch.usc.edu/wiki/doku.php?id=recoveries:start

Please add the associated source code of these versions of Hadoop, Bash, OODT, and ArchStudio to the SAIN datasets as well.

jgarci40 commented 3 years ago

Please add the following ground-truth architectures and associated source code as well: https://www.cs.purdue.edu/homes/lintan/ArchRecovery/

jgarci40 commented 2 years ago

@khoiminhpham @Ibrahimvse @MehdiMirakhorli Whatever happened to this issue guys? :)

khoiminhpham commented 2 years ago

Doesn't look like any one of us have solved this yet :D Leave it to me Sempai Josh, tmr I'll have a look at it.

Best,

On Sun, Feb 20, 2022 at 7:41 PM Joshua Garcia @.***> wrote:

@khoiminhpham https://github.com/khoiminhpham @Ibrahimvse https://github.com/Ibrahimvse @MehdiMirakhorli https://github.com/MehdiMirakhorli Whatever happened to this issue guys? :)

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1046443321, or unsubscribe https://github.com/notifications/unsubscribe-auth/ALKELTB7JHCSXVODWVHEAHTU4GYAPANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you were mentioned.Message ID: @.***>

jgarci40 commented 2 years ago

Thanks, Khoi!

Josh

On Sun, Feb 20, 2022, 10:37 PM khoiminhpham @.***> wrote:

Doesn't look like any one of us have solved this yet :D Leave it to me Sempai Josh, tmr I'll have a look at it.

Best,

On Sun, Feb 20, 2022 at 7:41 PM Joshua Garcia @.***> wrote:

@khoiminhpham https://github.com/khoiminhpham @Ibrahimvse https://github.com/Ibrahimvse @MehdiMirakhorli https://github.com/MehdiMirakhorli Whatever happened to this issue guys? :)

— Reply to this email directly, view it on GitHub < https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1046443321 , or unsubscribe < https://github.com/notifications/unsubscribe-auth/ALKELTB7JHCSXVODWVHEAHTU4GYAPANCNFSM4Y2JUPMA

. Triage notifications on the go with GitHub Mobile for iOS < https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675

or Android < https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub .

You are receiving this because you were mentioned.Message ID: @.***>

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1046521045, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2UWDCWKTTE7GXJJU6LU4HMR5ANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>

jgarci40 commented 2 years ago

Hi Khoi,

Could I please get an update on this issue?

Thanks, Josh

On Sun, Feb 20, 2022 at 11:40 PM Joshua Garcia @.***> wrote:

Thanks, Khoi!

Josh

On Sun, Feb 20, 2022, 10:37 PM khoiminhpham @.***> wrote:

Doesn't look like any one of us have solved this yet :D Leave it to me Sempai Josh, tmr I'll have a look at it.

Best,

On Sun, Feb 20, 2022 at 7:41 PM Joshua Garcia @.***> wrote:

@khoiminhpham https://github.com/khoiminhpham @Ibrahimvse https://github.com/Ibrahimvse @MehdiMirakhorli https://github.com/MehdiMirakhorli Whatever happened to this issue guys? :)

— Reply to this email directly, view it on GitHub < https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1046443321 , or unsubscribe < https://github.com/notifications/unsubscribe-auth/ALKELTB7JHCSXVODWVHEAHTU4GYAPANCNFSM4Y2JUPMA

. Triage notifications on the go with GitHub Mobile for iOS < https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675

or Android < https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub .

You are receiving this because you were mentioned.Message ID: @.***>

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1046521045, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2UWDCWKTTE7GXJJU6LU4HMR5ANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>

khoiminhpham commented 2 years ago

Hi Sempai Josh,

I've added projects in your 1st link to SAIN datasets, (only thing missing is bash 1.14.4, src code, I can't find it). In your 2nd link, the projects don't have specific version, I'd need to know what version they are. Also I can see Bash and ArchStudio appear again, should we add new dataset or should we merge them with the recently uploaded bash 1.14.4 and ArchStudio 4 datasets?

Thanks,

jgarci40 commented 2 years ago

Here's the source for Bash 1.14.4: https://www.ibiblio.org/pub/Linux/system/shells/!INDEX.short.html Here's a direct link to the source too: https://www.ibiblio.org/pub/Linux/system/shells/bash-1.14.4.tgz

For the second link, the version info can be found in this paper: https://jgarcia.ics.uci.edu//wp-content/uploads/lutellier_tse_2017.pdf

If you have a hard time getting hold of the source or other artifacts, please contact the first author of that paper to ask him and CC me on the email.

Thanks again, @Khoi Pham @.***>!

Josh

On Wed, Feb 23, 2022 at 3:08 PM khoiminhpham @.***> wrote:

Hi Sempai Josh,

I've added projects in your 1st link to SAIN datasets, (only thing missing is bash 1.14.4, src code, I can't find it). In your 2nd link, the projects don't have specific version, I'd need to know what version they are. Also I can see Bash and ArchStudio appear again, should we add new dataset or should we merge them with the recently uploaded bash 1.14.4 and ArchStudio 4 datasets?

Thanks,

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1049302030, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2TNHZCACB5X5QXK7C3U4VR2FANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>

khoiminhpham commented 2 years ago

Thank you Sempai Josh, I've uploaded all of them, last thing is, in the paper and in your 1st link we both have ArchStudio 4. Should I put those files together in one datasets, or do you want to separate them?

Thanks,

jgarci40 commented 2 years ago

What kind of files do you intend to combine together? If you mean the ground-truth architecture and source for ArchStudio 4, you can combine them. Isn't that what you've been doing for other projects already?

On Thu, Feb 24, 2022 at 12:15 AM khoiminhpham @.***> wrote:

Thank you Sempai Josh, I've uploaded all of them, last thing is, in the paper and in your 1st link we both have ArchStudio 4. Should I put those files together in one datasets, or do you want to separate them?

Thanks,

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1049597926, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2TN4GHKOLBXODVW7UTU4XSHPANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>

khoiminhpham commented 2 years ago

These files https://user-images.githubusercontent.com/47465932/155608859-fa22e381-a44f-44c5-8ae3-f31be9923574.JPG and these files https://user-images.githubusercontent.com/47465932/155608862-191e577a-28c9-4156-8013-9ccfb9caf025.png I can see the rsf files are identical, should we keep them all under one dataset Sempai ?

Thanks,

jgarci40 commented 2 years ago

Sure. That works. @khoiminhpham

khoiminhpham commented 2 years ago

Alright, everything is done Sempai Josh.

Best,

jgarci40 commented 2 years ago

Hi Taran,

Could you please test this out? You should be able to run an architecture recovery (e.g., ACDC or BatchClusteringEngine, which is actually called ARC, from the Architecture Reconstruction pane) on any of these systems that Khoi added and then use a2a, c2c, or cvg* (available in the Architectural Analysis and Metrics pane) to compare the recovered architecture with the ground-truth architecture. There are a bunch of details there that may be tricky (e.g., how do you supply the source so that ACDC or ARC can process it or which specific file to use). I think you, me, and Khoi may need to set up a meeting to hash this out. What do you think? Essentially, you are going to try to perform a partial replication of these papers:

  1. Joshua Garcia, Igor Ivkovic, and Nenad Medvidović. A Comparative Analysis of Software Architecture Recovery Techniques https://www.dropbox.com/s/7afeoly4bm7p93s/Garcia%20et%20al.%20-%202013%20-%20A%20Comparative%20Analysis%20of%20Software%20Architecture%20Re.pdf?dl=0. In the Proceedings of the 28th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2013.
  2. Thibaud Lutellier, Devin Chollak, Joshua Garcia, Lin Tan, Derek Rayside, Nenad Medvidovic, and Robert Kroeger. Measuring the Impact of Code Dependencies on Software Architecture Recovery Techniques https://jgarcia.ics.uci.edu//wp-content/uploads/lutellier_tse_2017.pdf. IEEE Transactions on Software Engineering (TSE), 2017.

Do you think you need a meeting with me and Khoi to see how to do this for at least one of the ground-truth datasets?

I'm also CCing Sam and Neno since I think they'd be interested in how this goes or at least be aware that we're doing this.

Best, Josh

---------- Forwarded message --------- From: khoiminhpham @.> Date: Thu, Feb 24, 2022 at 2:30 PM Subject: Re: [SAIN-Project/Frontend] Please add the following ground-truth architectures to the SAIN datasets (#15) To: SAIN-Project/Frontend @.> Cc: Joshua Garcia @.>, Author @.>

Alright, everything is done Sempai Josh.

Best,

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1050325608, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2V3IZPG64JOHDRY4B3U42WKHANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>

jgarci40 commented 2 years ago

I'm afraid the source code (in each project) I uploaded in those datasets aren't ready to run with acdc/batchclustering component, Sempai Josh. At least I'll need to generate jar files from those src code and add those jar files in the dataset too.

Thanks,

On Thu, Feb 24, 2022, 4:40 PM Joshua Garcia @.***> wrote:

Hi Taran,

Could you please test this out? You should be able to run an architecture recovery (e.g., ACDC or BatchClusteringEngine, which is actually called ARC, from the Architecture Reconstruction pane) on any of these systems that Khoi added and then use a2a, c2c, or cvg* (available in the Architectural Analysis and Metrics pane) to compare the recovered architecture with the ground-truth architecture. There are a bunch of details there that may be tricky (e.g., how do you supply the source so that ACDC or ARC can process it or which specific file to use). I think you, me, and Khoi may need to set up a meeting to hash this out. What do you think? Essentially, you are going to try to perform a partial replication of these papers:

  1. Joshua Garcia, Igor Ivkovic, and Nenad Medvidović. A Comparative Analysis of Software Architecture Recovery Techniques https://www.dropbox.com/s/7afeoly4bm7p93s/Garcia%20et%20al.%20-%202013%20-%20A%20Comparative%20Analysis%20of%20Software%20Architecture%20Re.pdf?dl=0. In the Proceedings of the 28th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2013.
  2. Thibaud Lutellier, Devin Chollak, Joshua Garcia, Lin Tan, Derek Rayside, Nenad Medvidovic, and Robert Kroeger. Measuring the Impact of Code Dependencies on Software Architecture Recovery Techniques https://jgarcia.ics.uci.edu//wp-content/uploads/lutellier_tse_2017.pdf. IEEE Transactions on Software Engineering (TSE), 2017.

Do you think you need a meeting with me and Khoi to see how to do this for at least one of the ground-truth datasets?

I'm also CCing Sam and Neno since I think they'd be interested in how this goes or at least be aware that we're doing this.

Best, Josh

---------- Forwarded message --------- From: khoiminhpham @.> Date: Thu, Feb 24, 2022 at 2:30 PM Subject: Re: [SAIN-Project/Frontend] Please add the following ground-truth architectures to the SAIN datasets (#15) To: SAIN-Project/Frontend @.> Cc: Joshua Garcia @.>, Author @.>

Alright, everything is done Sempai Josh.

Best,

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1050325608, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2V3IZPG64JOHDRY4B3U42WKHANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>

jgarci40 commented 2 years ago

@Khoi Pham @.>, I see. I know building some of these projects can be a pain. Crista has some technology that is supposed to simplify this for Java systems because the Gradle and Maven build files don't necessarily work well for open-source projects. I have a student, @Yuqi Huai @.>, that is working with one of Crista's students on her build tool. Maybe, we should incorporate it into SAIN. @Yuqi Huai @.***>, what do you think? I know the tool may have some issues working with Java 9+ projects, but maybe we can ask Rakib and Crista about trying to add their tool to SAIN.

Best, Josh

On Thu, Feb 24, 2022 at 6:05 PM Khoi Pham @.***> wrote:

I'm afraid the source code (in each) project I uploaded in those datasets aren't ready to run with acdc/batchclustering component, Sempai Josh. At least I'll need to generate jar files from those src code and add those jar files in the dataset too.

Thanks,

On Thu, Feb 24, 2022, 4:40 PM Joshua Garcia @.***> wrote:

Hi Taran,

Could you please test this out? You should be able to run an architecture recovery (e.g., ACDC or BatchClusteringEngine, which is actually called ARC, from the Architecture Reconstruction pane) on any of these systems that Khoi added and then use a2a, c2c, or cvg* (available in the Architectural Analysis and Metrics pane) to compare the recovered architecture with the ground-truth architecture. There are a bunch of details there that may be tricky (e.g., how do you supply the source so that ACDC or ARC can process it or which specific file to use). I think you, me, and Khoi may need to set up a meeting to hash this out. What do you think? Essentially, you are going to try to perform a partial replication of these papers:

  1. Joshua Garcia, Igor Ivkovic, and Nenad Medvidović. A Comparative Analysis of Software Architecture Recovery Techniques https://www.dropbox.com/s/7afeoly4bm7p93s/Garcia%20et%20al.%20-%202013%20-%20A%20Comparative%20Analysis%20of%20Software%20Architecture%20Re.pdf?dl=0. In the Proceedings of the 28th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2013.
  2. Thibaud Lutellier, Devin Chollak, Joshua Garcia, Lin Tan, Derek Rayside, Nenad Medvidovic, and Robert Kroeger. Measuring the Impact of Code Dependencies on Software Architecture Recovery Techniques https://jgarcia.ics.uci.edu//wp-content/uploads/lutellier_tse_2017.pdf. IEEE Transactions on Software Engineering (TSE), 2017.

Do you think you need a meeting with me and Khoi to see how to do this for at least one of the ground-truth datasets?

I'm also CCing Sam and Neno since I think they'd be interested in how this goes or at least be aware that we're doing this.

Best, Josh

---------- Forwarded message --------- From: khoiminhpham @.> Date: Thu, Feb 24, 2022 at 2:30 PM Subject: Re: [SAIN-Project/Frontend] Please add the following ground-truth architectures to the SAIN datasets (#15) To: SAIN-Project/Frontend @.> Cc: Joshua Garcia @.>, Author @.>

Alright, everything is done Sempai Josh.

Best,

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1050325608, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2V3IZPG64JOHDRY4B3U42WKHANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>

jgarci40 commented 2 years ago

Hi Dr. Garcia,

I am open to having a meeting about this issue to resolve it, and I would be interested to see how to do it for at least one of the ground-truth datasets. Thanks.

Tarandeep

On Thu, Feb 24, 2022 at 7:10 PM Joshua Garcia @.***> wrote:

@Khoi Pham @.>, I see. I know building some of these projects can be a pain. Crista has some technology that is supposed to simplify this for Java systems because the Gradle and Maven build files don't necessarily work well for open-source projects. I have a student, @Yuqi Huai @.>, that is working with one of Crista's students on her build tool. Maybe, we should incorporate it into SAIN. @Yuqi Huai @.***>, what do you think? I know the tool may have some issues working with Java 9+ projects, but maybe we can ask Rakib and Crista about trying to add their tool to SAIN.

Best, Josh

On Thu, Feb 24, 2022 at 6:05 PM Khoi Pham @.***> wrote:

I'm afraid the source code (in each) project I uploaded in those datasets aren't ready to run with acdc/batchclustering component, Sempai Josh. At least I'll need to generate jar files from those src code and add those jar files in the dataset too.

Thanks,

On Thu, Feb 24, 2022, 4:40 PM Joshua Garcia @.***> wrote:

Hi Taran,

Could you please test this out? You should be able to run an architecture recovery (e.g., ACDC or BatchClusteringEngine, which is actually called ARC, from the Architecture Reconstruction pane) on any of these systems that Khoi added and then use a2a, c2c, or cvg* (available in the Architectural Analysis and Metrics pane) to compare the recovered architecture with the ground-truth architecture. There are a bunch of details there that may be tricky (e.g., how do you supply the source so that ACDC or ARC can process it or which specific file to use). I think you, me, and Khoi may need to set up a meeting to hash this out. What do you think? Essentially, you are going to try to perform a partial replication of these papers:

  1. Joshua Garcia, Igor Ivkovic, and Nenad Medvidović. A Comparative Analysis of Software Architecture Recovery Techniques https://www.dropbox.com/s/7afeoly4bm7p93s/Garcia%20et%20al.%20-%202013%20-%20A%20Comparative%20Analysis%20of%20Software%20Architecture%20Re.pdf?dl=0. In the Proceedings of the 28th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2013.
  2. Thibaud Lutellier, Devin Chollak, Joshua Garcia, Lin Tan, Derek Rayside, Nenad Medvidovic, and Robert Kroeger. Measuring the Impact of Code Dependencies on Software Architecture Recovery Techniques https://jgarcia.ics.uci.edu//wp-content/uploads/lutellier_tse_2017.pdf. IEEE Transactions on Software Engineering (TSE), 2017.

Do you think you need a meeting with me and Khoi to see how to do this for at least one of the ground-truth datasets?

I'm also CCing Sam and Neno since I think they'd be interested in how this goes or at least be aware that we're doing this.

Best, Josh

---------- Forwarded message --------- From: khoiminhpham @.> Date: Thu, Feb 24, 2022 at 2:30 PM Subject: Re: [SAIN-Project/Frontend] Please add the following ground-truth architectures to the SAIN datasets (#15) To: SAIN-Project/Frontend @.> Cc: Joshua Garcia @.>, Author @.>

Alright, everything is done Sempai Josh.

Best,

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1050325608, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2V3IZPG64JOHDRY4B3U42WKHANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>

jgarci40 commented 2 years ago

I'll send a separate message to you, @Tarandeep Singh @.***>, and Khoi to set up a time.

Thanks, Josh

On Thu, Feb 24, 2022 at 7:34 PM Tarandeep Singh @.***> wrote:

Hi Dr. Garcia,

I am open to having a meeting about this issue to resolve it, and I would be interested to see how to do it for at least one of the ground-truth datasets. Thanks.

Tarandeep

On Thu, Feb 24, 2022 at 7:10 PM Joshua Garcia @.***> wrote:

@Khoi Pham @.>, I see. I know building some of these projects can be a pain. Crista has some technology that is supposed to simplify this for Java systems because the Gradle and Maven build files don't necessarily work well for open-source projects. I have a student, @Yuqi Huai @.>, that is working with one of Crista's students on her build tool. Maybe, we should incorporate it into SAIN. @Yuqi Huai @.***>, what do you think? I know the tool may have some issues working with Java 9+ projects, but maybe we can ask Rakib and Crista about trying to add their tool to SAIN.

Best, Josh

On Thu, Feb 24, 2022 at 6:05 PM Khoi Pham @.***> wrote:

I'm afraid the source code (in each) project I uploaded in those datasets aren't ready to run with acdc/batchclustering component, Sempai Josh. At least I'll need to generate jar files from those src code and add those jar files in the dataset too.

Thanks,

On Thu, Feb 24, 2022, 4:40 PM Joshua Garcia @.***> wrote:

Hi Taran,

Could you please test this out? You should be able to run an architecture recovery (e.g., ACDC or BatchClusteringEngine, which is actually called ARC, from the Architecture Reconstruction pane) on any of these systems that Khoi added and then use a2a, c2c, or cvg* (available in the Architectural Analysis and Metrics pane) to compare the recovered architecture with the ground-truth architecture. There are a bunch of details there that may be tricky (e.g., how do you supply the source so that ACDC or ARC can process it or which specific file to use). I think you, me, and Khoi may need to set up a meeting to hash this out. What do you think? Essentially, you are going to try to perform a partial replication of these papers:

  1. Joshua Garcia, Igor Ivkovic, and Nenad Medvidović. A Comparative Analysis of Software Architecture Recovery Techniques https://www.dropbox.com/s/7afeoly4bm7p93s/Garcia%20et%20al.%20-%202013%20-%20A%20Comparative%20Analysis%20of%20Software%20Architecture%20Re.pdf?dl=0. In the Proceedings of the 28th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2013.
  2. Thibaud Lutellier, Devin Chollak, Joshua Garcia, Lin Tan, Derek Rayside, Nenad Medvidovic, and Robert Kroeger. Measuring the Impact of Code Dependencies on Software Architecture Recovery Techniques https://jgarcia.ics.uci.edu//wp-content/uploads/lutellier_tse_2017.pdf. IEEE Transactions on Software Engineering (TSE), 2017.

Do you think you need a meeting with me and Khoi to see how to do this for at least one of the ground-truth datasets?

I'm also CCing Sam and Neno since I think they'd be interested in how this goes or at least be aware that we're doing this.

Best, Josh

---------- Forwarded message --------- From: khoiminhpham @.> Date: Thu, Feb 24, 2022 at 2:30 PM Subject: Re: [SAIN-Project/Frontend] Please add the following ground-truth architectures to the SAIN datasets (#15) To: SAIN-Project/Frontend @.> Cc: Joshua Garcia @.>, Author @.>

Alright, everything is done Sempai Josh.

Best,

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1050325608, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2V3IZPG64JOHDRY4B3U42WKHANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>

jgarci40 commented 2 years ago

Hi all!

I think adding the tool to SAIN can be helpful. From what I have learned the tool will provide a large collection of compilable Java projects. The idea is we can run further analysis on those compiled Java projects.

I asked Rakib to compile a small number of repositories for me, so far those repositories haven't been able to be compiled. I am not exactly sure why therefore I need more time understanding what the tool exactly does and how to use it.

The most recent update I got is, they will begin compiling at the beginning of March, I should soon have a better idea of how and if we should add it to SAIN.

Sincerely, Yuqi Huai

On Thu, Feb 24, 2022 at 7:54 PM Joshua Garcia @.***> wrote:

I'll send a separate message to you, @Tarandeep Singh @.***>, and Khoi to set up a time.

Thanks, Josh

On Thu, Feb 24, 2022 at 7:34 PM Tarandeep Singh @.***> wrote:

Hi Dr. Garcia,

I am open to having a meeting about this issue to resolve it, and I would be interested to see how to do it for at least one of the ground-truth datasets. Thanks.

Tarandeep

On Thu, Feb 24, 2022 at 7:10 PM Joshua Garcia @.***> wrote:

@Khoi Pham @.>, I see. I know building some of these projects can be a pain. Crista has some technology that is supposed to simplify this for Java systems because the Gradle and Maven build files don't necessarily work well for open-source projects. I have a student, @Yuqi Huai @.>, that is working with one of Crista's students on her build tool. Maybe, we should incorporate it into SAIN. @Yuqi Huai @.***>, what do you think? I know the tool may have some issues working with Java 9+ projects, but maybe we can ask Rakib and Crista about trying to add their tool to SAIN.

Best, Josh

On Thu, Feb 24, 2022 at 6:05 PM Khoi Pham @.***> wrote:

I'm afraid the source code (in each) project I uploaded in those datasets aren't ready to run with acdc/batchclustering component, Sempai Josh. At least I'll need to generate jar files from those src code and add those jar files in the dataset too.

Thanks,

On Thu, Feb 24, 2022, 4:40 PM Joshua Garcia @.***> wrote:

Hi Taran,

Could you please test this out? You should be able to run an architecture recovery (e.g., ACDC or BatchClusteringEngine, which is actually called ARC, from the Architecture Reconstruction pane) on any of these systems that Khoi added and then use a2a, c2c, or cvg* (available in the Architectural Analysis and Metrics pane) to compare the recovered architecture with the ground-truth architecture. There are a bunch of details there that may be tricky (e.g., how do you supply the source so that ACDC or ARC can process it or which specific file to use). I think you, me, and Khoi may need to set up a meeting to hash this out. What do you think? Essentially, you are going to try to perform a partial replication of these papers:

  1. Joshua Garcia, Igor Ivkovic, and Nenad Medvidović. A Comparative Analysis of Software Architecture Recovery Techniques https://www.dropbox.com/s/7afeoly4bm7p93s/Garcia%20et%20al.%20-%202013%20-%20A%20Comparative%20Analysis%20of%20Software%20Architecture%20Re.pdf?dl=0. In the Proceedings of the 28th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2013.
  2. Thibaud Lutellier, Devin Chollak, Joshua Garcia, Lin Tan, Derek Rayside, Nenad Medvidovic, and Robert Kroeger. Measuring the Impact of Code Dependencies on Software Architecture Recovery Techniques https://jgarcia.ics.uci.edu//wp-content/uploads/lutellier_tse_2017.pdf. IEEE Transactions on Software Engineering (TSE), 2017.

Do you think you need a meeting with me and Khoi to see how to do this for at least one of the ground-truth datasets?

I'm also CCing Sam and Neno since I think they'd be interested in how this goes or at least be aware that we're doing this.

Best, Josh

---------- Forwarded message --------- From: khoiminhpham @.> Date: Thu, Feb 24, 2022 at 2:30 PM Subject: Re: [SAIN-Project/Frontend] Please add the following ground-truth architectures to the SAIN datasets (#15) To: SAIN-Project/Frontend @.> Cc: Joshua Garcia @.>, Author @.

Alright, everything is done Sempai Josh.

Best,

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1050325608, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2V3IZPG64JOHDRY4B3U42WKHANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>

jgarci40 commented 2 years ago

Thanks a lot for the input, Yuqi!

Josh

On Fri, Feb 25, 2022 at 3:04 PM Yuqi Huai @.***> wrote:

Hi all!

I think adding the tool to SAIN can be helpful. From what I have learned the tool will provide a large collection of compilable Java projects. The idea is we can run further analysis on those compiled Java projects.

I asked Rakib to compile a small number of repositories for me, so far those repositories haven't been able to be compiled. I am not exactly sure why therefore I need more time understanding what the tool exactly does and how to use it.

The most recent update I got is, they will begin compiling at the beginning of March, I should soon have a better idea of how and if we should add it to SAIN.

Sincerely, Yuqi Huai

On Thu, Feb 24, 2022 at 7:54 PM Joshua Garcia @.***> wrote:

I'll send a separate message to you, @Tarandeep Singh @.***>, and Khoi to set up a time.

Thanks, Josh

On Thu, Feb 24, 2022 at 7:34 PM Tarandeep Singh @.***> wrote:

Hi Dr. Garcia,

I am open to having a meeting about this issue to resolve it, and I would be interested to see how to do it for at least one of the ground-truth datasets. Thanks.

Tarandeep

On Thu, Feb 24, 2022 at 7:10 PM Joshua Garcia @.***> wrote:

@Khoi Pham @.>, I see. I know building some of these projects can be a pain. Crista has some technology that is supposed to simplify this for Java systems because the Gradle and Maven build files don't necessarily work well for open-source projects. I have a student, @Yuqi Huai @.>, that is working with one of Crista's students on her build tool. Maybe, we should incorporate it into SAIN. @Yuqi Huai @.***>, what do you think? I know the tool may have some issues working with Java 9+ projects, but maybe we can ask Rakib and Crista about trying to add their tool to SAIN.

Best, Josh

On Thu, Feb 24, 2022 at 6:05 PM Khoi Pham @.***> wrote:

I'm afraid the source code (in each) project I uploaded in those datasets aren't ready to run with acdc/batchclustering component, Sempai Josh. At least I'll need to generate jar files from those src code and add those jar files in the dataset too.

Thanks,

On Thu, Feb 24, 2022, 4:40 PM Joshua Garcia @.***> wrote:

Hi Taran,

Could you please test this out? You should be able to run an architecture recovery (e.g., ACDC or BatchClusteringEngine, which is actually called ARC, from the Architecture Reconstruction pane) on any of these systems that Khoi added and then use a2a, c2c, or cvg* (available in the Architectural Analysis and Metrics pane) to compare the recovered architecture with the ground-truth architecture. There are a bunch of details there that may be tricky (e.g., how do you supply the source so that ACDC or ARC can process it or which specific file to use). I think you, me, and Khoi may need to set up a meeting to hash this out. What do you think? Essentially, you are going to try to perform a partial replication of these papers:

  1. Joshua Garcia, Igor Ivkovic, and Nenad Medvidović. A Comparative Analysis of Software Architecture Recovery Techniques https://www.dropbox.com/s/7afeoly4bm7p93s/Garcia%20et%20al.%20-%202013%20-%20A%20Comparative%20Analysis%20of%20Software%20Architecture%20Re.pdf?dl=0. In the Proceedings of the 28th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2013.
  2. Thibaud Lutellier, Devin Chollak, Joshua Garcia, Lin Tan, Derek Rayside, Nenad Medvidovic, and Robert Kroeger. Measuring the Impact of Code Dependencies on Software Architecture Recovery Techniques https://jgarcia.ics.uci.edu//wp-content/uploads/lutellier_tse_2017.pdf. IEEE Transactions on Software Engineering (TSE), 2017.

Do you think you need a meeting with me and Khoi to see how to do this for at least one of the ground-truth datasets?

I'm also CCing Sam and Neno since I think they'd be interested in how this goes or at least be aware that we're doing this.

Best, Josh

---------- Forwarded message --------- From: khoiminhpham @.> Date: Thu, Feb 24, 2022 at 2:30 PM Subject: Re: [SAIN-Project/Frontend] Please add the following ground-truth architectures to the SAIN datasets (#15) To: SAIN-Project/Frontend @.> Cc: Joshua Garcia @.>, Author < @.>

Alright, everything is done Sempai Josh.

Best,

— Reply to this email directly, view it on GitHub https://github.com/SAIN-Project/Frontend/issues/15#issuecomment-1050325608, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACDUX2V3IZPG64JOHDRY4B3U42WKHANCNFSM4Y2JUPMA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you authored the thread.Message ID: @.***>