Ricks-Lab / gpu-utils

A set of utilities for monitoring and customizing GPU performance
GNU General Public License v3.0
142 stars 23 forks source link

User Guide - Contributors Needed #13

Closed Ricks-Lab closed 4 years ago

Ricks-Lab commented 5 years ago

I have started a new markdown format file as a User Guide. If you would like to contribute, just edit this file: USER_GUIDE.md and do a pull request. If you are not that familiar with markdown, don't worry about format too much, and I will tune the look and feel of the document. Thanks!

Ricks-Lab commented 5 years ago

@csecht I have made some progress in writing a user guide. It would be great if you could look it over and make modification that you think appropriate. You should be able to fork it, make edits, and make a pull request.

csecht commented 5 years ago

Yes, that worked. Something new for me! I only made a few edits, mostly typographical.
in the amdgpu-pac section, I feel like something should be said about how, in current version, fan speed with decrease with each PAC Save but that there are stable values, but don’t know whether that's specific to my system or general to other AMD cards, drivers, etc.

On Mar 19, 2019, at 8:45 AM, Rick notifications@github.com wrote:

@csecht https://github.com/csecht I have made some progress in writing a user guide. It would be great if you could look it over and make modification that you think appropriate. You should be able to fork it, make edits, and make a pull request.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/Ricks-Lab/amdgpu-utils/issues/13#issuecomment-474376737, or mute the thread https://github.com/notifications/unsubscribe-auth/AtlRQhp6F2qkr9WUCfFLtVXMLKjyCiG0ks5vYOoPgaJpZM4b5esq.

Ricks-Lab commented 5 years ago

Yes, that worked. Something new for me! I only made a few edits, mostly typographical. in the amdgpu-pac section, I feel like something should be said about how, in current version, fan speed with decrease with each PAC Save but that there are stable values, but don’t know whether that's specific to my system or general to other AMD cards, drivers, etc.

I saw the same effect for fans on the Radeon VII, so I think it is a good idea the mention it.

Have you done a pull request yet? It would be good to make sure we can merge your changes before we get to far.

csecht commented 5 years ago

Hmmm, I thought I did. I’m new at this, so let me check and try again

On Mar 19, 2019, at 6:17 PM, Rick notifications@github.com wrote:

Yes, that worked. Something new for me! I only made a few edits, mostly typographical. in the amdgpu-pac section, I feel like something should be said about how, in current version, fan speed with decrease with each PAC Save but that there are stable values, but don’t know whether that's specific to my system or general to other AMD cards, drivers, etc. … <x-msg://1/#> I saw the same effect for fans on the Radeon VII, so I think it is a good idea the mention it.

Have you done a pull request yet? It would be good to make sure we can merge your changes before we get to far.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/Ricks-Lab/amdgpu-utils/issues/13#issuecomment-474621232, or mute the thread https://github.com/notifications/unsubscribe-auth/AtlRQuy0RmVK0Q7IZLMy4VOeBq3rHinXks5vYXASgaJpZM4b5esq.

csecht commented 5 years ago

Yes, I have a pull request open under my cecht fork.

Ricks-Lab commented 5 years ago

I think that pull request only applies to your fork. You need to push to mine. I think the easiest approach is to click the top right icon for edit when viewing in my repository. This will create a fork and put you in edit mode with a button at the bottom to request pull.

Ricks-Lab commented 5 years ago

Here is an article on how to push a change from a clone of your fork to the original repository. I think that is the best approach. fork

Ricks-Lab commented 5 years ago

@csecht I was considering 2 more sections to the guide:

  1. GPU Type - a discussion on the characteristics and amdgpu-utils handling of pre and post Radeon vii GPUs.
  2. Describe how to have change effective on boot up.

Number requires some investigation.

csecht commented 5 years ago

Yes, sounds good. For #1, it would be good to get more BOINC crunchers trying out amdgpu-utils and giving feed back. That would provide real world info on the the full range of AMD cards. I’ve mentioned it in a couple posts a few weeks back but, other than Keith, haven’t heard of anyone else trying it. Maybe I should start an amdgpu-utils discussion thread at E@H? Others would need to follow up at S@H and M@H.

2 would be very handy indeed.

On Mar 21, 2019, at 7:57 PM, Rick notifications@github.com wrote:

@csecht https://github.com/csecht I was considering 2 more sections to the guide:

GPU Type - I discussion on the characteristics and andgpu-utils handling of pre and post Radeon vii GPUs. Describe how to have changed effective on boot up Number requires some investigation.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/Ricks-Lab/amdgpu-utils/issues/13#issuecomment-475456779, or mute the thread https://github.com/notifications/unsubscribe-auth/AtlRQtE_7tGJu3xX4ZHcgZvo9u4js6CXks5vZCqBgaJpZM4b5esq.

csecht commented 5 years ago

In the Getting Started section of the Users Guide, it says to check that an amdgpu driver package is installed with this command:

dpkg -l amdgpu-core amdgpu amdgpu-pro

On my system, Ubuntu 18.04.3, it reported:

Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name                     Version           Architecture      Description
+++-========================-=================-=================-=====================================================
ii  amdgpu-core              19.20-812932      all               Core meta package for unified amdgpu driver.
dpkg-query: no packages found matching amdgpu
dpkg-query: no packages found matching amdgpu-pro

which is actually sufficient, but only so long as the opencl drivers have been installed from within a amdgpu-pro-19.20-* package directory using this command, for example:

<user>:~/amdgpu-pro-19.20-812932-ubuntu-18.04$ ./amdgpu-pro-install --opencl=legacy --headless

For Polaris and earlier AMD GPUs, the 5.0.0 Linux kernel has the necessary amdgpu drivers, so no additional amdgpu driver installation is needed, just the OpenCL components from the amdgpu-pro package. (I'm assuming for Vega cards that the amdgpu-pro driver stack does need to be installed, but am unsure.) In fact, when I install 19.20 amdgpu drivers from a downloaded AMD package, something in that installation prevents Ubuntu Desktop from loading after a reboot, resulting in a login screen loop (while fixable, is a bit of a pain, and somewhat panic-inducing for inexperienced users.) I don't know how it works with other AMD GPU configurations, but it seems like more information or clarification is needed concerning the dpkg check for amdgpu drivers. For example, when it reports "no packages found matching amdgpu", folks shouldn't think they need to install amdgpu or amdgpu-pro when amdgpu-core alone is sufficient. I'm not sure how this should be generally worded in the User Guide. Is it okay if only one of the three packages are reported as installed? Also, perhaps users should check whether OpenCL is installed? I know this is needed for Einstein@Home crunching, but not required for amdgpu-utils. This may be too much detail for the User Guide, but on the other hand that section is about amdgpu installation.

Ricks-Lab commented 5 years ago

Good point. The original comment of checking all three packages is how it is implemented in the code. It checks all 3, one at a time, and verifies that at least one is valid. I have modified the user_guide to indicate the execution of the following to check:

dpkg -l 'amdgpu*'
csecht commented 5 years ago

Much better!

On Sep 12, 2019, at 9:26 AM, Rick notifications@github.com wrote:

Good point. The original comment of checking all three packages is how it is implemented in the code. It checks all 3, one at a time, and verifies that at least one is valid. I have modified the user_guide to indicate the execution of the following to check:

dpkg -l 'amdgpu*' — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/Ricks-Lab/amdgpu-utils/issues/13?email_source=notifications&email_token=ALMVCQUPIMPD7CDBQQSD2ODQJJGRDA5CNFSM4G7F5MVKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD6SCKDI#issuecomment-530851085, or mute the thread https://github.com/notifications/unsubscribe-auth/ALMVCQT52B2VIBSDWOVM7BDQJJGRDANCNFSM4G7F5MVA.