UNCG-CSE / Library-Computer-Usage-Analysis

The University Libraries at UNCG currently track the state of a computer, determining whether or not a particular computer is in use. This data is compiled into a database, and a web app pulls from this database to show a map and number of available computers. As of Fall 2017, the data had not been used to determine which computers are used more frequently, aside from counting the number of times a computer transitions into/away from the 'in-use' state. This project attempts to correlate the usage of these computers with various factors, including: campus scheduling, equipment configuration, placement, population in the library, and area weather. Using this data, this project also uses machine learning to determine the best placement of computers for future allocation, and possible reconfiguration of equipment and space.
1 stars 1 forks source link

Computers that don't exist #61

Closed PatriciaTanzer closed 6 years ago

PatriciaTanzer commented 6 years ago

As I'm looking at our utilization data, I think some statistics might be really low because we are including all the years for which usage doesn't exist. For example, it doesn't look as if BL001 was every in use in 2010, and I think that's because we didn't even have it at the time. But entries do exists - those entries are 0.0, not NaN. (I'm talking about the pickled data from LibData.pkl).

Do we have some dates for when computers were purchased/added to libraries? I think we should add code to discard entries for those computers before said dates and see if anything changes.

smindinvern commented 6 years ago

Good catch. We should be able to do that just by looking at the first event involving each computer, i.e. that would be the time at which it was installed.

I'll look at doing that hopefully this weekend.

Thanks, Nick

On November 11, 2017 4:55:36 PM EST, Patricia Tanzer notifications@github.com wrote:

As I'm looking at our utilization data, I think some statistics might be really low because we are including all the years for which usage doesn't exist. For example, it doesn't look as if BL001 was every in use in 2010, and I think that's because we didn't even have it at the time. But entries do exists - those entries are 0.0, not NaN. (I'm talking about the pickled data from LibData.pkl).

Do we have some dates for when computers were purchased/added to libraries? I think we should add code to discard entries for those computers before said dates and see if anything changes.

-- You are receiving this because you are subscribed to this thread. Reply to this email directly or view it on GitHub: https://github.com/UNCG-CSE/Library-Computer-Usage-Analysis/issues/61

-- Sent from my Android device with K-9 Mail. Please excuse my brevity.

brownworth commented 6 years ago

Unfortunately, I don't have a way to determine when a machine was placed in that location. I figure we can look at this one of two ways:

I cannot speak to which would be necessarily better at this point. I can say that we only have 2 weeks to have this finished (Nov. 27) and the grad students in this group still need to write a paper. Considering those elements, I would go for least effort.

PatriciaTanzer commented 6 years ago

I've been working in Assignment 4 with this problem and divided the usage stats by semester. (Will upload the code once I get home). If a computer's first use is during a particular semester, I think we should drop it from that semester and any preceding ones. That should be sufficient, and not too difficult to code.

Sent from my iPhone

On Nov 12, 2017, at 10:19, Brown Biggers notifications@github.com<mailto:notifications@github.com> wrote:

Unfortunately, I don't have a way to determine when a machine was placed in that location. I figure we can look at this one of two ways:

I cannot speak to which would be necessarily better at this point. I can say that we only have 2 weeks to have this finished (Nov. 27) and the grad students in this group still need to write a paper. Considering those elements, I would go for least effort.

— You are receiving this because you were assigned. Reply to this email directly, view it on GitHubhttps://github.com/UNCG-CSE/Library-Computer-Usage-Analysis/issues/61#issuecomment-343744390, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AdqT4jbZ2iHIJxYFr6e5VM7iunTOzs_eks5s1wxkgaJpZM4Qapqp.

PatriciaTanzer commented 6 years ago

62 Should now contain a notebook where I've graphed the differences between the median and mean for every semester except Spring 2010. (That one the median stays at 0, so the graph is just a line and useless). You can see how the 0's distort the earliest semesters - the later semesters show a much greater contrast between mean and median usage of each computer.

Edit: the pull request I mentioned isn't working the way I thought it would. Just look for Semester_Analysis.ipynb in Develop

PatriciaTanzer commented 6 years ago

We seem to have taken care of this, closing it for now