info-design-lab / Plaksha_Dataviz

0 stars 0 forks source link

Assignment 2: Burtin’s Antibiotic Data #12

Open venkatrajam opened 2 years ago

venkatrajam commented 2 years ago

In the post-World War II world, antibiotics were called “wonder drugs,” for they provided quick and easy cures for what had previously been intractable diseases. Data were being gathered to aid in learning which drug worked best for which bacterial infection. Being able to see the structure of drug performance from outcome data was an enormous aid for practitioners and scientists alike. In the fall of 1951, Burtin published a graph showing the performance of the three most popular antibiotics on 16 bacteria.

The data used in his display are shown in attached Excel file. The entries of the table are the minimum inhibitory concentration (MIC), a measure of the effectiveness of the antibiotic. The MIC represents the concentration of antibiotic required to prevent growth in vitro. The covariate “gram staining” describes the reaction of the bacteria to Gram staining. Gram-positive bacteria are those that are stained dark blue or violet; whereas, Gram-negative bacteria do not react that way.

Present a graphical representation of these data and an accompanying written description of the graph.

(this assignment is a rerun of the Chance magazine's contest on the 100th birth anniversary of designer Will Burtin who published the first visualization of the data in 1951. Contest winners' submissions are here.)

Check out the last batch's submissions for the assignment.

pujan2000 commented 2 years ago

Name - Pujan Chitalia

Aim: To understand the most effective antibiotic and the dosage requirement to cure a patient suffering from any of the given bacteria.

image

Started with plotting proportions of three antibiotics required for each Bacteria. In this graph some proportions are so low that it is not visible. Also, this type of bar graph represents that we need some combination of 3 antibiotics to diagnose/kill any given Bacteria. The graph type was bar graph since the data is discrete.

image

In this graph, I have used ln(minimum inhibitory concentration) and changed the axis.

image image

For better representation purposes, I plotted the graph using logarithmic scale and added a data table at the end. • The MIC is basically the concentration of antibiotic required to kill the bacteria. So, lower the MIC, better is the performance of the antibiotic. • The MIC is plotted on the x axis and the bacteria is plotted on the y axis. • I have separated the plots on the basis of gram stain in order to check the effectiveness of MIC on the bacteria in case of negative or positive gram stain. • Penicilin seems to be extremely effective on gram positive bacteria. • For gram negative bacteria, Neomycin seems to be most effective. Also, Neomycin’s performance for gram positive bacteria is decent. • Performance of Streptomycin as an antibiotic is not good in gram positive or gram negative bacteria.

Stuti-018 commented 2 years ago

Name- Stuti Pasricha BAD_new

This is the Visual representation of Burtin's Antibiotics data. Three antibiotics' information against 16 germs is revealed by the data set. Please take note that a certain antibiotic is regarded to be more effective the lower the value. I have normalized the data by taking log of each column at base 10. But, that may give negative values and so we added a number (4) to the result to shift all the values at positive axis. I grouped all three medications, plotted their efficacy, and stacked all 16 bacteria on the x axis to enable a visual comparison of all three drugs. By doing so, it would be possible to compare all the antibiotics both in relation to one specific bacteria and to the other 15 microorganisms. I also encode the gram nature by spearing the bacteria as per their category and color encoding is there to differentiate between negative and positive gram stain. Though neomycin was the most successful of the three, I discovered an intriguing pattern: the results were varied depending on whether the samples were Gram Staining Positive or Gram Staining Negative. I discovered that penicillin was ineffective in killing gramme negative bacteria, but was incredibly successful at killing gramme positive bacteria.

kirubanath commented 2 years ago

Name: Kirubananth Sankar

kk

Objective: To give an easy and faster way for the medical professional to check the chart and give the appropriate dosage of antibiotic

Description:

  1. The chart is divided into two parts based on Negative gram stain and Positive Gram stain.
  2. The x-axis is a weight-adjusted logarithmic scale. (weight adjusted to improve visibility)
  3. The y-axis has the types of bacteria.
  4. Three different colors have been allocated to the three different antibiotics we have.
  5. The bold annotation denotes the appropriate shot that needs to be administered for a given bacteria.

Rationale:

  1. The main reason is choosing this was because I was thinking from the perspective of a medical practitioner. I wanted to give a way for them to access the information quickly
  2. The scale was adjusted for the same purpose to create more visibility. But the annotations which are in the bold, which are what they will access are accurate.
  3. Bar chart was chosen rather than any other graph, to reduce the complexity and to order it systematically in alphabetic order, so it becomes easy for them to search.
Beastwham commented 2 years ago

Name: Prashant Kapoor

image

image

Description:

Note: I'd like to mention that while the above graphs do a good job of representing the information, if we were to create a simple instructional infographic for doctors then it'd have to be much simpler.

I tried to look for clusters by plotting them on scatterplot and tried to create an infographic that would serve this purpose.

Here's the infographic:

data viz assignment 2_page-0001

Here's the link for the jupyter notebook in which I plotted the data: https://colab.research.google.com/drive/16-5fRceo8K3Z_h564yw92T0_SJIFKtMz?usp=sharing

vanshu2k commented 2 years ago

Name: Vanshu Saini

dsad12121

Platform Used: Tableau

A different Graph link: https://drive.google.com/drive/folders/1yv5KIcEtBh34qP1muhBi5D4rlBInlS4m?usp=sharing

Description:

-The graph is divided between positive & negative gram staining. -There are three different colors used for each Penicillin, Streptomycin, and Neomycin. -The x-axis on the graph has been adjusted and normalized to the logarithmic scaling of base 10. -Bars are used to show an accurate representation of the data and also to aid in comparing the effects of various antibiotics. -The y-axis has been labeled with the names of different bacteria.

subandwho commented 2 years ago

Name: Subandhu

image

Platform : Tableau Visualization used: Line Chart.

Description:

vivek6311 commented 2 years ago

Vivek Kumar, TLP-23 Positve Gram Negative Gram

Description

  1. It is easy to visualise when you divide the graph into two parts i.e. with positive and negative gram stains.
  2. Penicillin is used in high amounts when gram staining is negative.
  3. Penicillin is used in Low amounts when gram staining is positive.
  4. While the other two components are used in high amounts when gram staining is positive.
  5. The data representation in the form of a column chart is easy to visualise as it also shows the percentage of different components used to represent this.
gaurav639 commented 2 years ago

Name - Gaurav Singh

Burtin's Antibiotic Data Viz Platform Used: Tableau

Communication Goals of my Visualization:

Design rational:

Suvrojyoti-Paul commented 2 years ago

Name- Suvrojyoti Paul Psoitive@2x negative@2x (1)

Visualization used: Bar Chart.

Description:

There are two charts, one is for the positive gram straining and negative gram straining. The x-axis shows percentage proportion Penicillin is shown by green; streptomycin is shown by black, and neomycin was shown by blue, Each bar shows what percentage of penicillin, streptomycin and neomycin needed for each kind of bacteria. Penicillin is more dominating for negative straining Streptomycin is more dominating in positive straining.

Design rational:

I chose this structure because I wanted the user to visualize the combination of antibiotics (penicillin, streptomycin and neomycin) needed for each bacteria straining and what is dominating more in negative and positive gram straining.

anchitna commented 2 years ago

Name - Anchit Narayan

image

image

Platform : Tableau Visualisation : Bar Chart Description :

PratProduct commented 2 years ago

Name: Pratyay Prakhar

The following 3 visualisations have been created to answer some questions by drawing inferences from 'Brutin's antibiotic data'.

Question 1: How does one antibiotic compare to another in terms of efficacy ? Red Modern Line Chart Diagram Produt Data Graph (14)

Design rationale:

Question 2: Which is the most effective antibiotic for each bacteria? assignment_solution_2

Design Rationale:

Question 3: Which antibiotic is effective if only the gram strain of the bacteria is known? Green Minimalist 4 Points Donut Chart Graph (2)

Design Rationale:

anav99 commented 2 years ago

Name: Anav Bhatia Display

Description:

  1. X-axis indicates the Minimum Inhibitory Concentration(MIC)
  2. Y-axis indicates the Bacterial Infection
  3. The graph has been divided into 2 parts for Gram Positive and Gram negative Staining
  4. Logarithmic scale has been used in x-axis.
  5. The whole display has been divided into two parts- one with the graph, the other with the legends and some important information.

I choose this style because it represents the information in a minimalistic way and appeals to the eye.

ghost commented 2 years ago

Name - Savyasachi Pandey

Screenshot 2022-10-19 at 11 26 09 PM Screenshot 2022-10-19 at 11 27 14 PM

Platform - Tableau Visualization - Horizontal Bar Chart

Rationale for bar chart: I chose the horizontal bar chart as in this the labels are easier to display. It shows each data category in a frequency distribution, also it estimates the key values at a glance.

DESCRIPTION:

  1. X Axis depicts the Minimum Inhibitory Concentration (MIC), which has been converted using logarithmic scale to make it easier to compare as the values were varied.
  2. As the value on the horizontal axis goes down, the effectiveness increases and vice versa.
  3. The vertical axis has been differentiated into different bacterias, each showing its MIC with Neomycin, Penicillin and Streptomycin.
  4. Two different graph for positive and negative gram straining makes it easier to read and understand the graph as well as makes the mind mapping easy.
  5. The Visualization above shows that the most effective antibiotic for gram-positive bacteria is Neomycin, whereas for gram-negative, it is Pencillin.
  6. Simple colour coding technique is used with the help of 3 colours, i.e. Orange for Neomycin, Purple for Pencillin and Red for Streptomycin.
Shiv907 commented 2 years ago
Screenshot 2022-10-19 at 11 22 21 AM

Description:

1) Log (base 10) has been applied to all the numerical values of the data. This is done to reduce the variability and normalize the data. 2) The data has been divided into two categories: Positive gram staining and Negative gram staining. 3) It is clear from the data that Penicillin is extremely ineffective as an antibiotic for gram-negative bacteria whereas it does fairly against gram-positive. The indicator used for this inference is the presence of blue color in both charts on the left side and on the right side.

29pranav commented 2 years ago

image

image

Description of the Graph:

1) The data has been divided into two segments: Positive staining and Negative staining. 2) To reduce the large variance inside the data, logarithm (log to the base 10) has been taken. Otherwise, it was difficult to include such a large range (0.1 to 870) in a single chart. 3) Each bacterium's MIC has been compared against all 3 drugs in each of the two charts (Positive stain as well as Negative stain)

Conclusion:

1) The performance of Penicillin is the best amongst all three against gram positive bacteria. 2) The performance of Neomycin is the best amongst all three against gram negative bacteria.

** The drug having least value of MIC for a particular bacterium is the best performing.

ShivayNagpal commented 2 years ago

Name - Shivay Nagpal Graph used- Radial column chart

Artboard 1

Since our data only had $16$ categories of bacteria, I chose a radial column chart to plot the data as $16 \bigtimes 3$ types of antibiotics $= 48$ data points, which can be easily encoded in a radial graph and still remain legible. I initially chose a logarithmic scale ranging from $10^{-3}$ to $10^3$, since all of our data was captured in this range. The problem with this range was that some data points ended up having zero height, which defeated the entire purpose of creating the infographic since a reader will not be able to distinguish the most effective antibiotic (the shortest height bar for the particular bacteria). To fix this issue, I changed the radial axis to vary from $10^{-4}$ to $10^3$, which ensured a minimum height for all the bars.

Aayush4396 commented 2 years ago

image image

  1. I have converted the Minimum inhibitory concentrations (MIC) values and taken the logarithmic values of MIC as the order of magnitude is huge for the data. I get negative values and the min value is -3, so I've added a constant of 5 to all the log values: Converted MIC values => log(MIC) + 5
  2. After having all the values as positive I took the percentage (antibiotic-wise). The y-axis consists of the bacteria type and the x-axis consisted of the converted MIC values. There are 2 separate plots of gram-negative and gram-positive.
  3. Now the effectiveness is decided if the MIC value is low, hence I have highlighted the lowest converted MIC value for each bacteria.
  4. The questions that the plot answers are: a. Which antibiotic to take given the MIC values? b. Comparison of effectiveness of antibiotics when MIC values are given.
SonicShotGaming commented 2 years ago

Name: Priyam Shah

Gram Positive Bacteria (Lesser_Leftmost is better)

Gram Negative Bacteria (Lesser_Leftmost is better)

So what I aimed to do with this visualization is to first split the bacteria into Gram Positive and Gram Negative. This would make it easier for the viewer to easily access the information that they need. What the graphs are depicting is the amount of each antibiotic a bacteria needs to reach MIC, with the lesser amount being the better antibiotic. This means that the bars which extends the most towards the left side is the most effective antibiotic for that bacteria.

I chose to go with a dark theme with very vibrant colours so that the most important information would pop out, contrasting with the dark background. I also chose a bar graph like this because it would be very easy for a viewer to see which bar extends the most to the left and then chose the best antibiotic for the bacteria. In a life or death situation, the viewer might not have the time to process and figure out which antibiotic might save the life of the patient, and having the graphs be so vibrant will draw their eyes to the bar, and finding the leftmost bar is also very intuitive.

The questions that my visualisation answer are:

1) What are the MIC values for antibiotics for certain bacteria? 2) Which antibiotic is the best for each specific bacteria? 3) Which antibiotic can be used for the overall Gram Positive and Gram negative group.

GaneshBShelke commented 2 years ago

Name: Ganesh Shelke

Negative Gram Staining

Positive Gram Staining

vrk7 commented 2 years ago

Name: Vysakh R K Platform used: Power BI Link to Power BI: Performance of three most prescribed antibiotics

image

image

image

The principles which I have taken in mind when coming up with this graph is:

  1. Minimalistic design
  2. Easier for the users to comprehend the graph
  3. Using a line graph to come to an inference, rather than using a complicated graph.

Description: Data from Burtin's paper published in the year is used to create the above graph, which shows the performance of the three most prescribed antibiotics on 16 bacteria. I have used the type of bacteria on the Y-axis and Minimum Inhibitory Concentration (MIC) on the X-axis on a log scale as the scaling factor so that large values can be normalized. My graph illustrates how vaccines affect different types of bacteria from the dataset. Taking this approach was my way of illustrating how each antibiotic contributes to a particular type of bacteria while also showing generalizations. Also, the maximum and minimum line (dashed lines) is mentioned in the graph to show the effectiveness of the drug.

Results: The most effective vaccine for most of the Positive Gram Straining bacteria: Neomycin The most effective vaccine for most of the Negative Gram Straining bacteria: Penicillin Streptomycin is not very effective against Positive gram nor Negative gram-straining bacteria.

AdarshGowda03 commented 2 years ago

Name: Adarsh Gowda Tool used: Tableau

image

Description

  1. I used a bar graph because it allows me to easily compare different sets of data among different groups. The Minimum Inhibitory Concentration (MIC) of antibiotics required for different bacteria is compared here.
  2. The graph design is minimalistic because it includes 'Gram Staining' in the same graph that is differentiated by two colours, making it easier for the user to analyze the data in a single graph.
  3. I used a logarithmic scale for effective comparison the data was skewed due to large values.
  4. When compared to the concentration required for active growth, we find that Neomycin is more effective for negative gram staining.
  5. Except for two bacteria, we found penicillin to be more effective against most bacteria when stained with positive gram staining.
promwonka commented 2 years ago
Screenshot 2022-10-19 at 5 54 03 PM

Name : Pramod Krishna Title: Effectiveness of antibiotic wrt gram strain

Description:

  1. I segregated the positive and negative gram strain into 2 sections of the graphs with 2 different colours.
  2. Also labels have been given for clear unerstanding.
  3. The x-axis represents the MIC and the y-axis represents the Bacteria.
itsnitiz commented 2 years ago

Sheet 2

Submitted by Nitish Mallick

Description:

BhanuN1997 commented 2 years ago

Name: Bhanu Nalluri Platform: Excel

Technical limitation(Edited on the 20th): I couldn't increase the text size and contrast due to technical limitations (spoke with the professor on this in class). But the underlying idea is to present everything in one chart and differentiate based on the family strain

Research Methodology: I read through a few articles and research papers to understand the concept of strains and the significance of MIC values.

Findings and implementations:

Good vs Bad: We should not assume that lower MIC values are better. The ideal values depend on the ingredients, and complexity of manufacturing, and the reactions of the API. So, I have avoided mentioning what's good and what's bad while benchmarking MIC values

Units used: I have used µg/mL for measuring MIC concentration. MIC is generally represented in µg/mL (1 µg = 1000 mg)

Importance of MIC values: A slight change in MIC numbers even at the decimal level may impact the effectiveness of the drugs. So I felt it's important to represent the raw data. Plotting a histogram based on ranges along with raw data will give a better picture to causal readers

Users: The Viz. is targeted at both normal users and users from the healthcare industry. I have mentioned the raw numbers below the Histogram for medical users as they prefer to know the exact MIC values of each drug while benchmarking. I have benchmarked the MIC values based on ranges in the form of a histogram to provide a snapshot for normal users.

Constraints: The given data is relatively hard to represent in one picture. I felt it's important to represent the whole data in one picture, so it will be easy for the user to benchmark different values just by looking at one picture

Aesthetics: To avoid unnecessary distractions for the readers I have avoided the usage of fancy designs/pics and presented the data in a minimalistic format.

Untitled

monikrish2698 commented 2 years ago

Name: Monish Krishnan Tool Used: Power Bi

image

I initially thought of representing the data using a stacked bubble chart or vertically stacked bar chart. While designing them, I realized it could lead end users to misinterpret bubbles and stacked bars. I wanted to design a visualization that should be easy for viewvers' eyes but also help them gain some insights. To achieve my viz-representation goals, I finally chose vertically bar charts with the different bacteria on the y-axis, MIC values on the x-axis, and the antibiotics on the legend. I differentiated them by their gram stain test output and placed them side to side to make comparison easy. It was interesting to see the increased concentration of Penicillin in negative gram stain and almost zero concentration of Penicillin in positive gram stain.

My Design principles:

  1. Easy to viewers' eyes
  2. Insights
  3. Minimalistic design
Pragyawasan commented 2 years ago

Chart type- Bar chart Tool- Tableau

Burtin 6

ada-nai commented 2 years ago

Name: Adarsh Nair Tool used: Matplotlib, Seaborn (Python3)

image image image

walia-muskaan007 commented 2 years ago

Name - Muskaan Walia Chart Type - Bar Chart Tool - Tableau

Screenshot 2022-10-19 203200

I have used a vertical bar graph to differentiate between the efficacy of the antibiotics on the Gram Staining results to find the minimum inhibitory concentration (MIC) on the logarithmic scale.

tirthatilakpani commented 2 years ago

Antibiotic Data Set visualization.pdf Name- Tirtha Tilak Pani Important clarification- In the case of the Bacteria Proteus Vulgaris, we see that the MIC values for Streptomycin and Neomycin are the same. I have chosen Neomycin over Streptomycin in that case because Neomycin is the mode in the case of the second visualisation. While I feel in the initial stages, the distinction between Gram positive and Gram negative is essential, as we progress, I think that it does not have as much practical impact on data analysis as the three anti-biotics.

ShashankPrasad0506 commented 2 years ago

Name: Shashank Prasad

Positive Gram Staining

Negative Gram Staining

The y-axis represents the categories of bacteria. The x-axis represents the MIC (Minimum Inhibitory Concentration) for each antibiotic on a log scale.

Bar chart is used to effectively show the comparison of the Minimum Inhibitory Concentration (MIC) of the antibiotics required for different bacteria. The scale is changed to log scale to make it easier for visualization. The antibiotic which has a lower value against a bacteria works better against that bacteria.

Two separate charts for positive and negative gram staining are shown for easy visualization and less clutter.

Tamanna134 commented 2 years ago

Name: Tamanna Sahu Antibiotic_Data

Platform used: Tableau Bar chart has been used to compare the result of three different antibiotics on each of the 16 bacteria. It's easy to compare which antibiotic has been more efficient by the length of the bar for that particular bacteria. On x-axis, MIC values have been normalized to logarithmic base 10. On y-axis, bacteria have been labelled and arranged in alphabetical order along with mention of positive/negative gram bacteria type.

Harsss commented 2 years ago

Burtin's Antibiotic Data

dv21 DV22

abdksyed commented 2 years ago

Name: Syed Abdul Khader Tool Used: Python - Seaborn

Artboard 1

The y-axis represents the nominal data which are the categories of the bacteria. The x-axis represents the MIC(Minimum Inhibitory Concentration) for each antibiotic on a log scale.

For each bacteria, we have all three antibiotics plotted, with their values, we can pick the antibiotic as better for that bacteria which has a lower value. So the visualisation is made accordingly to help with that.

Also, the gram strain is binary, so it is visualised as the background colour and the bacteria's font colour where the positive is green and the negative is red to match with the user's mental model.

kishlaykumar1995 commented 2 years ago

Name - Kishlay Kumar Chart Type - Bar Chart Sheet2

Description

AyushiNM commented 2 years ago

Name - Ayushi Mittal Title - Burtin's Antibiotic Data Visualization Platform used - Tableau

Book1

Screenshot (162)

Screenshot (163)

AdityaPandey0901 commented 2 years ago

Visualizations

https://docs.google.com/drawings/d/1v6qNvKe_6EosIzH6Hos-t9_tcV7Lmab65hIgaokQP44/edit?usp=sharing

Explanation:

The larger goal of this visualization is to provide a holistic view of

a) How Gram +/- has an effect on the MCI's required for each drug.

b) The comparative MCIs of each drug to combat each bacterial agent.

To achieve this, I've opted for a set of 3 plots:

1) The first of these is a 3d autorotating plot, stored as a gif. This might stretch the definition of a "static visualization" (Note the definition we were given in class was no user interactivity, which gifs statisfy), but in doing so it provides a holistic representation of Gram and the role it plays in the efficacy of different drugs. For the researcher, this provides an excellent insight into an otherwise unseen trend.

2) The second and third, split each bacterial agent by Gram positivity, and provide multi-bar plots side by side for each. The bars are in high contrast colors, with the idea being that a doctor can easily look at the bacterial agent they want to combat and instantly weigh their options.

The axes chosen everywhere are logarithmic, and I've used plotly for the entire viz.

HimanshuBhardwaj2398 commented 2 years ago

Name: Himanshu Bhardwaj Title: Comparing Effectiveness of Antibiotics Platform/Tool used: Jupyter Notebook(pandas, matplotlib )

image

Comments:

1.) Data contains MIC for three antibiotics(Penicilin, Streptomycin and Neomycin) for various gram- positive and gram-negatuve bacteria. The most effective drug for a particular bacteria would be the one, having least MIC value 2.) We can present the data in a form suitable to compare all three anti biotics for all kinds of given bacteria. The most intuitive choice for comparing seems to be a bar plot. For each bacteria, three bars represent the amount of MIC required and the drug corresponding to the bar with the least value will be the most effective.
3.) As MIC values vary a lot, values are modified using log(MIC)(base 10) to convert them into comparable ranges. Hence, wherever values are not present in the plot, it means a MIC of 1 4.) Looking at the visualization, a general trend can be observed: Penicilin is observed to be most effective(out of the given three drugs) against gram-positive bacteria and Neomycin is the most effective drug for gram-negative bacteria

Kunalbhardwajj commented 2 years ago

Name - Kunal Bhardwaj Title - Burtin's Antibiotic Data Visualization Platform used - Python gram(-ve) Based on this plot, Neomycin is more effective against gram -ve bacteria than the other two. gram(+ve) Based on this plot, Penicilin is more effective against gram +ve bacteria than the other two, here it seems like that the plotted line for Penicilin is passing through 0 but that is due to the normalizing effect, in actual its concentration needed to fight against gram +ve bacteria is very less in comparison with the other two, so the plotted line lies slightly above 0, not passing through 0.

Description:

asheesh4545 commented 2 years ago

Name - Asheesh Kumar Singh Chart Type - Bar Chart

fdfds Tools Used : Tableau

Description :

Tejasvi337 commented 2 years ago

Name: Tejasvi Kumar Singh Tool: Plotly

antibiotic_performance2

The aim was to provide a descriptive visualisation of the performance of three antibiotics with minimalist aesthetics

Design rationale and features

SumanthSonnathi commented 2 years ago

Name: Sumanth Sonnathi Title: Burtin's Antibiotic data analysis

image

Description: -> Data visualization happens to be easy when appropriate data has been inscribed in the graph. -> For every bacteria the amount of MIC needed has been captured and plotted in the graph with appropriate color coding for the antibiotics and the gran strain has been specified with the bacteria.

malhotra-bhavini11 commented 2 years ago

heatmap_1

The purpose of the visualization is to produce insightful and clear observations on the set of comparative data regarding antibiotic efficacy. There are 3 variables, 2 of which are categorical and 1 of which is numerical. The latter is the Minimum Inhibitory Concentration (MIC) which is the smallest concentration or volume required to halt bacteria growth. Gram staining is a test which indicates whether the bacteria colors blue (positive) or pink (negative) and is a quick way of separating the data into 2 classes.

With MIC on a logarithmic scale, we can observe through the data that the larger the concentration, the less effective an antibiotic is at inhibiting bacterial growth. We can also observe from the data that the MIC values have a huge range - from 0.001 to 870 so I thought to transform the values logarithmically. The effect of that is that the values are more evenly spread out and also reflect the log of the concentration as the time to grow the bacteria(antibiotic cultures grow exponentially) This visualization aims to answer 2 questions:

  1. Does the bacteria group by resistance ?
  2. Do different drugs correlate?

In order to answer the 1st first question, I have grouped bacteria into 2 groups: positive and negative gram staining within the visualization. This has been done by sorting the data so the values are grouped together and shaded in order to make a clear demarcation between the 2 groups. Another way this avenue could be explored is to group certain genus together (eg. Brucella abortus and Brucella anthracis ). I think that the table of data itself produced a clear image of the data except for the fact that any possible patterns are obscured by the sorting in alphabet order.

In order to answer the second question, I assumed a good choice for the visualization would be a heat map. In a heatmap, every number is represented by a switch of color. It could help in identifying extreme values and other patterns. In this case, the gradation of blue allows the viewer to quickly find extreme values without any difficulty. Additionally, the different antibiotics are the categories, so while it is easy to find extremes in the overall data, it is also easy to do so within one category by scanning down a column. From a horizontal perspective, it is easy to compare the MIC of the three different antibiotics for 1 particular pathogen.

As a statistics major, I believed that clustering the bacteria which react similarly to a particular antibiotic would be helpful from a research standpoint. Below is the dendrogram based on Fig 1. which was obtained through cluster analysis- it resembles Fig 1. However similar bacteria are grouped together which suggests further research in this area. Clustering can help build associations between data points and their features.

The graph thus shows:

Compared to other choices of representation such as stacked bar charts, it might be hard to distinguish the colors and it would be hard for viewers to overcome the ingrained mindset that the bars aren’t stacked but actually mean to show a shorter bar in front of a longer bar. There might have also been places where 2 bars coincide. This situation would decrease the readability of visualization since it would be difficult to differentiate the values of three antibiotics for 1 pathogen.

Meanwhile, using a grouped panel chart for all three antibiotics could cause sorting issues. This would cause confusion between sorting via efficacy for streptomycin or neomycin etc. given that the chart has 3 panels, each for one antibiotic. These 2 approaches discussed also do not cover the gram-staining characteristic of the bacteria that was present in the dataset.

malhotra-bhavini11 commented 2 years ago

final This is a representation of what manner the heatmap would appear like with the addition of clustering. This was the first time I did clustering analysis, so it is just acting as a demonstration of the type of graph as there is possibility for a mistake in this area.