Open Jimmi-Kr opened 8 months ago
Amongst the 24 tools explored by the Lisa Charlotte Rost, I have decided to try these 5:
I have tried to ensure that the 5 chosen tools belong to as many quadrants as depicted in the graph as possible.
I used weight on the x-axis, the number of cylinders on the y-axis, bubble size to represent horsepower, and color gradient for acceleration to create a focused analysis on the interplay between the physical attributes of the car (weight and engine size) and its performance characteristics (horsepower and acceleration).
This configuration allows you to directly assess:
How weight affects acceleration and horsepower.
How the number of cylinders relates to horsepower (which often increases with more cylinders) and acceleration.
Whether heavier cars have more horsepower or if they tend to accelerate more slowly due to the increased mass.
Horsepower as the size of the bubble could provide a very intuitive visual cue because we naturally associate "bigger" with "more powerful".
A color gradient for acceleration could work well since it could range from cooler colors for slower acceleration to warmer colors for faster acceleration.
I have used Matplotlib in a Jupyter Notebook for this chart. Used numpy for numerical pre-processing, like scaling, adding jitters to avoid overlapping. Used Matplotlib's Scatter plot for plotting, and RedS Sequential Color for color coding.
Used scaleLinear() of D3.js for X-axis (weight) and Y-axis (no. of cylinders), scaleSequential() for colors (acceleration), and scaleSqrt() for size of the bubble. Then, I've simply used svg circles to build the bubble chart with 0.7 opacity.
The code and description is pretty similar to the matplotlib graph. I have used scatter() of plotly.graph_objects.
I have used Bubble Chart of Google Sheets chart. Since Sheets does not support color gradients natively, I have binned the acceleration values, and manually assigned colors to simulate color gradients.
It is a drag and drop interface.
Name - Saikat Samanta Roll - 21f1003501
Plotting Miles per gallon vs Weight for the graphs from the Given Dataset
Google Sheets:
RAWgraphs:
Tableau:
PowerBI:
Matplotlib:
Name - Shri Krishna Pandey Roll No. - 21f1006966
Dataset Used - Automobile Dataset
Given dataset had 9 columns:
To find the relation between Horsepower and mileage (mpg) based on number of Cylinders.
Preprocessing:
Data Used:
Parameters used in Chart
Detail about the graph: X-axis (MPG) is in range of 5-50, Y-axis (Horsepower) is in range of 0-250 and dots are colored based on number of cylinder (i.e. 3,4,5,6,8)
Detail about the graph: X-axis (MPG) is in range of 0-50, Y-axis (Horsepower) is in range of 0-250 and dots are colored based on number of cylinder (i.e. 3,4,5,6,8)
Detail about the graph: X-axis (MPG) is in range of 0-49, Y-axis (Horsepower) is in range of 0-240 and dots are colored based on number of cylinder (i.e. 3,4,5,6,8)
Detail about the graph: X-axis (MPG) is in range of 5-50, Y-axis (Horsepower) is in range of 0-240 and dots are colored based on number of cylinder (i.e. 3,4,5,6,8)
Detail about the graph: X-axis (MPG) is in range of 5-50, Y-axis (Horsepower) is in range of 45-240 and dots are colored based on number of cylinder (i.e. 3,4,5,6,8)
Name: Debapriyo Saha Roll No.: 21f1004645
The five visualization tools that are being used for plotting are the following:
Dataset used: The dataset contains details of different cars, along with it's different features and also it's miles per gallon (mpg) values.
x-axis: Acceleration y-axis: Weight
Color scale is used by Origin and Size of the bubbles are based on the acceleration value (more the acceleration larger is the size of each bubble)
Name: ROYCE TOMY (21F1001916)
1) MS Excel
2) Power BI
3) Tableau
4) Plotly (library)
5) Altair (library)
Name: Soumya V Namboodiripad Roll Number: 21f1004752
Dataset: auto-mpg (Automobile Dataset)
Variables Used: mpg, horsepower
Type of Chart: Scatterplot
Name: Manaswita Mandal Roll no: 21f1004567
The five visualization tools that I have used for my analysis are:
Data: The dataset contains details of different cars, along with it's different features and also it's miles per gallon (mpg) values. I tried to find some correlation between some of the variables to find whether they are affecting the mpg of different cars. Attributes used for analysing:
Following charts are plotted:
Name: Kaushik V Roll No: 21f1001083
Observation from the chart: One can see the median fuel efficiency has significantly improved over the years and also the range band has gotten shorter, indicating, irrespective of the brand and specs, car manufacturers have tried to become more fuel efficient across categories.
Tool: Seaborn / Matplotlib
Tool: Plotly
Tool: Vega-lite / Altair
Tool: Tableau
Tool: ggplot2 / R
Name : Viraj Sharma Roll No: 21f1003723
For this assinment, I used a dataset auto-mpg.csv
to make a bubble chart with five different tools. A bubble chart is like a scatter plot, but it uses circles to show more information. I picked the mpg
(miles per gallon) and displacement
(engine size) variables from the dataset, and the size of the bubbles was determined by the cylinders
of the car engines.
The 5 tools that I have used are following:
Plotly:
Altair:
Matplotlib and Seaborn:
Google Sheets:
Bokeh:
I learned that different tools are better for different jobs. Some are good for making simple charts fast, others are better for when you need to make something very detailed, and some are best when you want to put a chart on a website. Which tool to use depends on what we need the chart to do.
Puravasu Jaideep Sesha 21f1000162
Dataset : Data
Seaborn
Plotly
Tableau
Raw
Altair(Vega)
I wanted to try out Datawrapper, but they do not offer boxplots, since they are not very well known to the general public. They offer alternatives like range plot to make up for it. 😊
Variables used:
Tools used:
Raw graph:
Flourish:
Datawrapper
Matplotlib:
Plotly:
Google sheet:
Yalini S 21f1004138
Variable Used:
Tools Used:
SEABORN
TABLEAU
FLOURISH
PLOTLY
POWER BI
Anushka Aggarwal 21f2000407
Name: Ajeet Kumar, Roll Number: 21f1006807
Scatter Plot: Relation between MPG (miles per gallon ) and Horsepower on the number of Cylinders
Variable Used:
x-axis: MPG y-axis: Horsepower categories color: Cylinder
Tools Used:
Excel Tableau Flourish Plotly PowerBI
Excel
Tableau
Flourish
Plotly
Name : Jallipalli Phani Kumar Roll No : 21f3002478
This is a comparative evaluation of scatter plots generated for the relationship between "Cylinders" and "Displacement" using various visualization tools. The purpose of this evaluation is to assess the strengths and weaknesses of each tool in creating effective visualizations.
Tools Used: Excel Flourish Bokeh Orange Tool Plotnine
Dataset: The dataset used for this evaluation is the "auto_data.xlsx", containing information about automobile specifications including cylinders and displacement.
Findings:
Excel: Excel provides a user-friendly interface for creating scatter plots. The scatter plot generated in Excel allows for basic customization but lacks advanced features. It is suitable for quick visualization tasks but may not be ideal for complex visualizations.
Flourish: Flourish is an online tool for creating data visualizations with interactive features. The scatter plot created in Flourish may offer more interactivity compared to other tools but requires uploading the data to the platform.
Bokeh: Bokeh is a Python library for creating interactive visualizations. The scatter plot generated with Bokeh allows for customization and interactivity, such as zooming and panning. Bokeh is suitable for creating professional-grade visualizations with programmable features.
Orange Tool: Orange is a data visualization and machine learning tool with a graphical interface. The scatter plot created in Orange offers basic visualization capabilities with limited customization options. Orange is user-friendly and suitable for users with less programming experience.
Plotnine: Plotnine is a Python implementation of ggplot2, a popular plotting system for the R programming language. The scatter plot generated with Plotnine offers a high level of customization and follows the grammar of graphics principles. Plotnine is ideal for users familiar with ggplot2 syntax and seeking advanced visualization capabilities.
Name: Rohan Khandelwal Roll Number: 21f1005976
Scatter Plot: Relation between Weight, acceleration and model year
Axes description: x-axis: weight y-axis: acceleration categories colour: model year
Tools: Flourish R studio Matplotlib & Seaborn DataWrapper Plotly
Flourish
R studio
Plotly
Matplotlib & Seaborn
DataWrapper
Name - Chandana Nisankara Roll Number - 21f1005727
I have chosen Mpg(miles per gallon) and weight from the data , to understand how a car's weight impacts its fuel efficiency. Heavier cars have lower Mpg due to increased energy requirements.
Tools / Applications that i have chosen to represent the data are : 1.Google sheets (Application) 2.Orange (Application) 3.Plotly (Library) 4.Seaborn (Library) 5.Tableau (Application)
1. Google sheets :
2. Orange :
3.Seaborn:
4.Plotly:
5.Tableau :
Name: Om Sharma Roll No.: 21f1004424
Variables used: MPG and Weight
Tools Used: a) Matplotlib
b) Seaborn
c) Plotly
d) Excel
e) Datawrapper
Name: Amol HATWAR Roll No.: 21f1000451
From the given dataset, three columns were chosen to be encoded. These were:
A bubble chart was chosen as it would allow easy extension by encoding of additional data like horsepower, colours could be used for region of origin etc.
While most tools were simple and easy to use, using seaborn offered the most flexibility. However, the tool requires some Python programming skills. At the same time, using Google Sheets was a bit unwieldy as it would not offer precise control of the bubble size. This caused overlapping bubbles and made the resulting visualisation messy.
Name: DEENA GAUTAM Roll No.: 21f1001012
A scatterplot was chosen to showcase the relationship between 3 variables.
Variables used: X-axis : horsepower Y-axis : acceleration legends : cylinders
1. Power BI (tool)
2. Plotly (library)
3. Altair (library)
4. Flourish (tool)
5. Tableau Public (tool)
I have done a scatter plot on the given dataset with the three features mentioned above.
Matplotlib
Plotly
ggplot2
Flourish
Google Sheets
The tools Matplotlib, Plotly and Flourish were very easy to use. In the case of Google Sheets, it was not possible to color the dots based on the value of the number of cylinders, this prompted the split of the feature in X-axis on the basis of number of cylinders
Name: Priyanka Nathani Roll No: 21f1005807
Dataset details: The dataset consists of 398 rows of data. The data contains numerical values for miles per gallon, number of cylinders, horsepower, weight and acceleration of the car models with their model year and version (marked as origin).
My approach: From the car names, I split the values to get names of car manufacturing companies. Thereafter I plotted ‘Cylinder-wise Cars produced by Companies’ with Car companies on one axis and number of car models with 3, 4, 5, 6 or 8 cylinders.
Choice of graph: I have chosen stacked bar for finding this relationship.
Tools used:
1. Excel****
2. Tableau
3. Power BI
4. Datwrapper
5. Matplotlib
6. RawGraphs
Chart Explanation:
Insights into tools:
2. Tableau a. Easy to use tool. A lot of animation etc and different types of graphical representations are inbuilt. b. Instead of stacked chart the in-built recommendation was multiple graphs. c. This type of graph gave excellent visualization in terms of which manufacturer prefers to deal with cars of how many cylinders, which manufacturer has produced maximum cars in any given category etc.
Power BI a. Also, not very difficult to use. Although some hands on is required before the tool can be utilized to maximum. b. I liked that the stacked chart here has normalized all the company data to 100%. Therefore, the comparison here between which company releases more cars in which category is simple.
Datawrapper a. The tool is very user friendly. Beautiful visualizations can be created in the very first go. b. The chart attached here has ease of understanding as it is easy to play around with colours making separation between categories stark. c. Details like source of data etc add to the authenticity of the chart
Matplotlib a. The use of this tool requires coding in python. b. The tool has some limitations like regarding placement of legends etc c. If the data is very large and requires cleanup, it can be a very useful tool.
6. RawGraphs a. The tool is very easy to use b. Does not take empty cells while tools like Datawrapper takes empty cells as well c. Limited possibilities, e.g., I could not change the direction of xticks.
P.s. I had uploaded the graphs much before the deadline. I has missed the sentence that either excel or matplotlib can be used. I just learnt about it today. Therefore, today I uploaded the 6th graph using RawGraphs. I request you to kindly consider the submission of this graph as well. Thank you and Regards, Priyanka Nathani
Name - Rajkishore Nandi Roll No - 21f1006016
The five visualization tools used are :
Variables Used :
Plot :
Plotted scatter plot with Weight on X-axis, Mpg(Miles per Gallon) on Y-axis and colour gradient for Acceleration to find out the correlation between the three variables.
GGPlot using R
Plotly
Power Bi
Flourish
Matplotlib
About the data - The dataset auto-mpg.csv provided describe the various attributes of different car models. The dataset comprises information about multiple car models, with features including their fuel efficiency (mpg), engine specifications such as cylinder count, displacement, and horsepower, along with weight, acceleration, manufacturing year, origin, and car name. Each row represents a distinct car model, and the dataset provides a comprehensive overview of these vehicles' key characteristics.
The Five tool/libraries are listed below that I explored and used for data visualization on some of the key observation and fearures on the dataset.
Here are few of the charts and visualization created using the above five tools/libraries:
Pandas profiling earlier it was called ydata-profiling gives a very good initial overview of the data. Together with statistical information on the dataset it provides some of the key charts to visualize the data. Heat Map Word Cloud Image :- Word cloud image on the car name.
Correlation chart beetween 'mpg' and 'weight' feature, ploted using Flourish.
Name: Prakhar Bansal Roll No.: 21f1003810
Title- Relationship between Weight and MPG based on number of Cylinders
Tools Used-
Variables used- Y-axis - Weight X-axis - MPG (Miles Per Gallon) Color - Number of Cylinders
Altair (Python):
This scatterplot visualizes the relationship between acceleration and mileage (mpg) based on the number of cylinders using Altair, a Python library for statistical visualization. Each point represents a car in the dataset, with the X-axis showing horsepower, the Y-axis showing mileage, and colors indicating the number of cylinders. The plot provides a clear overview of how the mileage varies with acceleration across different cylinder configurations.
import altair as alt import pandas as pd alt.Chart(df).mark_point().encode( x='acceleartion', y='mpg', color='cylinders:N' ).properties( title='Horsepower vs. Mileage by Number of Cylinders (Altair)' )
Flourish:
The Flourish scatterplot displays the correlation between acceleration and mileage (mpg) categorized by the number of cylinders in the car. Each data point represents a car, with acceleration on the X-axis, mileage on the Y-axis, and cylinder count indicated by color. The interactive nature of the plot allows users to hover over points for specific data values and explore how mileage relates to acceleration across different cylinder configurations.
Power BI:
The Power BI scatterplot illustrates the relationship between acceleration and mileage (mpg) based on the number of cylinders in the car. Each point on the plot represents a car in the dataset, with acceleration plotted on the X-axis, mileage on the Y-axis, and cylinder count represented by color. Users can interact with the plot to filter data or drill down into specific details, making it a versatile tool for exploring the relationship between these variables.
Seaborn (Python):
This Seaborn scatterplot visualizes the correlation between acceleration and mileage (mpg) categorized by the number of cylinders in the car. Each point represents a car, with acceleration on the X-axis, mileage on the Y-axis, and cylinder count indicated by color. The plot provides insights into how mileage changes with acceleration across different cylinder configurations, with Seaborn's built-in styling and aesthetics enhancing the presentation of the data.
import seaborn as sns import matplotlib.pyplot as plt sns.scatterplot(data=df, x='acceleration', y='mpg', hue='cylinders') plt.title('acceleration vs. Mileage by Number of Cylinders (Seaborn)') plt.show()
Tableau:
The Tableau scatterplot depicts the relationship between acceleration and mileage (mpg) based on the number of cylinders in the car. Each point on the plot represents a car in the dataset, with acceleration plotted on the X-axis, mileage on the Y-axis, and cylinder count differentiated by color. Tableau's intuitive interface allows users to explore the data dynamically, enabling interactive analysis and visualization of how mileage varies with acceleration across different cylinder configurations.
Name: Aarya Motiwala Roll Number: 21f1003998
Scatter Plot: Relation between MPG and Displacement
Axes description: x-axis: MPG y-axis: Displacement
Tools: Matplotlib Seaborn Excel PowerBI Bokeh
Excel- Excel is a widely accessible tool that offers intuitive interfaces for plotting data through its spreadsheet environment. Plotting MPG vs. Displacement in Excel allows users to quickly visualize trends and patterns, with easy-to-use formatting and styling options. However, Excel's capabilities are more limited for advanced statistical visualizations compared to Python libraries.
Power BI- When plotting MPG vs. Displacement, PowerBI enables users to create dynamic and interactive dashboards. Its strength lies in data manipulation, sharing capabilities, and integrating plots into comprehensive reports for decision-making.
Seaborn- Built on top of Matplotlib, Seaborn simplifies creating complex visualizations with more aesthetically pleasing defaults and a variety of plot types designed for statistical analysis. When plotting the same dataset, Seaborn automatically manages finer details like plot style and color palettes, making it easier to generate more informative and visually appealing plots.
Matplotlib- Matplotlib offers a highly customizable framework for creating a wide variety of plots in Python. When plotting MPG vs. Displacement, it provides a straightforward approach, focusing on clarity and simplicity. Users can directly manipulate plot aspects like size, labels, and color, but interactive capabilities are limited compared to tools like Bokeh.
Bokeh- When plotting MPG vs. Displacement with Bokeh, it provides users with interactive elements like hover tools, zooming, and panning, enhancing the exploratory analysis experience and making the data more accessible and interactive for end-users.
Tools Used-
Matplotlib Seaborn Pandas Flourish Datawrapper
Variables used- Y-axis - Weight X-axis - MPG (Miles Per Gallon) Color - Model Year
Dataset details: The dataset comprises 398 data entries, each representing various attributes of car models. These attributes include numerical values for factors such as miles per gallon, number of cylinders, horsepower, weight, and acceleration. Additionally, the dataset includes information about the model year and origin of each car version.
My approach: There are many interesting features and bivariate relationships between features from the table.The relationship between MPG i.e Miles Per Gallon and Weight will be a very interesting relationship to observe i.e as the weight decreases so will the mpg increase i.e lighter cars should ideally be able to go longer miles per gallon as the body mass and weight shall be lower for the car.And colouring the above trend based on the Model Years would also be very interesting as a hypothesis i.e throughout the years the cars become lighter,stronger and faster..due to the advancement of technology and the change of liking as per the time.
Choice of graph: Scatter Plot is the best graph to observe the trends.These points are put up in the graph and we can clearly see that these points showcase a trend..i,e downward trend...coloured according to the Model Year.
We observe as the Years pass the hypothesis is validated i.e with years cars become faster,lighter and stronger and we can see it clearly here.
Here are the trends as per the Tools.
Name : Manish Kumar Roll : 21F1006597 Dataset : link
Draw a scatter plot between horsepower and acceleration using these different tools
P V Shabarish 21F1001346
Here we will see the plotting of the vehicle Weight and MPG (Miles per Gallon) with the number of cylinders in that vehicle by using scatter plot and observing the differences among different visualization tools for the same plot.
Features Used:
Tools Used:
Here I have taken the dark backgrounds for all plots because as this is posted in GITHUB, it's entire background is in the dark. So, it will be visually compelling when someone observes the plot without much stress to the eyes. Also, the color for data points are chosen in a way that it can easily notice.
In Matplotlib, the entire plot comes with very thick borders and labels. The data points are also a little blurry in nature. I have used the spring color palette for this plot.
In Seaborn, the quality of the plot is very good. The data points are also in appropriate size with a decent opacity in the centers which makes the data points differentiable and can be noticed easily. I tried increasing the dpi in matplotlib but the image quality still looks the same but for the seaborn, it works perfectly fine. For seaborn also, I have used the spring color palette.
In Plotly, it mainly offers interactive plots. Although the quality of the plot looks a little low but the features that it has with the interactive zooming, box select, lasso select the data points that we are interested in.
All these are visualization libraries that are mentioned above. All these libraries have the color palette inbuilt but most of the color palettes are with light color on one end and darker on the other end. As I have selected the darker background, light color palettes will be well suitable for my visualizations. Hence, I preferred the spring palette in Matplotlib and Seaborn. But unfortunately, there is no spring palette in Plotly. As I want to show the differences among various tools, I wanted it to be in the same palette for every tool. For this issue, I have taken the RGB values for the spring palette from Matplotlib and Seaborn and have used those values manually in Plotly and other tools that we are going to see next.
Google Sheets are mainly useful for quick and simple generated charts. In sheets, the scatter plot simply takes two variables and plots it with a single color. There is no direct option to consider the 3rd feature and color it accordingly. To achieve it, we have to split the mpg feature into multiple features according to the cylinders.
Flourish is well popular for visualizations. It is very easy to implement. Copy pasting the data in the desired visualization template and can customize the chart according to our needs. But still it offers only a limited number of visualizations.
Conclusion: After these plotting in different tools and observed that if we are using visualization libraries, I would recommend Seaborn or if we want interactive visualization from the libraries, we can go for Plotly. From the visualization tools, google sheets can be useful only for some quick plots by analyzing or filtering the datasets. We can achieve some rough visualizations from Google Sheets and can apply those into bigger visualization tools like Flourish. If we are planning for more unique and different visualizations which are not offered by any of the tools above, we can go for some other tools like Tableau, Power BI etc.
Name: Kruthiventi M R S Sai Charan Roll No: 21f1004450 Level: BSc. Level Email: 21f1004450@ds.study.iitm.ac.in Data: auto-mpg.csv
1. Matplotlib(Python):
2. Bokeh(Python):
3. Plotly(Python):
4. Fourish(Web-Based Open Source):
5. Altair(Python):
Name: Gokulakrishnan B Roll No: 21f1006866
The first thought that came to mind is that heavy weight vehicles consume more fuel to move, thus the mileage will be low. My intuition is that mpg is inversely proportional to weight. Higher the mpg, lighter the vehicle. So I am going to plot mpg vs weight scatter plot to observe the pattern and validate my assumption.
Name : Gaikwad Sanket Sanjay Roll no: 21f1007096
Title- Relationship between Horsepower and MPG based on number of Cylinders
Features Used: X-axis - mpg Y-axis - horsepower Legend - cylinders
1) Seaborn
2) Power BI
3) Flourish
4) Plotly
5) Datawrapper
Name - Aditi Krishana Roll No. - 21f1004270
Dataset Description
Dataset Used - Automobile Dataset
Given dataset had 9 columns:
Purpose:
The goal is to explore the relationship between horsepower and mileage (mpg) based on the number of cylinders in the car. We want to understand how these factors are interconnected and whether the number of cylinders affects the fuel efficiency of the vehicle.
Visualization type : Scatterplot
Preprocessing Steps:
Data Summary:
This scatter plot shows the relationship between horsepower and mileage (mpg). The points cluster around a downward trend, it suggests that cars with more horsepower tend to have lower mpg.
Name: Mukesh K Roll no: 21F1000478
The dataset contains automotive fuel economy (in miles per gallon or mpg) and associated vehicle characteristics such as cylinders, displacement, horsepower, weight, acceleration, model year, origin, and car name.
Features used: X-axis: mpg (Miles per gallon) Y-axis: engine displacement Legend: no: of cylinders
The following tools were used to plot the data containing engine displacement vs mpg (miles per gallon) • Matplotlib • Seabon • MS Excel • Tableau • Matlab
1. Matplotlib****
2. Seaborn****
3. MS Excel
4. Tableau
5. Matlab
Name : Abhishek Gupta Roll No: 21f1004820
For this task, I utilized the auto-mpg.csv dataset to generate a bubble chart across five distinct platforms. A bubble chart, akin to a scatter plot, employs circles to convey additional data. Specifically, I selected the variables of mpg (miles per gallon) and displacement (engine size) from the dataset. The size of the bubbles corresponded to the mpg values of the car engines.
I have used these variables because they have high correlation. The 5 tools that I have used are following:
Flourish Datawrapper Matplotlib & Seaborn Excel Datastudio
Data Wrapper:
DataStudio
Excel:
Flourish:
Matplotlib :
Name : ADITYA DHAR DWIVEDI Roll No: 21f1001069
For this assignment, I used a dataset auto-mpg.csv to make a bubble chart with five different tools. I picked weight and displacement variables from the dataset, and the size of the bubbles was determined by the cylinders of the car engines.
The 5 tools that I have used are following:
In this comparative evaluation, we explore the relationship between Cylinders
,Displacement
, Weight
, and Acceleration
using five different visualization tools: Excel, Flourish, Datawrapper, Seaborn, and Matplotlib.
Description: Excel offers a user-friendly interface for creating scatter plots. While it lacks advanced features, it's suitable for quick visualizations.
Description: Flourish is an online tool for creating interactive visualizations. It allows for more interactivity compared to other tools but requires uploading data.
Description: Datawrapper provides a simple yet effective way to create scatter plots. Its intuitive interface makes it easy to customize visualizations.
Description: Seaborn, a Python library, offers a high-level interface for drawing attractive statistical graphics. It's particularly useful for exploring datasets with many variables.
Description: Matplotlib is a versatile plotting library for Python. It allows for fine-grained control over plot elements, making it suitable for creating publication-quality visualizations.
Name: Shine Priyan Roll Number: 21f1003384
Plotting miles per gallon (mpg) by horsepower and color encoding by the number of cylinders wherever possible.
Excel
Stata
Flourish
Python
R
Name : Purva Sharma Roll No. : 21f1006847
1. About the Data: The dataset used for this analysis contains information about various automobile models, including their miles per gallon (mpg), weight, cylinders, and other attributes. Each row represents a different car model.
2. Features Utilized:
For this analysis, the following features were used:
Weight: Represented on the x-axis. Miles Per Gallon (MPG): Represented on the y-axis. Cylinders: Represented by different colors to distinguish between different cylinder counts.
3. Tools Used:
5 visualization tools were employed to analyze and visualize the data:
Matplotlib:
Seaborn:
Plotly:
Bokeh:
Tableau:
Name: Utkarsh Gaurav Roll No: 21F1001336
Task : Out of 24 tools and libraries shared , we were supposed to use any 5 tools to represent relationship among the features from the data.
About the Data: The data is about the cars of 70s-80s model with various feature like mpg, displacement , horsepower etc offered as the new models were introduced.
Exploratory Analysis: Using pandas library , I performed exploratory anlysis to find the presence of relationship among features and plotted using matplot library . Based on analysis , I have shown relationship between Miles per Gallon and Displacement of Engine pistons using Cylinders.
Seaborn: Python library for statistical visualization with attractive themes and color palettes.
Lyra : Interactive visualization tool enabling custom visualizations through a drag-and-drop interface.
GGPlot : R package for creating complex, layered plots following a declarative approach.
Plotly : Open-source library offering interactive visualizations across multiple programming languages.
Tableau: Data visualization software facilitating interactive dashboards and reports creation from various data sources.
These tools happen proved to be very powerful for various purpose and its upto the designer story what they want to convey.
mpg
weight
MS EXCEL
GGPLOT
PLOTLY
TABLEAU
MATPLOTLIB
Name: Pratham Bhalla Roll Number: 21f1003052
X-axis : Miles per gallon Y-axis : Horsepower Legends : No. of Cylinders
The creation of visualizations involved preprocessing of the data which included removal of rows with null value in horsepower. The chart chosen is scatterplot on the following tools:
Here are the visualizations:
Name: Devansh Gandhi Roll No: 21f1002115
To identify relationship between the Weight of the car and its corresponding MPG while considering the no. of cylinders in the car.
Name: Lehani Raj Mohanta Roll No: 21F1003574
Chart type: Scatter plot Attributes: Horsepower and Acceleration
With a plethora of both commercial & free visualization tools & libraries available, it can often be confusing to pick the right tool for your requirement. Also from the learning point of view, one doesn't know which tool or set of tools should invest time & effort in learning.
In her 2016 article "What I Learned Recreating One Chart Using 24 Tools", Lisa Charlotte Rost tried out 12 data vis applications and 12 data vis libraries and programming languages and reported a comparative evaluation.
In this assignment, you will recreate the exercise with at least 5 charting tools or libraries (total 5 not 5 each) for the given dataset (auto-mpg.csv). You may create any chart type, but using at least 2 variables from the dataset. Having decided on chart type & variables, repeat the same chart using the 5 chart tools or libraries. Paste your charts as a comment to this issue. Add text to each chart identifying the tool/library you used for the chart.
Note: You can only use one from Matplotlib, seaborn, and Excel.