Ironhack-data-bcn-oct-2023 / project-II-pipelines

0 stars 2 forks source link

Edu Project II #10

Open edwardrodgermartinez opened 11 months ago

edwardrodgermartinez commented 11 months ago

https://github.com/edwardrodgermartinez/project_2

Estoy así

image

sh-ih commented 11 months ago

You did great on this project! Some comments:

By talking about Ronaldo’s performance and sun in the zodiac sign, you need to be very careful. Remember that correlation is not causation.

In your presentation, question 2 is “Did astrology have more or less of an effect on his most important matches for Real Madrid?”, and you can’t determine that. You can only say if his performance was better or worse when the sun was on or near his sign or not. Same with question 3.

I was also left with the question of why is the sun in the sign important? Astrologically speaking. It would be a good thing to add an explanation to your project.

On your data folder you have two files called CR7, one .zip and one .csv. It is best to have only one version of a file (the .csv on this case) or, if the files contain different information, make it clear with informative file names, so the readers can have an idea of what they’ll find inside each file.

Same thing with your image files, instead of figure1.png, figure2.png, it would be better to name the files using your graph’s titles. (“total_goals_ronaldo_real_madrid.png”, for example)

I would suggest you divide your code in several notebooks: one for scrapping, another for cleaning, other for visualizations and one for creating your report, instead of having everything on notebook_clean.ipynb

A recommendation of best code practices is that a function should complete only one task, not multiple tasks. So, on your visualization.py, it would be better if you create one function for each visualization, instead of using one function to make all of your graphs.

Maybe you just want to update one graph, but by having all the code inside one function, you can’t do it.

image