Solution: Exercise: Supervised vs Unsupervised

Supervised Learning Exercise: Predicting Building Energy Efficiency

In this exercise, you trained a RandomForestRegressor model to predict the energy efficiency of buildings based on features such as wall area, roof area, overall height, and glazing area.
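For reference, the training step can be sketched roughly as follows. This is a minimal illustration, not the exercise's actual notebook code: the value ranges, the formula used to generate the target, and the hyperparameters are all assumptions made for the example.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

# Synthetic building data. The ranges and the target formula below are
# illustrative assumptions, not the generator used in the exercise.
rng = np.random.default_rng(0)
n_samples = 500
X = np.column_stack([
    rng.uniform(200, 400, n_samples),  # wall area
    rng.uniform(100, 200, n_samples),  # roof area
    rng.uniform(3, 10, n_samples),     # overall height
    rng.uniform(0, 1, n_samples),      # glazing area
])
y = 0.05 * X[:, 0] - 0.03 * X[:, 1] + 2.0 * X[:, 2] + rng.normal(0, 1, n_samples)

# Hold out 20% of the rows so the model is evaluated on unseen data.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Fit the regressor and predict on the held-out rows.
model = RandomForestRegressor(n_estimators=100, random_state=42)
model.fit(X_train, y_train)
predictions = model.predict(X_test)
```

Fixing random_state makes a run repeatable; removing it reproduces the run-to-run variation you may have seen in the exercise.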

Expected Results

Data Visualizations - Scatter plots will show the relationship between each feature and the target variable (energy efficiency). Students should observe how changes in features may relate to energy efficiency, although with synthetic data, these relationships might not show clear trends.

Model Performance - After training the model and making predictions, students will evaluate the model using Mean Squared Error (MSE). With synthetic data, the MSE value may vary, but it measures the average squared difference between the model's predictions and the true values. The closer this value is to zero, the better the model's performance.
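To make the metric concrete, here is a small sketch that computes MSE by hand. The function name mean_squared_error_manual and the sample numbers are made up for illustration; in the exercise you would more likely call sklearn.metrics.mean_squared_error, which computes the same quantity.

```python
def mean_squared_error_manual(y_true, y_pred):
    """Return the average of the squared differences between two sequences."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return sum(squared_errors) / len(squared_errors)


# Hypothetical true and predicted values, chosen only for the example.
y_true = [10.0, 20.0, 30.0]
y_pred = [12.0, 20.0, 27.0]

# Squared errors are 4, 0, and 9, so the MSE is 13 / 3.
mse = mean_squared_error_manual(y_true, y_pred)
```

Because the errors are squared, MSE is expressed in squared units of the target, and a few large misses weigh more heavily than many small ones.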

Prediction vs. True Value Plot - The scatter plot comparing true values and model predictions should ideally show points along the diagonal line (y=x), indicating accurate predictions. Deviations from this line suggest prediction errors.
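A plot of this kind can be sketched with matplotlib as below. The data here is randomly generated as a stand-in for the real test targets and predictions, and the noise level is an assumption chosen only so the points scatter visibly around the diagonal.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so this also runs without a display
import matplotlib.pyplot as plt
import numpy as np

# Stand-in data: "predictions" are the true values plus random noise.
rng = np.random.default_rng(1)
y_true = rng.uniform(10, 40, 100)
y_pred = y_true + rng.normal(0, 2, 100)

fig, ax = plt.subplots(figsize=(6, 6))
ax.scatter(y_true, y_pred, alpha=0.6, label="Predictions")

# Draw the y = x reference line across the full range of the data.
lims = [min(y_true.min(), y_pred.min()), max(y_true.max(), y_pred.max())]
ax.plot(lims, lims, "r--", label="y = x")

ax.set_xlabel("True Values")
ax.set_ylabel("Predicted Values")
ax.set_title("True Values vs Predicted Values")
ax.legend()
```

In a notebook you would drop the matplotlib.use("Agg") line and finish with plt.show(). Points above the dashed line are over-predictions and points below it are under-predictions.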

Remember to delete the notebook instance.

[Figure: True Values vs Predicted Values for the supervised model]

Unsupervised Learning Exercise: Vehicle Clustering

In this exercise, you used KMeans clustering to group vehicles based on features such as weight, engine size, and horsepower.

Expected Results

Cluster Visualization - The scatter plot will visually depict how vehicles are grouped based on weight and horsepower. Each cluster will be represented by a different color. With synthetic data, the distinctness of clusters may vary, but students should be able to see groupings based on the chosen features.

Interpreting Clusters - There are no 'correct' labels in unsupervised learning, but students should observe how vehicles are grouped based on similarities in their features. They might see, for example, that heavier vehicles with higher horsepower are grouped together.
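The clustering step might look roughly like the sketch below. The three synthetic vehicle groups, their centers and spreads, and the decision to standardize the features before clustering are assumptions made for this illustration; the exercise notebook may generate and prepare its data differently.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Synthetic vehicles drawn around three made-up centers.
# Columns: weight (kg), engine size (L), horsepower.
rng = np.random.default_rng(0)
centers = np.array([
    [1100, 1.4, 90],
    [1600, 2.5, 180],
    [2300, 4.5, 320],
])
spreads = np.array([80, 0.2, 15])
X = np.vstack([rng.normal(c, spreads, size=(50, 3)) for c in centers])

# Standardize so that weight, which has the largest numeric range,
# does not dominate the distance calculation.
X_scaled = StandardScaler().fit_transform(X)

# Group the vehicles into three clusters.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=42)
labels = kmeans.fit_predict(X_scaled)

# Summarize each cluster by its mean weight and mean horsepower.
for k in range(3):
    members = X[labels == k]
    print(f"cluster {k}: mean weight {members[:, 0].mean():.0f}, "
          f"mean horsepower {members[:, 2].mean():.0f}")
```

The cluster numbers themselves are arbitrary, so compare the per-cluster summaries rather than the label values when interpreting the groups.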

In both tasks, the exact numerical results can vary based on the randomness in the synthetic data generation and the inherent variability in machine learning models. The key learning outcome is understanding the process of applying machine learning techniques and interpreting the results, rather than achieving specific numerical accuracy or clustering results.

As discussed above, this scatter plot visually depicts how vehicles are grouped based on weight and horsepower. There are three distinct groups.
