Open HangXXXu opened 1 day ago
[9/30]
to [10/7]
Proposed future steps for our project as: 1.Extract geometry information from OSM file and merge them to existing energy demand dataset (we are here) 2.Based on building geometry, identify ‘uncommon’ feature for our own model, e.g footpring shape, shading 3.Develop algorithm to recognize and classify footprint shape 4.Study about how to encode such non-numeric variable to regression model 5.Model training 6.Validate our surrogate model by comparison with parametric simulation
Difficult to read and merge the datasets; Key finding is that single OSM file can be merged with metadata via ‘bldg_id’; Decided x-variable (columns) of the datasets.
For team management: Customize and assign tasks to team members; Make plan for the whole project.
For practical skills: Python coding to automatically scraper data from website
Regular meeting on Tuesday and Friday
Think more about how to organize the team and communicate efficiently.
Keep working on datasets.
[10/7]
to [10/14]
Proposed our research workflow as:
How to calculate numerical building features? X1: Building Area = largest RoofCeiling surface X2: Height = Z coordinate of the largest RoofCeiling surface X3: Wall Area = Height * Perimeter of largest RoofCeiling surface X4: WWR = Fixedwindow Area/(Wall Area-Fixedwindow Area) (Anu & Je) X5: Orientation = OS:Building - North Axis (Jo) X6: Building Type = Office, Restaurant, Hotel … X7: Footprint Shape = ? (Benj & Jo) Key is to rebuild building geometry in OSM files to calculate numerical building features.
For team management: Customize and assign tasks to team members; Make plan for the whole project.
For practical skills: Python coding to multiply rebuild building geometry; Python coding to automatically scraper data from website.
Regular meeting on Tuesday and Friday
Even though most of feature is available, remain uncertainty about building footprint classification
Keep working on datasets; OSM gives definition of building shape as ‘1,2,3...’, need to check it.
[10/14]
to [10/21]
Make plans and divide the work for GURC: Anubha - Complete Training data Joseph - Overall concept; Frontend / UI Jessica - Backend / Frontend (from Figma) Hang - Train ML model Han-Syun - Frontend (1st iteration on Figma) Kiana - Overall concept / general help Sharmista - Frontend (1st iteration on Figma) Jiayi - Slide deck and storyline
Design the prototype of web interface:
For team management: Customize and assign tasks to team members. Make plan for the whole project.
For practical skills: Python coding to successfully extract all OSM file for Georgia region (21.5GB)
Regular meeting on Tuesday and Friday
Every simulation geometry in OSM file is rectangle-based, even though they are labeled with different kind of shape information. I suppose that NREL simplify it for simulation process.
Complete the datasets.
[10/21]
to [10/28]
Proposed tasks for teams: 1.Default com building templete for comparison (Kiana, Jiayi) 2.Front-end development (Anubha, Jessica) 3.Geometry previewer (Joseph, Benj, Anubha, Jessica, Jiayi, Kiana, Sharmista) 4.ML model training (Hang, Shivam)
What is a comprehensive and professional workflow for ML training? Self-learning, read paper and watch tutorial.
For team management: Customize and assign tasks to team members. Make plan for the whole project.
For practical skills: Python coding to preprocess data for ML models 1.Read datasets 2.Clean the data (null and outliers) 3.Deal with highly correlated features to avoid multicollinearity 4.Encode non-numerical columns (one-hot, label, target) 5.Check the distribution of the numerical features and make logarithmic transformations if necessary
Data analysis result:
Regular meeting on Tuesday and Friday.
Need to read more paper about ML model for energy prediction.
Train the ML models.
[10/28]
to [11/4]
Proposed ML- train workflow:
Personally involved in: 1.Machine learning model training and comparison 2.Web application
How to find the best ML for our datasets and implement it to web application? After carefully paper review, I find that GB and tree-based model may perform better than MLR, KNN, SVR, Random forest models from last semester
For practical skills: Python coding (scikit-learn) to train ML models: 1.Based on datasets pre-processed last week, use computational software package to run MLR,SVR,KNN,XGB,RF,ET,LGB,GB,DT models for heating and cooling energy prediction 2.Select out GB as the best model, further use Bayesian optimizer to finetune model hyper-parameter for better R-square on training datasets.
Python coding to deploy ML model in Web interface: 1.Export encoder and model as (.pkl) and (.sav) 2.Modify deployment code in web app
Regular meeting on Tuesday and Friday.
Every point is covered for the conference.
Attend the conference.
[11/4]
to [11/11]
Enjoy the conference presentation; Take a break.
[11/11]
to [11/18]
To complete last section of methodology proposed in Week 10.7 to 10.14, we have the following validation workflow:
Proposed tasks for teams: For architects (5) 1.run simulation 10 times (2 times for one person) 2.compare simulation results with ML prediction results 3.refer to simulation settings in OSM files (all footprint needs to be rectangle)
For computer scientist (3) 1.check the innovation, is there any existing ML-based energy prediction plugin for Rhino/Grasshopper (cove.tool) 2.is there any more wise way to build parametric model (Sampling → Modeling) 3.how to implement ML model in grasshopper
Prepare final documents for the VIP course.
Weekly Notebook Entry — Week [03]
Overview
[9/23]
to[9/30]
Tasks for This Week
Proposed tasks for teams: 1.Rebuild an urban-scale model for energy simulation (Tech Square as a case) 2.Study about Google TensorFlow Smart Buildings Simulator 3.Feature study for ML models
Challenges & Solutions
1.Review existing paper related to ML-based energy simulation. 2.Find search gap for our project. 3.Identify ‘Building Orientation’ and ‘Footprint Shape’ as special features
General Takeaway, Skills & Knowledge Acquired
For team management: Customize and assign tasks to team members; Make plan for the whole project.
Team Meetings
Regular meeting on Tuesday and Friday
Personal Reflection
Think more about how to organize the team and communicate efficiently.
Plans for Next Week
Need to study know to extract feature information for our use.