NobleKennamer / astro_porject

Final Project for CS 273

Report is up #16

Open NobleKennamer opened 8 years ago

NobleKennamer commented 8 years ago

@abhisaarsharma @arielkruger1989 @dcockerham

Hey everyone the report is up. I wrote the intro, results and analysis, and conclusion.

At the end of the report there are 3 sections: 1) What we did 2) What we used (libraries, papers, etc.) 3) Our personal experience with the project

If everyone could fill in those sections directly underneath their name, that would be great. Getting that part done by tomorrow would be best, so that we can finalize the rest of the report after that.

Also, I've never worked with Word docs on git before, so just to be safe it might be worthwhile to make a copy of the document and put your changes in red. I can then go and paste them in.

Just make an issue whenever you change something and I'll go change it.

Right now the report is almost 20 pages, so we are meeting the minimum page count of 7 pretty easily.

dcockerham commented 8 years ago

I found some typos. I'd edit it myself, but I'm not confident in my word processor not to screw up the formatting of a docx file, so I'll just leave this here for now.

"Thus it is extremely important to develop techniques to process and analyze al this data." - in the Problem Statement section. al should be all

"This huge difference between the number of positive example (RR Lyrae stars) and negative examples makes this a vey challenging problem." - in the Problem Statement section. vey should be very

Also, unless this is a problem with my word processor, it looks like there's been a copy/paste error at the bottom of the first page: "The LSST will actually be doing only Experience coding it up (Everybody write about your experience) photometry on tens of billions of objects bringing us close to targeting every galaxy in the observable universe." That bit about coding experience seems to have been accidentally plopped down right in the middle of a sentence.

NobleKennamer commented 8 years ago

@dcockerham

Good eye. The corrections were made.

dcockerham commented 8 years ago

Found some more: in Table 2, "Bossted" DT should be Boosted. "since our computation is happeneing in the dual space." - in Experience section, should be happening. Actually, this happens twice... looks like a bit of text got pasted from Abhisaar's section to Noble's? I'm guessing that was a github merge error.

dcockerham commented 8 years ago

Another one: in the Who Did What, Abhisaar's section, last point, Contibuted should be Contributed, and second to last point, differernt should be different.

I'm just gonna go ahead and assign myself official team proofreader.

NobleKennamer commented 8 years ago

Good catch

arielkruger1989 commented 8 years ago

Here is my part for Who Did What: I implemented a Decision Tree for classification. I used entropy to calculate the information gain of the features and, from that, decide which feature was best to split on when building the tree. Since the features of our data are real-valued, I used the pandas library to discretize the data and to control the maximum depth of the tree. I also tested different parameters in order to optimize and get the best models for Random Forests and Boosted Decision Trees.
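For anyone reading along, the entropy-based feature selection described above can be sketched roughly as follows. This is only an illustration, not the project's actual code: the function names (`entropy`, `information_gain`, `discretize`, `best_feature`) are hypothetical, and the equal-width binning here is a plain-Python stand-in for whatever pandas-based discretization was actually used.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(feature_values, labels):
    """Entropy reduction obtained by splitting on a discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted average entropy of the subsets produced by the split.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def discretize(values, n_bins):
    """Equal-width binning of real values into bin indices 0..n_bins-1.

    Assumption: a simple stand-in for a pandas-based discretization.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / n_bins
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def best_feature(columns, labels):
    """Name of the (already discretized) feature with the highest gain."""
    return max(columns, key=lambda name: information_gain(columns[name], labels))
```

A tree builder would call `best_feature` at each node, split the rows by that feature's bin, and recurse until it reaches the maximum depth or a node contains only one class.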

Experience: I appreciated working with different models and comparing the results generated by each of them. Also, implementing a model is always fun and gives many insights into how things work internally. Using the scikit-learn library was awesome. It is very easy to use and has a standard interface for all the different models, which makes it easy to apply them to our dataset. That is very useful when deciding which model will work best for a specific dataset.

I don't have anything to add in the What We Used section.