john-science / python_for_scientists

Python Open Courseware for Scientists and Engineers
GNU General Public License v3.0
68 stars 40 forks source link

scikit-learn lecture #69

Closed john-science closed 5 years ago

john-science commented 5 years ago

This seems like a big library. But okay, let's learn it.

What is it?

The scikit-learn main page says the it deals with three kinds of learning:

The API

Looking at the api, it has 19 algorithms modules and five modules that look like data input stuff:

Algorithms and Learning Techniques

Data Tools

How do we learn this?

Learn by doing.

The website is full of wonderful practical examples, with data sets and everything. Pick an example from each of the Algorithm / Techniques modules above, and do them in turn.

As time goes on, you will see how data is input, how to clean data, and how to use the tools in the library. And once you see enough parallels of that process, assuming the library is well designed, you should just be able to grab the library to solve any problem. Assuming you understand the underlying algorithm. And you think you have a problem that is susceptible to a standard / canned approach.

john-science commented 5 years ago

This lecture would be a big undertaking. The scikit-learn library is just a grab bag of various learning algorithms and tools.

john-science commented 5 years ago

I'm not sure this library is learnable in a half-hour class. It doesn't have enough structure or organization. It's just a hodge-podge pile of algorithms, without a high enough level API to really "cover" the whole library in one short explanation.

I will probably cover this library, but not in the format of this class.