intel / dffml

The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.
https://intel.github.io/dffml/main/
MIT License

examples: Comparison demo #1307

Open pdxjohnny opened 2 years ago

pdxjohnny commented 2 years ago

Project Description

This idea came from this web page: https://www.alaska.org/how-big-is-alaska/oregon

The page compares several characteristics of Alaska with those of other states in the United States of America.

The following excerpt is reproduced from that page.

Population Density

Oregon has 37 times more people per square mile. If Oregon's population density were the same as Alaska’s, only 97,246 people would live in Oregon rather than 3,559,596 (based on 2003 data).

High Point

Alaska’s high point, Mt. Denali (20,320 feet), rises 1.7 miles above Oregon’s high point, Mount Hood, at 11,239 feet. Not only is Mt. Denali the highest mountain in North America, but Alaska has 15 other peaks higher than any in the continental U.S.

The goal here is to build a dataflow that will allow users to compare any two arbitrary things.

Get the two things to compare.
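As a rough illustration of the comparison step, the arithmetic behind the excerpt above can be sketched in plain Python. This is not DFFML code: the `Comparison` class and `compare` helper are hypothetical names made up for this sketch, and the figures are the ones quoted in the excerpt.

```python
from dataclasses import dataclass

FEET_PER_MILE = 5280


@dataclass
class Comparison:
    """Hypothetical container for one characteristic of two things."""

    characteristic: str
    first: float
    second: float

    @property
    def ratio(self) -> float:
        # How many times larger the first value is than the second
        return self.first / self.second

    @property
    def difference(self) -> float:
        # Absolute gap between the two values
        return self.first - self.second


def compare(characteristic: str, first: float, second: float) -> Comparison:
    """Pair up one characteristic of the two things being compared."""
    return Comparison(characteristic, first, second)


# Figures quoted in the excerpt from alaska.org
high_point = compare("high point (feet)", 20320, 11239)
population = compare(
    "Oregon population: actual vs. at Alaska's density", 3559596, 97246
)

# Height gap in miles, and the implied population density ratio
print(round(high_point.difference / FEET_PER_MILE, 1))
print(round(population.ratio))
```

In a dataflow, each of these calculations would presumably become its own operation, with the lookup of the raw values handled by separate operations upstream.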

Skills

Difficulty

Beginner/Intermediate

Related Readings

Getting Started

Potential Mentors

Tracking and Discussion

This project is related to the following issues. Please discuss and ask questions in the issue comments. Please also ping mentors on Gitter when you post on the following issues so that they are sure to see that you've commented.

Misc

Demo idea: make a page like https://www.alaska.org/how-big-is-alaska/oregon where users set the two things to compare and a dataflow does the lookups, maybe using NLP.

The demo should be based on dataflows so that it can be easily extended to domains other than states.
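One way to picture the extensibility requirement is to keep the lookup step separate from the comparison step, so that a new domain only needs a new lookup. The sketch below is plain Python, not the DFFML API; `STATE_FACTS`, `lookup_state`, and `compare_things` are hypothetical names, and the elevations come from the excerpt quoted earlier in this issue.

```python
from typing import Callable, Dict

# A lookup takes the name of a thing and returns its known characteristics
Lookup = Callable[[str], Dict[str, float]]

# Hypothetical static data source; a real demo might query a web page or API
STATE_FACTS: Dict[str, Dict[str, float]] = {
    "Alaska": {"high point (feet)": 20320},
    "Oregon": {"high point (feet)": 11239},
}


def lookup_state(name: str) -> Dict[str, float]:
    """Domain-specific step: fetch the characteristics of one state."""
    return STATE_FACTS[name]


def compare_things(first: str, second: str, lookup: Lookup) -> Dict[str, float]:
    """Domain-independent step: difference of every shared characteristic."""
    a, b = lookup(first), lookup(second)
    shared = sorted(a.keys() & b.keys())
    return {key: a[key] - b[key] for key in shared}


result = compare_things("Alaska", "Oregon", lookup_state)
print(result)
```

Swapping `lookup_state` for a lookup over another domain (cities, laptops, programming languages) leaves `compare_things` unchanged, which is the property the dataflow-based design is meant to provide.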

Onkar627 commented 2 years ago

I am ready to work on this task. Can you guide me on where to start?