StatCan / aaw

Documentation for the Advanced Analytics Workspace Platform
https://statcan.github.io/aaw/

Research useful tools for data exploration. #1890

Closed bryanpaget closed 1 year ago

bryanpaget commented 1 year ago

While on vacation I read about some interesting tools for data analysis. I'd like to write up a proposal for these tools so we can discuss whether to add them to the AAW. If we decide not to add or support them, I can still write up how to install and use them on the AAW.

For instance, we don't even need to preinstall anything on the platform; our documentation can simply include something along the lines of:

Install PyGWalker

mamba install -c conda-forge pygwalker ydata-profiling ipywidgets

Run PyGWalker

import pandas as pd
import pygwalker as pyg
from ydata_profiling.utils.cache import cache_file

# Download the sample dataset once; later runs reuse the cached copy.
file_name = cache_file(
    "pokemon.csv",
    "https://raw.githubusercontent.com/bryanpaget/html/main/pokemon.csv"
)

# Load the CSV into a DataFrame and open the interactive PyGWalker explorer.
pokemon_df = pd.read_csv(file_name)
pyg.walk(pokemon_df)

And that is all that is required to get PyGWalker running on the AAW.
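
If the import step fails, it may help to confirm that the install actually landed in the kernel the notebook is using. The following is only a minimal sketch, not part of the proposal itself: the helper name `missing_packages` and the `REQUIRED` list are hypothetical, and it assumes the three conda packages from the install command are imported as `pygwalker`, `ydata_profiling`, and `ipywidgets`.

```python
from importlib.util import find_spec

# Import names assumed for the conda packages installed above
# (the ydata-profiling package is imported as ydata_profiling).
REQUIRED = ["pygwalker", "ydata_profiling", "ipywidgets"]


def missing_packages(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Not found in this kernel:", ", ".join(missing))
    else:
        print("All required packages were found.")
```

An empty result only means the modules are discoverable; it does not import them or check their versions.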

Tools

PyGWalker

More information on PyGWalker.

bryanpaget commented 1 year ago

I'm working on a Gist here: https://gist.github.com/bryanpaget/4a4503431ada221c4a12407681f13efe

bryanpaget commented 1 year ago

The work for this task can be found here:

PR:

I worked on 7 of the 11 tools.