microsoft / vscode-data-wrangler

Other
443 stars 20 forks source link

Data linking? #24

Open bw-Deejee opened 1 year ago

bw-Deejee commented 1 year ago

Are there any plans on expanding the project beyond the scope of data wrangling within a single file? I'm thinking of exploring a possible merge of 2 or more files, giving various merge-types and exploring merge-keys.

jmew commented 1 year ago

Thanks for trying out Data Wrangler and the feature suggestion @bw-Deejee! Multiple files/dataset support is something our team has definitely thought about and may implement in a future update. I just wanted to get some more information to see what your ideal experience would be in Data Wrangler for working with multiple datasets/files? Are you looking for a join or merge operation that is built into Data Wrangler? And are you currently using a pandas join/merge?

MartinSher commented 1 year ago

This is something we would definitely like. Usage is to be able to have multiple dataframes (either from multiple data sources or derived from dataframes) and then be able to perform operations on each dataframe including doing mergers. Effectively similar functionality to PowerQuery

pwang347 commented 10 months ago

From @julen3

It would be really nice if there was an operation to join the table I'm working with with another one. Ideally, when selecting this operation, a window would open up asking me for the source of the other table that I want to join, as well as the columns that the join operation is going to be based on and the type of join that I want to do (left outer, left inner, etc.) It would also display the number of matching rows.

Esentially, it would use the pandas merge function, used to combine two DataFrames based on one or more common columns, and the window that pops up would look similar to the one from Power Query when selecting this kind of operation: https://learn.microsoft.com/en-us/power-query/merge-queries-overview