datahq / dataflows

DataFlows is a simple, intuitive lightweight framework for building data processing flows in python.
https://dataflows.org
MIT License
193 stars 39 forks source link

The `join` processor documentation ideas #136

Open roll opened 4 years ago

roll commented 4 years ago

Hi, there are some ideas from @adyork regarding the join processor docs:


I have some suggestions for clarifying if you think it would be helpful. I tried to keep to the source/target model language.

Essentially what I am trying to clarify is when there are key mismatches, whether the rows are dropped or added to the result with null values in fields that originated from either the source or target.

Also, it really matters when translating the LEFT/RIGHT join model to the target/source model to clarify which is considered the "target." Essentially, you can flip flop what you can consider a left or right join if you switch which resource is the source or target.

Also, if people seem to be still confused about joins and modes, what I have found useful when explaining this to people is to show them these types of circle diagrams. You could adapt these join-type circle diagrams from the SQL model to use the source/target model terms by replacing the table1&2 with source&target. image

join

https://www.w3schools.com/sql/sql_join.asp