DAGWorks-Inc / hamilton

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
https://hamilton.dagworks.io/en/latest/
BSD 3-Clause Clear License
1.8k stars 119 forks source link

Documentation: update references to external blogs on Hamilton #1046

Open skrawcz opened 3 months ago

skrawcz commented 3 months ago

Is your feature request related to a problem? Please describe. We need to update https://hamilton.dagworks.io/en/latest/get-started/learning-resources/#external-blogs.

Describe the solution you'd like We need to provide updates for blogs on Hamilton.

Some that come to mind:

Describe alternatives you've considered N/A

Additional context It would be great to semi-automate this. E.g. make updating this simpler to do, by running a command for example.

sikehish commented 2 days ago

I'd like to take this up!

skrawcz commented 2 days ago

I'd like to take this up!

Great thank you. Assigned. I believe https://github.com/DAGWorks-Inc/hamilton/blob/main/docs/get-started/learning-resources.md is what you want to modify. Do you have an understanding of where and what to change?

sikehish commented 2 days ago

If I've understood the issue correctly, we want to have the latest blogs in the external blogs section. Is that correct? If that's the case, then I was thinking if I could make use of a substack API(if it exists)/scrape https://substack.com/@dagworks/posts for the posts and then update the posts in external blog sections in the markdown file.

skrawcz commented 1 day ago

If I've understood the issue correctly, we want to have the latest blogs in the external blogs section. Is that correct? If that's the case, then I was thinking if I could make use of a substack API(if it exists)/scrape https://substack.com/@dagworks/posts for the posts and then update the posts in external blog sections in the markdown file.

yep pretty much. You'd need to filter to posts that are about hamilton. Otherwise there's a few other posts that are linked in the issue that should also be added.

sikehish commented 1 day ago

If I've understood the issue correctly, we want to have the latest blogs in the external blogs section. Is that correct? If that's the case, then I was thinking if I could make use of a substack API(if it exists)/scrape https://substack.com/@dagworks/posts for the posts and then update the posts in external blog sections in the markdown file.

yep pretty much. You'd need to filter to posts that are about hamilton. Otherwise there's a few other posts that are linked in the issue that should also be added.

Okay. So I guess I need to hardcode the other 4 posts. Is that right? And how many posts from substack should be added in learning-resources.md(programatically, through script, that is)?

skrawcz commented 1 day ago

@sikehish yes that's fine.

I think using a date cut off is fine. Since that's how we'd update/run it. Pull posts older than X and add to the docs please :)