pydelhi / talks

Talks at Python Delhi User Group!
https://pydelhi.org/talks/
68 stars 52 forks source link

Bridging Data and Code: A Software Engineer's Perspective on Data Pipeline Setup #248

Closed harshitkohli1997 closed 1 year ago

harshitkohli1997 commented 1 year ago

Title

Bridging Data and Code: A Software Engineer's Perspective on Data Pipeline Setup

Describe your Talk

From the perspective of a software engineer, I embarked on a remarkable journey within my organization as I transitioned into the role of a data engineer. This transition was driven by my passion for building sophisticated analytics ecosystems and alleviating the burden of CPU load from our production database. Armed with my software engineering skills, I embraced the challenges of data engineering and set out to establish robust data pipelines. Using my expertise in programming, particularly in languages like Python, I designed and implemented efficient data extraction, transformation, and loading processes. By doing so, I not only relieved the strain on our production database but also ensured that data flowed seamlessly through our analytics ecosystem. I employed various technologies and frameworks to optimize data storage, retrieval, and processing, thus enabling our organization to derive valuable insights from our vast data repositories. This transition not only allowed me to leverage my software engineering experience but also empowered me to contribute significantly to our organization's data-driven decision-making processes.

Pre-requisites & reading material

From the perspective of a software engineer, I embarked on a remarkable journey within my organization as I transitioned into the role of a data engineer. This transition was driven by my passion for building sophisticated analytics ecosystems and alleviating the burden of CPU load from our production database. Armed with my software engineering skills, I embraced the challenges of data engineering and set out to establish robust data pipelines. Using my expertise in programming, particularly in languages like Python, I designed and implemented efficient data extraction, transformation, and loading processes. By doing so, I not only relieved the strain on our production database but also ensured that data flowed seamlessly through our analytics ecosystem. I employed various technologies and frameworks to optimize data storage, retrieval, and processing, thus enabling our organization to derive valuable insights from our vast data repositories. This transition not only allowed me to leverage my software engineering experience but also empowered me to contribute significantly to our organization's data-driven decision-making processes.

Introduction: (5 mins)

Unlocking Data Insights: How Cloud Storage Empowers Data Lakes: (5 mins)

Setting up data pipeline on production: (12 mins)

Impact of setting up a data-lake:(3 mins)

conclusion and QNA (5mins)

Time required for the talk

30 mins

Link to slides/demos

No response

About you

Harshit Kohli is 25 year old software engineer currently working at Milkbasket(it is India’s first and largest daily micro-delivery service). also with a background working with industry giants like Blinkit and Classplus. Self studying Data Engineering

Availability

22/07/2022 or any other day

Any comments

No response

pulsar17 commented 1 year ago

Hi @harshitkohli1997 , a few questions:

  1. Are you part of the Telegram group? (If no, please share your username. If yes, please share your username)
  2. Is there a particular time slot you have in your mind? (The meetup timings are 1-5 pm generally)
Animesh-Ghosh commented 1 year ago

@harshitkohli1997 hey, just pinging since we wanted to finalize the talks.

harshitkohli1997 commented 1 year ago

HI yes i am willing to speak. any time after 2 works for me. No, i'm not part of the telegram group username: harsh_it25