Open Aakriti23 opened 6 years ago
@Aakriti23 This looks like an interesting talk, sure the audience will come away having learnt something useful. Though would recommend that you vet the content/agenda with your manager, companies can be quite rigid about any possible IP violation, even with open source stack.
@Dawny33 @manojpandey Thoughts?
Hi @manasRK
I shall be speaking to my manager regarding the proposal today. Will update the status as soon as possible.
Thanks!
Nice proposal. Data extraction tasks are some of the most underrated and tricky tasks of the data world.
Would love to see the slides or your Jupyter notebook, if any
Hi!
I have to submit the proposal to my manager in order to get the approval from him. The proposal should contain the entire agenda of my talk.
This is to ensure that I am not sharing any confidential information with people. I shall be sharing the agenda with my manager today.
Could I please get back to you on the approval bit in another 2-3 days?
Yeah, sure @Aakriti23 .
Hi guys,
As discussed with my manager, I won’t be able to show you the bank letters I worked with in my office project due to confidentiality reasons. However, I can use other PDF documents such as different credit card statements and replicate the extraction part. Let me know what you guys think about this.
I think that'd work. Can you deliver this on 29th September at our upcoming meetup?
That works for me!
Thanks!
Hey! Could you please share the timings and venue of the talk?
Hey, guys.
Do we have any update here?
@MSanKeys963 / @Arsh23 ^^
We'll have our next meetup in last week of October or first week of November. I'll let you once we schedule your talk.
Hi @Aakriti23. Can you deliver this in our upcoming meetup?
Hi Sanket,
Could you please send me the venue, date and time of the talk?
Thanks, Aakriti Jain
On 12-Dec-2018, at 15:47, Sanket Verma notifications@github.com wrote:
Hi @Aakriti23. Can you deliver this in our upcoming meetup?
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.
Next meetup is scheduled for 22nd December. You can reach me @ svsanketverma@gmail.com for any questions.
Sure, this works for me.
On 12-Dec-2018, at 20:39, Sanket Verma notifications@github.com wrote:
Next meetup is scheduled for 22nd December. You can reach me @ svsanketverma@gmail.com for any questions.
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.
Please check the updated agenda here.
Please find the presentation and codes using the link below:
Abstract: PDFs are one of the widely used digital media formats and are used to present and exchange information reliably, independent of the software, hardware and operating system. Extracting data from PDF files can be a tricky task. This is because of the complicated formats, font, color, and layouts that are stored in PDF files. I shall be showing you how to extract data directly from n number of PDF files using just one Python Library.
Brief Description and Contents to be covered: This is one of the projects I finished at my workplace, thereby saving hundreds of man-hours. This project was divided into two parts:
Techniques learned: Interacting with Outlook application using Python + Regular Expression (In Depth)
Use Cases for Regular Expressions: Regex can be used extensively in tasks that require searching, pattern matching, parsing, filtering or extracting data from piles of documents/text.
Pre-requisites for the talk: Basic Python knowledge.
Time required for the talk: 30 minutes
Link to slides: NA
Will you be doing hands-on demo as well? No.
Link to ipython notebook (if any): NA
About yourself: I am a Mathematics graduate currently working in the Robotics Team at S&P Global: Market Intelligence. I am a Data Science enthusiast and I employ my core theoretical knowledge concatenated with ML algorithms into my projects at work.
Are you comfortable if the talk is recorded and uploaded to PyData Delhi's YouTube channel? Yes
Any query? No